Human methylome studies SRP332046 Track Settings
 
The WGBS and ATAC-seq from human serum and amniotic cell [Blood]

Track collection: Human methylome studies

+  All tracks in this collection (426)

Maximum display mode:       Reset to defaults   
Select views (Help):
PMD       CpG methylation ▾       CpG reads ▾       AMR       HMR      
Select subtracks by views and experiment:
 All views PMD  CpG methylation  CpG reads  AMR  HMR 
experiment
SRX11715542 
SRX11715543 
SRX11715544 
SRX11715545 
SRX11715546 
SRX11715547 
SRX11715548 
SRX11715549 
SRX11715550 
List subtracks: only selected/visible    all    ()
  experiment↓1 views↓2   Track Name↓3  
hide
 SRX11715542  HMR  Blood / SRX11715542 (HMR)   Data format 
hide
 Configure
 SRX11715542  CpG methylation  Blood / SRX11715542 (CpG methylation)   Data format 
hide
 SRX11715543  HMR  Blood / SRX11715543 (HMR)   Data format 
hide
 Configure
 SRX11715543  CpG methylation  Blood / SRX11715543 (CpG methylation)   Data format 
hide
 SRX11715544  HMR  Blood / SRX11715544 (HMR)   Data format 
hide
 Configure
 SRX11715544  CpG methylation  Blood / SRX11715544 (CpG methylation)   Data format 
hide
 SRX11715545  HMR  Blood / SRX11715545 (HMR)   Data format 
hide
 Configure
 SRX11715545  CpG methylation  Blood / SRX11715545 (CpG methylation)   Data format 
hide
 SRX11715546  HMR  Blood / SRX11715546 (HMR)   Data format 
hide
 Configure
 SRX11715546  CpG methylation  Blood / SRX11715546 (CpG methylation)   Data format 
hide
 SRX11715547  HMR  Blood / SRX11715547 (HMR)   Data format 
hide
 Configure
 SRX11715547  CpG methylation  Blood / SRX11715547 (CpG methylation)   Data format 
hide
 SRX11715548  HMR  Blood / SRX11715548 (HMR)   Data format 
hide
 Configure
 SRX11715548  CpG methylation  Blood / SRX11715548 (CpG methylation)   Data format 
hide
 SRX11715549  HMR  Blood / SRX11715549 (HMR)   Data format 
hide
 Configure
 SRX11715549  CpG methylation  Blood / SRX11715549 (CpG methylation)   Data format 
hide
 SRX11715550  HMR  Blood / SRX11715550 (HMR)   Data format 
hide
 Configure
 SRX11715550  CpG methylation  Blood / SRX11715550 (CpG methylation)   Data format 
    
Assembly: Human Dec. 2013 (GRCh38/hg38)

Study title: The WGBS and ATAC-seq from human serum and amniotic cell
SRA: SRP332046
GEO: GSE181854
Pubmed: 37051325

Experiment Label Methylation Coverage HMRs HMR size AMRs AMR size PMDs PMD size Conversion Title
SRX11715542 Blood 0.814 17.8 57905 1042.8 1650 922.4 3130 12630.7 0.985 GSM5512941: KS_1; Homo sapiens; Bisulfite-Seq
SRX11715543 Blood 0.805 19.5 60460 1020.8 1735 945.8 3331 11148.0 0.981 GSM5512942: KS_2; Homo sapiens; Bisulfite-Seq
SRX11715544 Blood 0.788 17.3 50242 1085.3 2001 925.3 3059 10363.4 0.984 GSM5512943: KS_3; Homo sapiens; Bisulfite-Seq
SRX11715545 Blood 0.811 19.8 59211 1046.7 2042 947.7 3589 11817.5 0.981 GSM5512944: NM_1; Homo sapiens; Bisulfite-Seq
SRX11715546 Blood 0.811 19.3 70559 973.8 1164 1046.1 3674 13108.1 0.982 GSM5512945: NM_2; Homo sapiens; Bisulfite-Seq
SRX11715547 Blood 0.803 19.8 63417 1020.5 1518 955.3 3394 12684.4 0.981 GSM5512946: NM_3; Homo sapiens; Bisulfite-Seq
SRX11715548 Blood 0.803 17.7 61829 1017.5 897 819.3 3697 11174.2 0.982 GSM5512947: NF_1; Homo sapiens; Bisulfite-Seq
SRX11715549 Blood 0.812 19.6 63998 1009.5 956 920.8 3495 11638.7 0.982 GSM5512948: NF_2; Homo sapiens; Bisulfite-Seq
SRX11715550 Blood 0.811 17.9 61886 1009.4 1014 860.0 3486 9780.0 0.982 GSM5512949: NF_3; Homo sapiens; Bisulfite-Seq

Methods

All analysis was done using a bisulfite sequnecing data analysis pipeline DNMTools developed in the Smith lab at USC.

Mapping reads from bisulfite sequencing: Bisulfite treated reads are mapped to the genomes with the abismal program. Input reads are filtered by their quality, and adapter sequences in the 3' end of reads are trimmed. This is done with cutadapt. Uniquely mapped reads with mismatches/indels below given threshold are retained. For pair-end reads, if the two mates overlap, the overlapping part of the mate with lower quality is discarded. After mapping, we use the format command in dnmtools to merge mates for paired-end reads. We use the dnmtools uniq command to randomly select one from multiple reads mapped exactly to the same location. Without random oligos as UMIs, this is our best indication of PCR duplicates.

Estimating methylation levels: After reads are mapped and filtered, the dnmtools counts command is used to obtain read coverage and estimate methylation levels at individual cytosine sites. We count the number of methylated reads (those containing a C) and the number of unmethylated reads (those containing a T) at each nucleotide in a mapped read that corresponds to a cytosine in the reference genome. The methylation level of that cytosine is estimated as the ratio of methylated to total reads covering that cytosine. For cytosines in the symmetric CpG sequence context, reads from the both strands are collapsed to give a single estimate. Very rarely do the levels differ between strands (typically only if there has been a substitution, as in a somatic mutation), and this approach gives a better estimate.

Bisulfite conversion rate: The bisulfite conversion rate for an experiment is estimated with the dnmtools bsrate command, which computes the fraction of successfully converted nucleotides in reads (those read out as Ts) among all nucleotides in the reads mapped that map over cytosines in the reference genome. This is done either using a spike-in (e.g., lambda), the mitochondrial DNA, or the nuclear genome. In the latter case, only non-CpG sites are used. While this latter approach can be impacted by non-CpG cytosine methylation, in practice it never amounts to much.

Identifying hypomethylated regions (HMRs): In most mammalian cells, the majority of the genome has high methylation, and regions of low methylation are typically the interesting features. (This seems to be true for essentially all healthy differentiated cell types, but not cells of very early embryogenesis, various germ cells and precursors, and placental lineage cells.) These are valleys of low methylation are called hypomethylated regions (HMR) for historical reasons. To identify the HMRs, we use the dnmtools hmr command, which uses a statistical model that accounts for both the methylation level fluctations and the varying amounts of data available at each CpG site.

Partially methylated domains: Partially methylated domains are large genomic regions showing partial methylation observed in immortalized cell lines and cancerous cells. The pmd program is used to identify PMDs.

Allele-specific methylation: Allele-Specific methylated regions refers to regions where the parental allele is differentially methylated compared to the maternal allele. The program allelic is used to compute allele-specific methylation score can be computed for each CpG site by testing the linkage between methylation status of adjacent reads, and the program amrfinder is used to identify regions with allele-specific methylation.

For more detailed description of the methods of each step, please refer to the DNMTools documentation.