Human methylome studies SRP022041 Track Settings
 
Buccals are likely to be a more informative surrogate tissue than blood for epigenome-wide association studies [Bisulfite-Seq] [Normal Buccal Cells]

Track collection: Human methylome studies

+  All tracks in this collection (424)

Maximum display mode:       Reset to defaults   
Select views (Help):
CpG reads ▾       PMD       AMR       HMR       CpG methylation ▾      
Select subtracks by views and experiment:
 All views CpG reads  PMD  AMR  HMR  CpG methylation 
experiment
SRX273392 
SRX273393 
SRX273397 
SRX273402 
SRX273403 
SRX273405 
SRX273407 
SRX273411 
SRX273412 
SRX273413 
SRX273415 
List subtracks: only selected/visible    all    ()
  experiment↓1 views↓2   Track Name↓3  
hide
 SRX273392  HMR  Normal Buccal Cells / SRX273392 (HMR)   Data format 
hide
 Configure
 SRX273392  CpG methylation  Normal Buccal Cells / SRX273392 (CpG methylation)   Data format 
hide
 SRX273393  HMR  Normal Buccal Cells / SRX273393 (HMR)   Data format 
hide
 Configure
 SRX273393  CpG methylation  Normal Buccal Cells / SRX273393 (CpG methylation)   Data format 
hide
 SRX273397  HMR  Normal Buccal Cells / SRX273397 (HMR)   Data format 
hide
 Configure
 SRX273397  CpG methylation  Normal Buccal Cells / SRX273397 (CpG methylation)   Data format 
hide
 SRX273402  HMR  Normal Buccal Cells / SRX273402 (HMR)   Data format 
hide
 Configure
 SRX273402  CpG methylation  Normal Buccal Cells / SRX273402 (CpG methylation)   Data format 
hide
 SRX273403  HMR  Normal Buccal Cells / SRX273403 (HMR)   Data format 
hide
 Configure
 SRX273403  CpG methylation  Normal Buccal Cells / SRX273403 (CpG methylation)   Data format 
hide
 SRX273405  HMR  Normal Buccal Cells / SRX273405 (HMR)   Data format 
hide
 Configure
 SRX273405  CpG methylation  Normal Buccal Cells / SRX273405 (CpG methylation)   Data format 
hide
 SRX273407  HMR  Normal Buccal Cells / SRX273407 (HMR)   Data format 
hide
 Configure
 SRX273407  CpG methylation  Normal Buccal Cells / SRX273407 (CpG methylation)   Data format 
hide
 SRX273411  HMR  Normal Buccal Cells / SRX273411 (HMR)   Data format 
hide
 Configure
 SRX273411  CpG methylation  Normal Buccal Cells / SRX273411 (CpG methylation)   Data format 
hide
 SRX273412  HMR  Normal Buccal Cells / SRX273412 (HMR)   Data format 
hide
 Configure
 SRX273412  CpG methylation  Normal Buccal Cells / SRX273412 (CpG methylation)   Data format 
hide
 SRX273413  HMR  Normal Buccal Cells / SRX273413 (HMR)   Data format 
hide
 Configure
 SRX273413  CpG methylation  Normal Buccal Cells / SRX273413 (CpG methylation)   Data format 
hide
 SRX273415  HMR  Normal Buccal Cells / SRX273415 (HMR)   Data format 
hide
 Configure
 SRX273415  CpG methylation  Normal Buccal Cells / SRX273415 (CpG methylation)   Data format 
    
Assembly: Human Dec. 2013 (GRCh38/hg38)

Study title: Buccals are likely to be a more informative surrogate tissue than blood for epigenome-wide association studies [Bisulfite-Seq]
SRA: SRP022041
GEO: GSE46572
Pubmed: 23538714

Experiment Label Methylation Coverage HMRs HMR size AMRs AMR size PMDs PMD size Conversion Title
SRX273392 Normal Buccal Cells 0.491 4.1 39582 1330.4 388 971.2 174 52102.3 0.979 GSM1132625: BSSeq_Buccal_rep_1_5; Homo sapiens; Bisulfite-Seq
SRX273393 Normal Buccal Cells 0.649 5.1 47963 1439.3 320 1002.7 861 29482.5 0.996 GSM1132626: BSSeq_Buccal_rep_1_6; Homo sapiens; Bisulfite-Seq
SRX273397 Normal Buccal Cells 0.652 4.8 39434 1391.4 772 976.8 789 32667.8 0.995 GSM1132630: BSSeq_Buccal_rep_1_10; Homo sapiens; Bisulfite-Seq
SRX273402 Normal Buccal Cells 0.618 3.4 39675 1668.9 243 993.4 558 46522.0 0.991 GSM1132635: BSSeq_Buccal_rep_2_1; Homo sapiens; Bisulfite-Seq
SRX273403 Normal Buccal Cells 0.588 5.5 51851 1394.2 720 1067.1 683 27966.3 0.995 GSM1132636: BSSeq_Buccal_rep_2_2; Homo sapiens; Bisulfite-Seq
SRX273405 Normal Buccal Cells 0.587 3.3 35594 1958.5 594 1119.5 547 44843.4 0.994 GSM1132638: BSSeq_Buccal_rep_2_4; Homo sapiens; Bisulfite-Seq
SRX273407 Normal Buccal Cells 0.637 5.3 49422 1411.6 358 1030.2 815 30301.1 0.996 GSM1132640: BSSeq_Buccal_rep_2_6; Homo sapiens; Bisulfite-Seq
SRX273411 Normal Buccal Cells 0.657 4.8 39025 1392.6 805 942.0 812 31572.9 0.995 GSM1132644: BSSeq_Buccal_rep_2_10; Homo sapiens; Bisulfite-Seq
SRX273412 Normal Buccal Cells 0.675 1.9 29444 2010.0 64 913.1 353 57093.3 0.996 GSM1132645: BSSeq_Buccal_rep_2_11; Homo sapiens; Bisulfite-Seq
SRX273413 Normal Buccal Cells 0.665 6.3 37288 1278.0 2234 934.6 664 29012.7 0.996 GSM1132646: BSSeq_Buccal_rep_2_12; Homo sapiens; Bisulfite-Seq
SRX273415 Normal Buccal Cells 0.677 3.5 34036 1602.7 259 909.7 569 33644.7 0.996 GSM1132648: BSSeq_Buccal_rep_2_14; Homo sapiens; Bisulfite-Seq

Methods

All analysis was done using a bisulfite sequnecing data analysis pipeline DNMTools developed in the Smith lab at USC.

Mapping reads from bisulfite sequencing: Bisulfite treated reads are mapped to the genomes with the abismal program. Input reads are filtered by their quality, and adapter sequences in the 3' end of reads are trimmed. This is done with cutadapt. Uniquely mapped reads with mismatches/indels below given threshold are retained. For pair-end reads, if the two mates overlap, the overlapping part of the mate with lower quality is discarded. After mapping, we use the format command in dnmtools to merge mates for paired-end reads. We use the dnmtools uniq command to randomly select one from multiple reads mapped exactly to the same location. Without random oligos as UMIs, this is our best indication of PCR duplicates.

Estimating methylation levels: After reads are mapped and filtered, the dnmtools counts command is used to obtain read coverage and estimate methylation levels at individual cytosine sites. We count the number of methylated reads (those containing a C) and the number of unmethylated reads (those containing a T) at each nucleotide in a mapped read that corresponds to a cytosine in the reference genome. The methylation level of that cytosine is estimated as the ratio of methylated to total reads covering that cytosine. For cytosines in the symmetric CpG sequence context, reads from the both strands are collapsed to give a single estimate. Very rarely do the levels differ between strands (typically only if there has been a substitution, as in a somatic mutation), and this approach gives a better estimate.

Bisulfite conversion rate: The bisulfite conversion rate for an experiment is estimated with the dnmtools bsrate command, which computes the fraction of successfully converted nucleotides in reads (those read out as Ts) among all nucleotides in the reads mapped that map over cytosines in the reference genome. This is done either using a spike-in (e.g., lambda), the mitochondrial DNA, or the nuclear genome. In the latter case, only non-CpG sites are used. While this latter approach can be impacted by non-CpG cytosine methylation, in practice it never amounts to much.

Identifying hypomethylated regions (HMRs): In most mammalian cells, the majority of the genome has high methylation, and regions of low methylation are typically the interesting features. (This seems to be true for essentially all healthy differentiated cell types, but not cells of very early embryogenesis, various germ cells and precursors, and placental lineage cells.) These are valleys of low methylation are called hypomethylated regions (HMR) for historical reasons. To identify the HMRs, we use the dnmtools hmr command, which uses a statistical model that accounts for both the methylation level fluctations and the varying amounts of data available at each CpG site.

Partially methylated domains: Partially methylated domains are large genomic regions showing partial methylation observed in immortalized cell lines and cancerous cells. The pmd program is used to identify PMDs.

Allele-specific methylation: Allele-Specific methylated regions refers to regions where the parental allele is differentially methylated compared to the maternal allele. The program allelic is used to compute allele-specific methylation score can be computed for each CpG site by testing the linkage between methylation status of adjacent reads, and the program amrfinder is used to identify regions with allele-specific methylation.

For more detailed description of the methods of each step, please refer to the DNMTools documentation.