Human methylome studies SRP033252 Track Settings
 
Genome-wide profiling of the functional DNA methylation landscape at base-pair resolution in human cancer types [BS-seq] [Blood (B-cells), Brain, Brain (Grey Matter), Brain (White Matter), Breast, Colon, Liver, Lung, Placenta, Prostate]

Track collection: Human methylome studies

+  All tracks in this collection (424)

Maximum display mode:       Reset to defaults   
Select views (Help):
HMR       PMD       CpG methylation ▾       AMR       CpG reads ▾      
Select subtracks by views and experiment:
 All views HMR  PMD  CpG methylation  AMR  CpG reads 
experiment
SRX381553 
SRX381569 
SRX381585 
SRX381601 
SRX381611 
SRX381621 
SRX381631 
SRX381646 
SRX381661 
SRX381666 
SRX381671 
SRX381678 
SRX381693 
SRX381697 
SRX381701 
SRX381706 
SRX381710 
SRX381713 
SRX381716 
SRX381719 
SRX381722 
SRX381725 
experiment
 All views HMR  PMD  CpG methylation  AMR  CpG reads 
List subtracks: only selected/visible    all    ()
  experiment↓1 views↓2   Track Name↓3  
hide
 SRX381553  HMR  Colon / SRX381553 (HMR)   Data format 
hide
 Configure
 SRX381553  CpG methylation  Colon / SRX381553 (CpG methylation)   Data format 
hide
 SRX381569  HMR  Colon / SRX381569 (HMR)   Data format 
hide
 Configure
 SRX381569  CpG methylation  Colon / SRX381569 (CpG methylation)   Data format 
hide
 Configure
 SRX381585  CpG methylation  Colon / SRX381585 (CpG methylation)   Data format 
hide
 Configure
 SRX381601  CpG methylation  Prostate / SRX381601 (CpG methylation)   Data format 
hide
 Configure
 SRX381611  CpG methylation  Breast / SRX381611 (CpG methylation)   Data format 
hide
 Configure
 SRX381621  CpG methylation  Breast / SRX381621 (CpG methylation)   Data format 
hide
 SRX381631  HMR  Breast / SRX381631 (HMR)   Data format 
hide
 Configure
 SRX381631  CpG methylation  Breast / SRX381631 (CpG methylation)   Data format 
hide
 SRX381646  HMR  Prostate / SRX381646 (HMR)   Data format 
hide
 Configure
 SRX381646  CpG methylation  Prostate / SRX381646 (CpG methylation)   Data format 
hide
 Configure
 SRX381661  CpG methylation  Liver / SRX381661 (CpG methylation)   Data format 
hide
 SRX381666  HMR  Liver / SRX381666 (HMR)   Data format 
hide
 Configure
 SRX381666  CpG methylation  Liver / SRX381666 (CpG methylation)   Data format 
hide
 SRX381671  HMR  Liver / SRX381671 (HMR)   Data format 
hide
 Configure
 SRX381671  CpG methylation  Liver / SRX381671 (CpG methylation)   Data format 
hide
 Configure
 SRX381678  CpG methylation  Prostate / SRX381678 (CpG methylation)   Data format 
hide
 SRX381693  HMR  Brain (Grey Matter) / SRX381693 (HMR)   Data format 
hide
 Configure
 SRX381693  CpG methylation  Brain (Grey Matter) / SRX381693 (CpG methylation)   Data format 
hide
 Configure
 SRX381697  CpG methylation  Brain / SRX381697 (CpG methylation)   Data format 
hide
 Configure
 SRX381701  CpG methylation  Brain / SRX381701 (CpG methylation)   Data format 
hide
 SRX381706  HMR  Brain (White Matter) / SRX381706 (HMR)   Data format 
hide
 Configure
 SRX381706  CpG methylation  Brain (White Matter) / SRX381706 (CpG methylation)   Data format 
hide
 Configure
 SRX381710  CpG methylation  Placenta / SRX381710 (CpG methylation)   Data format 
hide
 SRX381713  HMR  Lung / SRX381713 (HMR)   Data format 
hide
 Configure
 SRX381713  CpG methylation  Lung / SRX381713 (CpG methylation)   Data format 
hide
 Configure
 SRX381716  CpG methylation  Lung / SRX381716 (CpG methylation)   Data format 
hide
 Configure
 SRX381719  CpG methylation  Lung / SRX381719 (CpG methylation)   Data format 
hide
 Configure
 SRX381722  CpG methylation  Lung / SRX381722 (CpG methylation)   Data format 
hide
 SRX381725  HMR  Blood (B-cells) / SRX381725 (HMR)   Data format 
hide
 Configure
 SRX381725  CpG methylation  Blood (B-cells) / SRX381725 (CpG methylation)   Data format 
    
Assembly: Human Dec. 2013 (GRCh38/hg38)

Study title: Genome-wide profiling of the functional DNA methylation landscape at base-pair resolution in human cancer types [BS-seq]
SRA: SRP033252
GEO: GSE52271
Pubmed: 26813288

Experiment Label Methylation Coverage HMRs HMR size AMRs AMR size PMDs PMD size Conversion Title
SRX381553 Colon 0.672 46.0 39500 1123.0 12293 1132.0 2227 11334.7 0.998 Bsseq normal colon
SRX381569 Colon 0.657 43.9 64108 2489.8 13113 1127.5 2137 434987.3 0.998 Bsseq tumor colon
SRX381585 Colon 0.617 47.0 66552 5455.0 11292 1110.2 1912 541396.1 0.998 Bsseq metastasis colon
SRX381601 Prostate 0.541 29.1 85795 7945.6 2023 1055.5 3396 270015.6 0.999 Bsseq tumor prostate
SRX381611 Breast 0.566 37.3 160561 5524.4 2044 993.8 2680 390350.1 0.999 Bsseq metastasis breast
SRX381621 Breast 0.438 37.4 157544 6936.3 4057 1115.5 6328 211256.8 0.999 Bsseq tumor breast
SRX381631 Breast 0.715 34.7 60673 1115.2 7147 965.4 2866 15254.4 0.997 Bsseq normal breast
SRX381646 Prostate 0.729 37.4 64626 1125.7 4927 932.6 3254 10576.9 0.998 Bsseq normal prostate
SRX381661 Liver 0.496 50.7 68528 8706.7 12170 1034.2 3051 270125.8 0.999 Bsseq tumor liver
SRX381666 Liver 0.629 52.5 39936 1462.8 34089 1772.3 1529 817861.8 0.998 Bsseq tumor liver
SRX381671 Liver 0.718 55.5 55188 1203.7 6553 1028.2 3711 21288.8 0.995 Bsseq normal liver
SRX381678 Prostate 0.532 40.0 134037 5837.8 2234 1019.2 3992 205716.8 0.999 Bsseq tumor prostate
SRX381693 Brain (Grey Matter) 0.729 15.8 43145 1160.6 2226 944.7 4041 15343.4 0.981 Bsseq normal brain grey-matter
SRX381697 Brain 0.640 16.2 82379 8318.9 4369 1304.2 2302 356703.4 0.995 Bsseq tumor neuroblastoma
SRX381701 Brain 0.546 17.0 83080 8071.6 1539 1052.2 3024 254719.9 0.995 Bsseq tumor glioblastoma
SRX381706 Brain (White Matter) 0.738 16.0 60013 1098.1 894 1031.2 3562 19610.3 0.991 Bsseq normal brain white-matter
SRX381710 Placenta 0.565 29.7 53819 11001.0 5272 1016.6 2610 392655.4 0.996 Bsseq normal placenta
SRX381713 Lung 0.711 18.8 40823 1110.5 2102 903.3 2792 10945.9 0.996 Bsseq normal lung
SRX381716 Lung 0.475 20.6 115726 9586.3 456 936.9 3852 345753.1 0.997 Bsseq adenocarcinoma lung
SRX381719 Lung 0.416 21.0 107730 11488.8 1548 1000.5 3476 503769.8 0.997 Bsseq squamous_cell tumor lung
SRX381722 Lung 0.629 21.1 91060 6082.8 1459 1025.4 2812 235689.5 0.996 Bsseq small_cell tumor lung
SRX381725 Blood (B-cells) 0.698 18.1 52213 987.5 1747 980.7 2783 8926.3 0.996 Bsseq normal CD19

Methods

All analysis was done using a bisulfite sequnecing data analysis pipeline DNMTools developed in the Smith lab at USC.

Mapping reads from bisulfite sequencing: Bisulfite treated reads are mapped to the genomes with the abismal program. Input reads are filtered by their quality, and adapter sequences in the 3' end of reads are trimmed. This is done with cutadapt. Uniquely mapped reads with mismatches/indels below given threshold are retained. For pair-end reads, if the two mates overlap, the overlapping part of the mate with lower quality is discarded. After mapping, we use the format command in dnmtools to merge mates for paired-end reads. We use the dnmtools uniq command to randomly select one from multiple reads mapped exactly to the same location. Without random oligos as UMIs, this is our best indication of PCR duplicates.

Estimating methylation levels: After reads are mapped and filtered, the dnmtools counts command is used to obtain read coverage and estimate methylation levels at individual cytosine sites. We count the number of methylated reads (those containing a C) and the number of unmethylated reads (those containing a T) at each nucleotide in a mapped read that corresponds to a cytosine in the reference genome. The methylation level of that cytosine is estimated as the ratio of methylated to total reads covering that cytosine. For cytosines in the symmetric CpG sequence context, reads from the both strands are collapsed to give a single estimate. Very rarely do the levels differ between strands (typically only if there has been a substitution, as in a somatic mutation), and this approach gives a better estimate.

Bisulfite conversion rate: The bisulfite conversion rate for an experiment is estimated with the dnmtools bsrate command, which computes the fraction of successfully converted nucleotides in reads (those read out as Ts) among all nucleotides in the reads mapped that map over cytosines in the reference genome. This is done either using a spike-in (e.g., lambda), the mitochondrial DNA, or the nuclear genome. In the latter case, only non-CpG sites are used. While this latter approach can be impacted by non-CpG cytosine methylation, in practice it never amounts to much.

Identifying hypomethylated regions (HMRs): In most mammalian cells, the majority of the genome has high methylation, and regions of low methylation are typically the interesting features. (This seems to be true for essentially all healthy differentiated cell types, but not cells of very early embryogenesis, various germ cells and precursors, and placental lineage cells.) These are valleys of low methylation are called hypomethylated regions (HMR) for historical reasons. To identify the HMRs, we use the dnmtools hmr command, which uses a statistical model that accounts for both the methylation level fluctations and the varying amounts of data available at each CpG site.

Partially methylated domains: Partially methylated domains are large genomic regions showing partial methylation observed in immortalized cell lines and cancerous cells. The pmd program is used to identify PMDs.

Allele-specific methylation: Allele-Specific methylated regions refers to regions where the parental allele is differentially methylated compared to the maternal allele. The program allelic is used to compute allele-specific methylation score can be computed for each CpG site by testing the linkage between methylation status of adjacent reads, and the program amrfinder is used to identify regions with allele-specific methylation.

For more detailed description of the methods of each step, please refer to the DNMTools documentation.