Affiliation:
1. UiT-The Arctic University of Norway
Abstract
Abstract
Background
The analysis of WGBS (whole genome bisulfite sequencing) datasets is challenging. The large number of CpG sites requires significant computing power and can lead to harsh multiple correction penalties. Typically, the number of CpG sites found in DMRs (differentially regulated regions) represent a very small proportion of the initial number of CpG sites. This is because methylation levels of the majority of CpG sites do not vary significantly between samples, and/or the CpG sites are too far dispersed to be considered a contiguous region. DMRs are like likely to be found in relatively compact CpG rich regions that vary in methylation levels. Isolating these regions could greatly reduce downstream computational and statistical challenges without any previous knowledge of sample groups.
Results
The proposed method was able to isolate compact CpG rich variable regions using distance, covariation, and user parameters without a priori sample information. Results were verified with EpiDISH cell deconvolution and comparable with to a complementary method DMRSeq. Isolated regions averaged just 293 bp in length yet contained an average of 29 CpG sites per region.
Conclusions
By defining compact CpG rich variable regions, the method hopes to provide a valid and simpler starting point for further downstream analyses. This method is applicable to any dataset containing total CpG and total CpG methylated count matrices.
Publisher
Research Square Platform LLC
Reference28 articles.
1. Bernstein BE, Meissner A, Lander ES. The Mammalian Epigenome. Cell. 2007;128:669–81.
2. CpG islands as genomic footprints of promoters that are associated with replication origins;Antequera F;Current Biology,1999
3. CG dinucleotides enhance promoter activity independent of DNA methylation;Hartl D;Genome Research,2019
4. Statistical Challenges in Analyzing Methylation and Long-Range Chromosomal Interaction Data;Qin Z;Stat Biosci,2016
5. Differential landscape of non-CpG methylation in embryonic stem cells and neurons caused by DNMT3s OPEN;Lee J-H