A hidden markov model for identifying differentially methylated sites in bisulfite sequencing data.
Blood cell-separated data
Differentially methylated region
Next-generation sequencing
Read-depth
Journal
Biometrics
ISSN: 1541-0420
Titre abrégé: Biometrics
Pays: United States
ID NLM: 0370625
Informations de publication
Date de publication:
03 2019
03 2019
Historique:
received:
01
08
2017
revised:
01
07
2018
accepted:
01
08
2018
pubmed:
1
9
2018
medline:
18
12
2019
entrez:
1
9
2018
Statut:
ppublish
Résumé
DNA methylation studies have enabled researchers to understand methylation patterns and their regulatory roles in biological processes and disease. However, only a limited number of statistical approaches have been developed to provide formal quantitative analysis. Specifically, a few available methods do identify differentially methylated CpG (DMC) sites or regions (DMR), but they suffer from limitations that arise mostly due to challenges inherent in bisulfite sequencing data. These challenges include: (1) that read-depths vary considerably among genomic positions and are often low; (2) both methylation and autocorrelation patterns change as regions change; and (3) CpG sites are distributed unevenly. Furthermore, there are several methodological limitations: almost none of these tools is capable of comparing multiple groups and/or working with missing values, and only a few allow continuous or multiple covariates. The last of these is of great interest among researchers, as the goal is often to find which regions of the genome are associated with several exposures and traits. To tackle these issues, we have developed an efficient DMC identification method based on Hidden Markov Models (HMMs) called "DMCHMM" which is a three-step approach (model selection, prediction, testing) aiming to address the aforementioned drawbacks. Our proposed method is different from other HMM methods since it profiles methylation of each sample separately, hence exploiting inter-CpG autocorrelation within samples, and it is more flexible than previous approaches by allowing multiple hidden states. Using simulations, we show that DMCHMM has the best performance among several competing methods. An analysis of cell-separated blood methylation profiles is also provided.
Substances chimiques
Sulfites
0
hydrogen sulfite
OJ9787WBLU
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
210-221Subventions
Organisme : CIHR
ID : 130344
Pays : Canada
Informations de copyright
© 2018, The International Biometric Society.