Prediction model construction of mouse stem cell pluripotency using CpG and non-CpG DNA methylation markers.
DNA-methylation
Non-CpG methylation
Stem cell pluripotency
Journal
BMC bioinformatics
ISSN: 1471-2105
Titre abrégé: BMC Bioinformatics
Pays: England
ID NLM: 100965194
Informations de publication
Date de publication:
04 May 2020
04 May 2020
Historique:
received:
12
11
2019
accepted:
06
03
2020
entrez:
6
5
2020
pubmed:
6
5
2020
medline:
21
7
2020
Statut:
epublish
Résumé
Genome-wide studies of DNA methylation across the epigenetic landscape provide insights into the heterogeneity of pluripotent embryonic stem cells (ESCs). Differentiating into embryonic somatic and germ cells, ESCs exhibit varying degrees of pluripotency, and epigenetic changes occurring in this process have emerged as important factors explaining stem cell pluripotency. Here, using paired scBS-seq and scRNA-seq data of mice, we constructed a machine learning model that predicts degrees of pluripotency for mouse ESCs. Since the biological activities of non-CpG markers have yet to be clarified, we tested the predictive power of CpG and non-CpG markers, as well as a combination thereof, in the model. Through rigorous performance evaluation with both internal and external validation, we discovered that a model using both CpG and non-CpG markers predicted the pluripotency of ESCs with the highest prediction performance (0.956 AUC, external test). The prediction model consisted of 16 CpG and 33 non-CpG markers. The CpG and most of the non-CpG markers targeted depletions of methylation and were indicative of cell pluripotency, whereas only a few non-CpG markers reflected accumulations of methylation. Additionally, we confirmed that there exists the differing pluripotency between individual developmental stages, such as E3.5 and E6.5, as well as between induced mouse pluripotent stem cell (iPSC) and somatic cell. In this study, we investigated CpG and non-CpG methylation in relation to mouse stem cell pluripotency and developed a model thereon that successfully predicts the pluripotency of mouse ESCs.
Sections du résumé
BACKGROUND
BACKGROUND
Genome-wide studies of DNA methylation across the epigenetic landscape provide insights into the heterogeneity of pluripotent embryonic stem cells (ESCs). Differentiating into embryonic somatic and germ cells, ESCs exhibit varying degrees of pluripotency, and epigenetic changes occurring in this process have emerged as important factors explaining stem cell pluripotency.
RESULTS
RESULTS
Here, using paired scBS-seq and scRNA-seq data of mice, we constructed a machine learning model that predicts degrees of pluripotency for mouse ESCs. Since the biological activities of non-CpG markers have yet to be clarified, we tested the predictive power of CpG and non-CpG markers, as well as a combination thereof, in the model. Through rigorous performance evaluation with both internal and external validation, we discovered that a model using both CpG and non-CpG markers predicted the pluripotency of ESCs with the highest prediction performance (0.956 AUC, external test). The prediction model consisted of 16 CpG and 33 non-CpG markers. The CpG and most of the non-CpG markers targeted depletions of methylation and were indicative of cell pluripotency, whereas only a few non-CpG markers reflected accumulations of methylation. Additionally, we confirmed that there exists the differing pluripotency between individual developmental stages, such as E3.5 and E6.5, as well as between induced mouse pluripotent stem cell (iPSC) and somatic cell.
CONCLUSIONS
CONCLUSIONS
In this study, we investigated CpG and non-CpG methylation in relation to mouse stem cell pluripotency and developed a model thereon that successfully predicts the pluripotency of mouse ESCs.
Identifiants
pubmed: 32366211
doi: 10.1186/s12859-020-3448-3
pii: 10.1186/s12859-020-3448-3
pmc: PMC7199378
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
175Subventions
Organisme : Ministry of Science, ICT (KR)
ID : NRF-2017M3A9C4092978
Organisme : GIST Research Institute
ID : GRI2020
Organisme : National Research Foundation of Korea
ID : NRF-2017M3C9A6047625
Références
Nat Methods. 2011 Apr;8(4):315-7
pubmed: 21378979
Exp Mol Med. 2017 Apr 28;49(4):e322
pubmed: 28450738
Cell. 2006 Aug 25;126(4):663-76
pubmed: 16904174
Genome Res. 2010 Mar;20(3):320-31
pubmed: 20133333
Nat Methods. 2014 Aug;11(8):817-820
pubmed: 25042786
Nat Methods. 2017 Oct;14(10):979-982
pubmed: 28825705
Sci Rep. 2015 Mar 10;5:8973
pubmed: 25754700
Nucleic Acids Res. 2016 Jul 27;44(13):e117
pubmed: 27179027
Biochem Biophys Res Commun. 1987 Jun 15;145(2):888-94
pubmed: 3593377
Genes Dev. 2005 May 15;19(10):1129-55
pubmed: 15905405
Adv Exp Med Biol. 2013;786:307-28
pubmed: 23696364
Sci Rep. 2017 Sep 12;7(1):11295
pubmed: 28900200
Proc Natl Acad Sci U S A. 2002 Apr 2;99(7):4391-6
pubmed: 11917100
Cell Stem Cell. 2015 Oct 1;17(4):471-85
pubmed: 26431182
Genome Biol. 2016 May 23;17(1):106
pubmed: 27215581
Epigenetics. 2014 Jun;9(6):823-8
pubmed: 24717538
Circ Res. 2014 Jul 7;115(2):311-24
pubmed: 24989490
Stem Cells Dev. 2008 Dec;17(6):1043-52
pubmed: 18393635
Cell Transplant. 2016 Oct;25(10):1879-1892
pubmed: 26980118
Pediatr Res. 2006 Apr;59(4 Pt 2):21R-5R
pubmed: 16549544
Blood Res. 2015 Dec;50(4):194-203
pubmed: 26770947
Nature. 2009 Nov 19;462(7271):315-22
pubmed: 19829295
BMC Bioinformatics. 2011 Aug 04;12:323
pubmed: 21816040
Nat Commun. 2016 Jan 29;7:10458
pubmed: 26822956
Nat Methods. 2016 Mar;13(3):229-232
pubmed: 26752769
J Stem Cells. 2014;9(3):163-97
pubmed: 25157450
Bioinformatics. 2013 Jan 1;29(1):15-21
pubmed: 23104886
Nat Genet. 2007 Apr;39(4):457-66
pubmed: 17334365
J Econ Entomol. 1946 Apr;39:269
pubmed: 20983181
Cell. 2008 Feb 22;132(4):661-80
pubmed: 18295582
Bioinformatics. 2011 Jun 1;27(11):1571-2
pubmed: 21493656