Single nucleotide polymorphism genes and mitochondrial DNA haplogroups as biomarkers for early prediction of knee osteoarthritis structural progressors: use of supervised machine learning classifiers.
Biomarkers
Early prognosis
Knee osteoarthritis
Machine learning
Prediction
Single nucleotide polymorphism genes
Structural progressors
mtDNA haplogroup
Journal
BMC medicine
ISSN: 1741-7015
Titre abrégé: BMC Med
Pays: England
ID NLM: 101190723
Informations de publication
Date de publication:
12 09 2022
12 09 2022
Historique:
received:
02
02
2022
accepted:
20
07
2022
entrez:
11
9
2022
pubmed:
12
9
2022
medline:
14
9
2022
Statut:
epublish
Résumé
Knee osteoarthritis is the most prevalent chronic musculoskeletal debilitating disease. Current treatments are only symptomatic, and to improve this, we need a robust prediction model to stratify patients at an early stage according to the risk of joint structure disease progression. Some genetic factors, including single nucleotide polymorphism (SNP) genes and mitochondrial (mt)DNA haplogroups/clusters, have been linked to this disease. For the first time, we aim to determine, by using machine learning, whether some SNP genes and mtDNA haplogroups/clusters alone or combined could predict early knee osteoarthritis structural progressors. Participants (901) were first classified for the probability of being structural progressors. Genotyping included SNP genes TP63, FTO, GNL3, DUS4L, GDF5, SUPT3H, MCF2L, and TGFA; mtDNA haplogroups H, J, T, Uk, and others; and clusters HV, TJ, KU, and C-others. They were considered for prediction with major risk factors of osteoarthritis, namely, age and body mass index (BMI). Seven supervised machine learning methodologies were evaluated. The support vector machine was used to generate gender-based models. The best input combination was assessed using sensitivity and synergy analyses. Validation was performed using tenfold cross-validation and an external cohort (TASOAC). From 277 models, two were defined. Both used age and BMI in addition for the first one of the SNP genes TP63, DUS4L, GDF5, and FTO with an accuracy of 85.0%; the second profits from the association of mtDNA haplogroups and SNP genes FTO and SUPT3H with 82.5% accuracy. The highest impact was associated with the haplogroup H, the presence of CT alleles for rs8044769 at FTO, and the absence of AA for rs10948172 at SUPT3H. Validation accuracy with the cross-validation (about 95%) and the external cohort (90.5%, 85.7%, respectively) was excellent for both models. This study introduces a novel source of decision support in precision medicine in which, for the first time, two models were developed consisting of (i) age, BMI, TP63, DUS4L, GDF5, and FTO and (ii) the optimum one as it has one less variable: age, BMI, mtDNA haplogroup, FTO, and SUPT3H. Such a framework is translational and would benefit patients at risk of structural progressive knee osteoarthritis.
Sections du résumé
BACKGROUND
Knee osteoarthritis is the most prevalent chronic musculoskeletal debilitating disease. Current treatments are only symptomatic, and to improve this, we need a robust prediction model to stratify patients at an early stage according to the risk of joint structure disease progression. Some genetic factors, including single nucleotide polymorphism (SNP) genes and mitochondrial (mt)DNA haplogroups/clusters, have been linked to this disease. For the first time, we aim to determine, by using machine learning, whether some SNP genes and mtDNA haplogroups/clusters alone or combined could predict early knee osteoarthritis structural progressors.
METHODS
Participants (901) were first classified for the probability of being structural progressors. Genotyping included SNP genes TP63, FTO, GNL3, DUS4L, GDF5, SUPT3H, MCF2L, and TGFA; mtDNA haplogroups H, J, T, Uk, and others; and clusters HV, TJ, KU, and C-others. They were considered for prediction with major risk factors of osteoarthritis, namely, age and body mass index (BMI). Seven supervised machine learning methodologies were evaluated. The support vector machine was used to generate gender-based models. The best input combination was assessed using sensitivity and synergy analyses. Validation was performed using tenfold cross-validation and an external cohort (TASOAC).
RESULTS
From 277 models, two were defined. Both used age and BMI in addition for the first one of the SNP genes TP63, DUS4L, GDF5, and FTO with an accuracy of 85.0%; the second profits from the association of mtDNA haplogroups and SNP genes FTO and SUPT3H with 82.5% accuracy. The highest impact was associated with the haplogroup H, the presence of CT alleles for rs8044769 at FTO, and the absence of AA for rs10948172 at SUPT3H. Validation accuracy with the cross-validation (about 95%) and the external cohort (90.5%, 85.7%, respectively) was excellent for both models.
CONCLUSIONS
This study introduces a novel source of decision support in precision medicine in which, for the first time, two models were developed consisting of (i) age, BMI, TP63, DUS4L, GDF5, and FTO and (ii) the optimum one as it has one less variable: age, BMI, mtDNA haplogroup, FTO, and SUPT3H. Such a framework is translational and would benefit patients at risk of structural progressive knee osteoarthritis.
Identifiants
pubmed: 36089590
doi: 10.1186/s12916-022-02491-1
pii: 10.1186/s12916-022-02491-1
pmc: PMC9465912
doi:
Substances chimiques
Biomarkers
0
DNA, Mitochondrial
0
GNL3 protein, human
0
Nuclear Proteins
0
Alpha-Ketoglutarate-Dependent Dioxygenase FTO
EC 1.14.11.33
FTO protein, human
EC 1.14.11.33
GTP-Binding Proteins
EC 3.6.1.-
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
316Subventions
Organisme : NIAMS NIH HHS
ID : N01AR22262
Pays : United States
Organisme : NIAMS NIH HHS
ID : N01AR22261
Pays : United States
Organisme : NIAMS NIH HHS
ID : N01AR22260
Pays : United States
Organisme : NIAMS NIH HHS
ID : N01AR22259
Pays : United States
Organisme : NHLBI NIH HHS
ID : N01AR22258
Pays : United States
Informations de copyright
© 2022. The Author(s).
Références
Sci Rep. 2019 Nov 14;9(1):16880
pubmed: 31727952
Medicine (Baltimore). 2016 Feb;95(7):e2811
pubmed: 26886631
Rheumatol Int. 2015 Feb;35(2):337-44
pubmed: 25086630
Cells. 2019 Jun 18;8(6):
pubmed: 31216686
Curr Opin Rheumatol. 2017 Jan;29(1):103-109
pubmed: 27755178
Nat Rev Mol Cell Biol. 2016 Apr;17(4):213-26
pubmed: 26956194
J Pediatr Gastroenterol Nutr. 2021 Feb 1;72(2):262-269
pubmed: 33003163
Front Med (Lausanne). 2021 Nov 15;8:773417
pubmed: 34869491
PLoS One. 2014 Nov 12;9(11):e112735
pubmed: 25390621
Ann Rheum Dis. 2011 Sep;70(9):1599-604
pubmed: 21613308
J Med Imaging (Bellingham). 2017 Apr;4(2):024507
pubmed: 28653015
BMC Med Genomics. 2020 Jul 7;13(1):97
pubmed: 32635923
Nat Genet. 2008 Aug;40(8):994-8
pubmed: 18622395
Mitochondrion. 2014 Mar;15:18-23
pubmed: 24632472
J Med Genet. 2013 Nov;50(11):715-24
pubmed: 23868913
Hum Genet. 2014 Apr;133(4):435-58
pubmed: 24305784
Ann Rheum Dis. 2011 Apr;70(4):646-52
pubmed: 21177294
Osteoarthritis Cartilage. 2017 Dec;25(12):2014-2021
pubmed: 28899843
Osteoarthritis Cartilage. 2017 Dec;25(12):1926-1941
pubmed: 28847624
Nature. 2016 Jul 28;535(7613):498-500
pubmed: 27383787
Arthritis Care Res (Hoboken). 2022 Mar 4;:
pubmed: 35245407
Ann Rheum Dis. 2013 Oct;72(10):1594-604
pubmed: 23887285
Nat Genet. 2010 Nov;42(11):937-48
pubmed: 20935630
Nat Commun. 2021 Sep 22;12(1):5565
pubmed: 34552089
Ann Rheum Dis. 2017 Jun;76(6):939-941
pubmed: 28069577
Ann Rheum Dis. 2017 Jun;76(6):1114-1122
pubmed: 27919866
Nat Rev Genet. 2008 Dec;9(12):911-22
pubmed: 19002143
Am J Hum Genet. 2011 Sep 9;89(3):446-50
pubmed: 21871595
Arthritis Res Ther. 2010;12(5):R187
pubmed: 20939878
Lipids Health Dis. 2021 Nov 8;20(1):158
pubmed: 34749748
BMC Musculoskelet Disord. 2015 Oct 22;16:312
pubmed: 26494421
Clin Rheumatol. 2020 Apr;39(4):1027-1037
pubmed: 31897963
BMC Med Genet. 2014 May 04;15:53
pubmed: 24886551
PLoS One. 2016 Feb 09;11(2):e0148724
pubmed: 26859664
Lancet. 2012 Sep 1;380(9844):815-23
pubmed: 22763110
Nat Genet. 2018 Apr;50(4):549-558
pubmed: 29559693
Front Genet. 2020 Jan 17;10:1335
pubmed: 32010192
Ther Adv Musculoskelet Dis. 2021 Feb 23;13:1759720X21993254
pubmed: 33747150
Cancer Imaging. 2021 Jan 26;21(1):17
pubmed: 33499939
J Orthop Res. 2017 Oct;35(10):2243-2250
pubmed: 28084653
Proc Natl Acad Sci U S A. 2020 Oct 6;117(40):24709-24719
pubmed: 32958644
Best Pract Res Clin Rheumatol. 2008 Dec;22(6):1061-74
pubmed: 19041077
Eur J Clin Nutr. 2018 Apr;72(4):587-592
pubmed: 29386643
Trends Cell Biol. 2017 Jun;27(6):453-463
pubmed: 28274652
Nat Rev Rheumatol. 2018 Jun;14(6):327-340
pubmed: 29670212
Front Genet. 2021 Feb 11;12:626260
pubmed: 33659027
Rheumatology (Oxford). 2017 Feb;56(2):263-270
pubmed: 27864563
Ther Adv Musculoskelet Dis. 2020 Aug 13;12:1759720X20933468
pubmed: 32849918
Osteoarthritis Cartilage. 2007;15 Suppl A:A1-56
pubmed: 17320422
Arthritis Rheumatol. 2019 Jul;71(7):1191-1200
pubmed: 30747498
Arthritis Rheum. 2008 Aug;58(8):2387-96
pubmed: 18668590
Nat Genet. 2006 Feb;38(2):218-22
pubmed: 16429159
PLoS Genet. 2016 Oct 4;12(10):e1006260
pubmed: 27701424
Neurobiol Aging. 2009 Nov;30(11):1749-55
pubmed: 18308428
Mitochondrion. 2010 Mar;10(2):102-7
pubmed: 19900587
Arthritis Res Ther. 2011;13(6):247
pubmed: 22136179
Ann Rheum Dis. 2014 Dec;73(12):2082-6
pubmed: 23921993
Arthritis Res Ther. 2010;12(6):R222
pubmed: 21190554
Int Breastfeed J. 2021 Jan 4;16(1):2
pubmed: 33397423
Genome Biol. 2016 Jul 29;17(1):166
pubmed: 27473438
Osteoarthritis Cartilage. 2019 Jul;27(7):994-1001
pubmed: 31002938
Nature. 2015 Feb 12;518(7538):197-206
pubmed: 25673413
PLoS One. 2014 Oct 23;9(10):e108896
pubmed: 25340756
Clin Biomech (Bristol, Avon). 2017 Aug;47:87-95
pubmed: 28618311