Machine learning application for development of a data-driven predictive model able to investigate quality of life scores in a rare disease.

Alkaptonuria Machine learning Precision medicine QoL scores Rare disease

Journal

Orphanet journal of rare diseases
ISSN: 1750-1172
Titre abrégé: Orphanet J Rare Dis
Pays: England
ID NLM: 101266602

Informations de publication

Date de publication:
12 02 2020
Historique:
received: 26 07 2019
accepted: 14 01 2020
entrez: 14 2 2020
pubmed: 14 2 2020
medline: 22 6 2021
Statut: epublish

Résumé

Alkaptonuria (AKU) is an ultra-rare autosomal recessive disease caused by a mutation in the homogentisate 1,2-dioxygenase (HGD) gene. One of the main obstacles in studying AKU, and other ultra-rare diseases, is the lack of a standardized methodology to assess disease severity or response to treatment. Quality of Life scores (QoL) are a reliable way to monitor patients' clinical condition and health status. QoL scores allow to monitor the evolution of diseases and assess the suitability of treatments by taking into account patients' symptoms, general health status and care satisfaction. However, more comprehensive tools to study a complex and multi-systemic disease like AKU are needed. In this study, a Machine Learning (ML) approach was implemented with the aim to perform a prediction of QoL scores based on clinical data deposited in the ApreciseKUre, an AKU- dedicated database. Data derived from 129 AKU patients have been firstly examined through a preliminary statistical analysis (Pearson correlation coefficient) to measure the linear correlation between 11 QoL scores. The variable importance in QoL scores prediction of 110 ApreciseKUre biomarkers has been then calculated using XGBoost, with K-nearest neighbours algorithm (k-NN) approach. Due to the limited number of data available, this model has been validated using surrogate data analysis. We identified a direct correlation of 6 (age, Serum Amyloid A, Chitotriosidase, Advanced Oxidation Protein Products, S-thiolated proteins and Body Mass Index) out of 110 biomarkers with the QoL health status, in particular with the KOOS (Knee injury and Osteoarthritis Outcome Score) symptoms (Relative Absolute Error (RAE) 0.25). The error distribution of surrogate-model (RAE 0.38) was unequivocally higher than the true-model one (RAE of 0.25), confirming the consistency of our dataset. Our data showed that inflammation, oxidative stress, amyloidosis and lifestyle of patients correlates with the QoL scores for physical status, while no correlation between the biomarkers and patients' mental health was present (RAE 1.1). This proof of principle study for rare diseases confirms the importance of database, allowing data management and analysis, which can be used to predict more effective treatments.

Sections du résumé

BACKGROUND
Alkaptonuria (AKU) is an ultra-rare autosomal recessive disease caused by a mutation in the homogentisate 1,2-dioxygenase (HGD) gene. One of the main obstacles in studying AKU, and other ultra-rare diseases, is the lack of a standardized methodology to assess disease severity or response to treatment. Quality of Life scores (QoL) are a reliable way to monitor patients' clinical condition and health status. QoL scores allow to monitor the evolution of diseases and assess the suitability of treatments by taking into account patients' symptoms, general health status and care satisfaction. However, more comprehensive tools to study a complex and multi-systemic disease like AKU are needed. In this study, a Machine Learning (ML) approach was implemented with the aim to perform a prediction of QoL scores based on clinical data deposited in the ApreciseKUre, an AKU- dedicated database.
METHOD
Data derived from 129 AKU patients have been firstly examined through a preliminary statistical analysis (Pearson correlation coefficient) to measure the linear correlation between 11 QoL scores. The variable importance in QoL scores prediction of 110 ApreciseKUre biomarkers has been then calculated using XGBoost, with K-nearest neighbours algorithm (k-NN) approach. Due to the limited number of data available, this model has been validated using surrogate data analysis.
RESULTS
We identified a direct correlation of 6 (age, Serum Amyloid A, Chitotriosidase, Advanced Oxidation Protein Products, S-thiolated proteins and Body Mass Index) out of 110 biomarkers with the QoL health status, in particular with the KOOS (Knee injury and Osteoarthritis Outcome Score) symptoms (Relative Absolute Error (RAE) 0.25). The error distribution of surrogate-model (RAE 0.38) was unequivocally higher than the true-model one (RAE of 0.25), confirming the consistency of our dataset. Our data showed that inflammation, oxidative stress, amyloidosis and lifestyle of patients correlates with the QoL scores for physical status, while no correlation between the biomarkers and patients' mental health was present (RAE 1.1).
CONCLUSIONS
This proof of principle study for rare diseases confirms the importance of database, allowing data management and analysis, which can be used to predict more effective treatments.

Identifiants

pubmed: 32050984
doi: 10.1186/s13023-020-1305-0
pii: 10.1186/s13023-020-1305-0
pmc: PMC7017449
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

46

Références

Eur J Hum Genet. 2019 Jun;27(6):888-902
pubmed: 30737480
Eur J Hum Genet. 2016 Jan;24(1):66-72
pubmed: 25804398
J Biol Chem. 1958 Jan;230(1):251-60
pubmed: 13502394
J Cell Biochem. 2010 Nov 1;111(4):922-32
pubmed: 20665660
Rheumatol Int. 2008 Jul;28(9):949-50
pubmed: 18324378
Biochim Biophys Acta. 2012 Nov;1822(11):1682-91
pubmed: 22850426
Yonsei Med J. 2007 Apr 30;48(2):218-24
pubmed: 17461519
BMC Med Inform Decis Mak. 2017 Apr 14;17(1):42
pubmed: 28410607
Mediators Inflamm. 2010;2010:535918
pubmed: 20847813
Arthritis Care Res (Hoboken). 2011 Nov;63 Suppl 11:S208-28
pubmed: 22588746
Allergy Asthma Immunol Res. 2015 Jan;7(1):14-21
pubmed: 25553258
Comput Biol Med. 2018 Dec 1;103:1-7
pubmed: 30316064
Free Radic Biol Med. 2012 Aug 15;53(4):907-15
pubmed: 22732185
J Am Soc Nephrol. 2008 Mar;19(3):593-9
pubmed: 18256365
J Inherit Metab Dis. 2015 Sep;38(5):797-805
pubmed: 25868666
Expert Rev Proteomics. 2013 Dec;10(6):521-35
pubmed: 24206226
Int J Biochem Cell Biol. 2016 Dec;81(Pt B):271-280
pubmed: 27590860
Rheumatology (Oxford). 2012 Mar;51(3):423-33
pubmed: 22179724
Rheumatology (Oxford). 2010 Oct;49(10):1975-83
pubmed: 20601653
N Engl J Med. 2002 Dec 26;347(26):2111-21
pubmed: 12501223
N Engl J Med. 1999 Feb 11;340(6):448-54
pubmed: 9971870
Osteoarthritis Cartilage. 2018 Aug;26(8):1078-1086
pubmed: 29852277
Mediators Inflamm. 2014;2014:258471
pubmed: 24876668
Free Radic Biol Med. 2015 Nov;88(Pt A):70-80
pubmed: 25733348
Diagn Pathol. 2014 Sep 26;9:185
pubmed: 25567001
FASEB J. 2019 Nov;33(11):12696-12703
pubmed: 31462106
Yonsei Med J. 2012 Sep;53(5):1045-8
pubmed: 22869491
J Inherit Metab Dis. 2011 Dec;34(6):1141-51
pubmed: 21748407
Pediatr Pulmonol. 2019 Sep;54(9):1422-1430
pubmed: 31211524
Front Immunol. 2013 Apr 22;4:92
pubmed: 23626589
Hum Mutat. 2009 Dec;30(12):1611-9
pubmed: 19862842
PLoS One. 2013;8(1):e54504
pubmed: 23365672
J Clin Biochem Nutr. 2008 Nov;43(3):154-8
pubmed: 19015749
Health Qual Life Outcomes. 2003 Nov 03;1:64
pubmed: 14613558
J Inherit Metab Dis. 2011 Dec;34(6):1163-76
pubmed: 21874298
Int J Cardiol. 2014;172(1):e121-3
pubmed: 24456888
Am J Hum Genet. 1960 Mar;12(1):76-85
pubmed: 17948450

Auteurs

Ottavia Spiga (O)

Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Via A., 53100, Siena, Italy. ottavia.spiga@unisi.it.

Vittoria Cicaloni (V)

Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Via A., 53100, Siena, Italy.
Toscana Life Sciences Foundation, Siena, Italy.

Cosimo Fiorini (C)

Energy way, Modena, Italy.

Alfonso Trezza (A)

Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Via A., 53100, Siena, Italy.

Anna Visibelli (A)

Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Via A., 53100, Siena, Italy.
Department of Information Engineering and Mathematics, University of Siena, Siena, Italy.

Lia Millucci (L)

Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Via A., 53100, Siena, Italy.

Giulia Bernardini (G)

Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Via A., 53100, Siena, Italy.

Andrea Bernini (A)

Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Via A., 53100, Siena, Italy.

Barbara Marzocchi (B)

Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Via A., 53100, Siena, Italy.
UOC Patologia Clinica, Azienda Ospedaliera Senese, Siena, Italy.

Daniela Braconi (D)

Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Via A., 53100, Siena, Italy.

Filippo Prischi (F)

School of Life Sciences, University of Essex, Colchester, CO4 3SQ, UK.

Annalisa Santucci (A)

Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Via A., 53100, Siena, Italy.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH