The use of machine learning in rare diseases: a scoping review.
Machine learning
Rare diseases
Scoping review
Journal
Orphanet journal of rare diseases
ISSN: 1750-1172
Titre abrégé: Orphanet J Rare Dis
Pays: England
ID NLM: 101266602
Informations de publication
Date de publication:
09 06 2020
09 06 2020
Historique:
received:
16
04
2020
accepted:
27
05
2020
entrez:
11
6
2020
pubmed:
11
6
2020
medline:
22
6
2021
Statut:
epublish
Résumé
Emerging machine learning technologies are beginning to transform medicine and healthcare and could also improve the diagnosis and treatment of rare diseases. Currently, there are no systematic reviews that investigate, from a general perspective, how machine learning is used in a rare disease context. This scoping review aims to address this gap and explores the use of machine learning in rare diseases, investigating, for example, in which rare diseases machine learning is applied, which types of algorithms and input data are used or which medical applications (e.g., diagnosis, prognosis or treatment) are studied. Using a complex search string including generic search terms and 381 individual disease names, studies from the past 10 years (2010-2019) that applied machine learning in a rare disease context were identified on PubMed. To systematically map the research activity, eligible studies were categorized along different dimensions (e.g., rare disease group, type of algorithm, input data), and the number of studies within these categories was analyzed. Two hundred eleven studies from 32 countries investigating 74 different rare diseases were identified. Diseases with a higher prevalence appeared more often in the studies than diseases with a lower prevalence. Moreover, some rare disease groups were investigated more frequently than to be expected (e.g., rare neurologic diseases and rare systemic or rheumatologic diseases), others less frequently (e.g., rare inborn errors of metabolism and rare skin diseases). Ensemble methods (36.0%), support vector machines (32.2%) and artificial neural networks (31.8%) were the algorithms most commonly applied in the studies. Only a small proportion of studies evaluated their algorithms on an external data set (11.8%) or against a human expert (2.4%). As input data, images (32.2%), demographic data (27.0%) and "omics" data (26.5%) were used most frequently. Most studies used machine learning for diagnosis (40.8%) or prognosis (38.4%) whereas studies aiming to improve treatment were relatively scarce (4.7%). Patient numbers in the studies were small, typically ranging from 20 to 99 (35.5%). Our review provides an overview of the use of machine learning in rare diseases. Mapping the current research activity, it can guide future work and help to facilitate the successful application of machine learning in rare diseases.
Sections du résumé
BACKGROUND
Emerging machine learning technologies are beginning to transform medicine and healthcare and could also improve the diagnosis and treatment of rare diseases. Currently, there are no systematic reviews that investigate, from a general perspective, how machine learning is used in a rare disease context. This scoping review aims to address this gap and explores the use of machine learning in rare diseases, investigating, for example, in which rare diseases machine learning is applied, which types of algorithms and input data are used or which medical applications (e.g., diagnosis, prognosis or treatment) are studied.
METHODS
Using a complex search string including generic search terms and 381 individual disease names, studies from the past 10 years (2010-2019) that applied machine learning in a rare disease context were identified on PubMed. To systematically map the research activity, eligible studies were categorized along different dimensions (e.g., rare disease group, type of algorithm, input data), and the number of studies within these categories was analyzed.
RESULTS
Two hundred eleven studies from 32 countries investigating 74 different rare diseases were identified. Diseases with a higher prevalence appeared more often in the studies than diseases with a lower prevalence. Moreover, some rare disease groups were investigated more frequently than to be expected (e.g., rare neurologic diseases and rare systemic or rheumatologic diseases), others less frequently (e.g., rare inborn errors of metabolism and rare skin diseases). Ensemble methods (36.0%), support vector machines (32.2%) and artificial neural networks (31.8%) were the algorithms most commonly applied in the studies. Only a small proportion of studies evaluated their algorithms on an external data set (11.8%) or against a human expert (2.4%). As input data, images (32.2%), demographic data (27.0%) and "omics" data (26.5%) were used most frequently. Most studies used machine learning for diagnosis (40.8%) or prognosis (38.4%) whereas studies aiming to improve treatment were relatively scarce (4.7%). Patient numbers in the studies were small, typically ranging from 20 to 99 (35.5%).
CONCLUSION
Our review provides an overview of the use of machine learning in rare diseases. Mapping the current research activity, it can guide future work and help to facilitate the successful application of machine learning in rare diseases.
Identifiants
pubmed: 32517778
doi: 10.1186/s13023-020-01424-6
pii: 10.1186/s13023-020-01424-6
pmc: PMC7285453
doi:
Types de publication
Journal Article
Review
Langues
eng
Sous-ensembles de citation
IM
Pagination
145Références
N Engl J Med. 2019 Apr 4;380(14):1347-1358
pubmed: 30943338
Nat Med. 2019 Jan;25(1):60-64
pubmed: 30617323
Orphanet J Rare Dis. 2019 Mar 21;14(1):69
pubmed: 30898118
BMC Med Res Methodol. 2018 Nov 19;18(1):143
pubmed: 30453902
Int J Evid Based Healthc. 2015 Sep;13(3):141-6
pubmed: 26134548
Nat Med. 2019 Mar;25(3):433-438
pubmed: 30742121
Comput Struct Biotechnol J. 2014 Nov 15;13:8-17
pubmed: 25750696
Ann Intern Med. 2018 Oct 2;169(7):467-473
pubmed: 30178033
Implement Sci. 2010 Sep 20;5:69
pubmed: 20854677
NPJ Digit Med. 2019 Aug 20;2:79
pubmed: 31453374
Eur J Hum Genet. 2020 Feb;28(2):165-173
pubmed: 31527858
Comput Struct Biotechnol J. 2017 Jan 08;15:104-116
pubmed: 28138367
J Gen Intern Med. 2014 Aug;29 Suppl 3:S780-7
pubmed: 25029978
Am J Hum Genet. 2015 Jul 2;97(1):111-24
pubmed: 26119816
Am J Hum Genet. 2008 Nov;83(5):610-5
pubmed: 18950739
JAMA. 2016 Dec 13;316(22):2402-2410
pubmed: 27898976
Am J Hum Genet. 2017 Feb 2;100(2):185-192
pubmed: 28157539
Sci Data. 2019 Oct 23;6(1):227
pubmed: 31645559
Genes (Basel). 2019 Nov 27;10(12):
pubmed: 31783696
Nature. 2017 Feb 2;542(7639):115-118
pubmed: 28117445
Hum Mutat. 2012 May;33(5):803-8
pubmed: 22422702
J Med Internet Res. 2018 Oct 17;20(10):e11936
pubmed: 30333097