AI-MARRVEL - A Knowledge-Driven AI System for Diagnosing Mendelian Disorders.


Journal

NEJM AI
ISSN: 2836-9386
Titre abrégé: NEJM AI
Pays: United States
ID NLM: 9918752186406676

Informations de publication

Date de publication:
May 2024
Historique:
medline: 4 7 2024
pubmed: 4 7 2024
entrez: 4 7 2024
Statut: ppublish

Résumé

Diagnosing genetic disorders requires extensive manual curation and interpretation of candidate variants, a labor-intensive task even for trained geneticists. Although artificial intelligence (AI) shows promise in aiding these diagnoses, existing AI tools have only achieved moderate success for primary diagnosis. AI-MARRVEL (AIM) uses a random-forest machine-learning classifier trained on over 3.5 million variants from thousands of diagnosed cases. AIM additionally incorporates expert-engineered features into training to recapitulate the intricate decision-making processes in molecular diagnosis. The online version of AIM is available at https://ai.marrvel.org. To evaluate AIM, we benchmarked it with diagnosed patients from three independent cohorts. AIM improved the rate of accurate genetic diagnosis, doubling the number of solved cases as compared with benchmarked methods, across three distinct real-world cohorts. To better identify diagnosable cases from the unsolved pools accumulated over time, we designed a confidence metric on which AIM achieved a precision rate of 98% and identified 57% of diagnosable cases out of a collection of 871 cases. Furthermore, AIM's performance improved after being fine-tuned for targeted settings including recessive disorders and trio analysis. Finally, AIM demonstrated potential for novel disease gene discovery by correctly predicting two newly reported disease genes from the Undiagnosed Diseases Network. AIM achieved superior accuracy compared with existing methods for genetic diagnosis. We anticipate that this tool may aid in primary diagnosis, reanalysis of unsolved cases, and the discovery of novel disease genes. (Funded by the NIH Common Fund and others.).

Sections du résumé

BACKGROUND BACKGROUND
Diagnosing genetic disorders requires extensive manual curation and interpretation of candidate variants, a labor-intensive task even for trained geneticists. Although artificial intelligence (AI) shows promise in aiding these diagnoses, existing AI tools have only achieved moderate success for primary diagnosis.
METHODS METHODS
AI-MARRVEL (AIM) uses a random-forest machine-learning classifier trained on over 3.5 million variants from thousands of diagnosed cases. AIM additionally incorporates expert-engineered features into training to recapitulate the intricate decision-making processes in molecular diagnosis. The online version of AIM is available at https://ai.marrvel.org. To evaluate AIM, we benchmarked it with diagnosed patients from three independent cohorts.
RESULTS RESULTS
AIM improved the rate of accurate genetic diagnosis, doubling the number of solved cases as compared with benchmarked methods, across three distinct real-world cohorts. To better identify diagnosable cases from the unsolved pools accumulated over time, we designed a confidence metric on which AIM achieved a precision rate of 98% and identified 57% of diagnosable cases out of a collection of 871 cases. Furthermore, AIM's performance improved after being fine-tuned for targeted settings including recessive disorders and trio analysis. Finally, AIM demonstrated potential for novel disease gene discovery by correctly predicting two newly reported disease genes from the Undiagnosed Diseases Network.
CONCLUSIONS CONCLUSIONS
AIM achieved superior accuracy compared with existing methods for genetic diagnosis. We anticipate that this tool may aid in primary diagnosis, reanalysis of unsolved cases, and the discovery of novel disease genes. (Funded by the NIH Common Fund and others.).

Identifiants

pubmed: 38962029
doi: 10.1056/aioa2300009
pmc: PMC11221788
doi:

Types de publication

Journal Article

Langues

eng

Auteurs

Dongxue Mao (D)

Department of Pediatrics, Baylor College of Medicine, Houston.
Department of Molecular and Human Genetics, Baylor College of Medicine, Houston.
Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston.

Chaozhong Liu (C)

Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston.
Graduate School of Biomedical Sciences, Program in Quantitative and Computational Biosciences, Baylor College of Medicine, Houston.

Linhua Wang (L)

Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston.
Graduate School of Biomedical Sciences, Program in Quantitative and Computational Biosciences, Baylor College of Medicine, Houston.

Rami Ai-Ouran (R)

Department of Pediatrics, Baylor College of Medicine, Houston.
Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston.
Department of Data Science and AI, Al Hussein Technical University, Amman, Jordan.

Cole Deisseroth (C)

Department of Molecular and Human Genetics, Baylor College of Medicine, Houston.
Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston.

Sasidhar Pasupuleti (S)

Department of Pediatrics, Baylor College of Medicine, Houston.
Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston.

Seon Young Kim (SY)

Department of Pediatrics, Baylor College of Medicine, Houston.
Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston.

Lucian Li (L)

Department of Pediatrics, Baylor College of Medicine, Houston.
Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston.

Jill A Rosenfeld (JA)

Department of Molecular and Human Genetics, Baylor College of Medicine, Houston.

Linyan Meng (L)

Department of Molecular and Human Genetics, Baylor College of Medicine, Houston.
Baylor Genetics, Houston7.

Lindsay C Burrage (LC)

Department of Molecular and Human Genetics, Baylor College of Medicine, Houston.

Michael F Wangler (MF)

Department of Molecular and Human Genetics, Baylor College of Medicine, Houston.
Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston.

Shinya Yamamoto (S)

Department of Molecular and Human Genetics, Baylor College of Medicine, Houston.
Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston.

Christine M Eng (CM)

Department of Molecular and Human Genetics, Baylor College of Medicine, Houston.
Baylor Genetics, Houston7.

Brendan Lee (B)

Department of Molecular and Human Genetics, Baylor College of Medicine, Houston.

Bo Yuan (B)

Department of Molecular and Human Genetics, Baylor College of Medicine, Houston.
Human Genome Sequencing Center, Baylor College of Medicine, Houston.

Fan Xia (F)

Department of Molecular and Human Genetics, Baylor College of Medicine, Houston.
Baylor Genetics, Houston7.

Hugo J Bellen (HJ)

Department of Molecular and Human Genetics, Baylor College of Medicine, Houston.
Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston.
Department of Neuroscience, Baylor College of Medicine, Houston.

Pengfei Liu (P)

Department of Molecular and Human Genetics, Baylor College of Medicine, Houston.
Baylor Genetics, Houston7.

Zhandong Liu (Z)

Department of Pediatrics, Baylor College of Medicine, Houston.
Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston.

Classifications MeSH