Phenotypic similarity-based approach for variant prioritization for unsolved rare disease: a preliminary methodological report.


Journal

European journal of human genetics : EJHG
ISSN: 1476-5438
Titre abrégé: Eur J Hum Genet
Pays: England
ID NLM: 9302235

Informations de publication

Date de publication:
06 Nov 2023
Historique:
received: 17 05 2023
accepted: 05 10 2023
revised: 13 09 2023
medline: 6 11 2023
pubmed: 6 11 2023
entrez: 5 11 2023
Statut: aheadofprint

Résumé

Rare diseases (RD) have a prevalence of not more than 1/2000 persons in the European population, and are characterised by the difficulty experienced in obtaining a correct and timely diagnosis. According to Orphanet, 72.5% of RD have a genetic origin although 35% of them do not yet have an identified causative gene. A significant proportion of patients suspected to have a genetic RD receive an inconclusive exome/genome sequencing. Working towards the International Rare Diseases Research Consortium (IRDiRC)'s goal for 2027 to ensure that all people living with a RD receive a diagnosis within one year of coming to medical attention, the Solve-RD project aims to identify the molecular causes underlying undiagnosed RD. As part of this strategy, we developed a phenotypic similarity-based variant prioritization methodology comparing submitted cases with other submitted cases and with known RD in Orphanet. Three complementary approaches based on phenotypic similarity calculations using the Human Phenotype Ontology (HPO), the Orphanet Rare Diseases Ontology (ORDO) and the HPO-ORDO Ontological Module (HOOM) were developed; genomic data reanalysis was performed by the RD-Connect Genome-Phenome Analysis Platform (GPAP). The methodology was tested in 4 exemplary cases discussed with experts from European Reference Networks. Variants of interest (pathogenic or likely pathogenic) were detected in 8.8% of the 725 cases clustered by similarity calculations. Diagnostic hypotheses were validated in 42.1% of them and needed further exploration in another 10.9%. Based on the promising results, we are devising an automated standardized phenotypic-based re-analysis pipeline to be applied to the entire unsolved cases cohort.

Identifiants

pubmed: 37926714
doi: 10.1038/s41431-023-01486-7
pii: 10.1038/s41431-023-01486-7
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : European Commission (EC)
ID : 779257
Organisme : European Commission (EC)
ID : 779257
Organisme : European Commission (EC)
ID : 779257
Organisme : European Commission (EC)
ID : 779257
Organisme : European Commission (EC)
ID : 779257
Organisme : European Commission (EC)
ID : 779257
Organisme : European Commission (EC)
ID : 779257
Organisme : European Commission (EC)
ID : 779257
Organisme : European Commission (EC)
ID : 779257
Organisme : European Commission (EC)
ID : 779257
Organisme : European Commission (EC)
ID : 779257
Organisme : European Commission (EC)
ID : 779257
Organisme : European Commission (EC)
ID : 779257
Organisme : European Commission (EC)
ID : 779257

Informations de copyright

© 2023. The Author(s).

Références

European Union. Regulation (EC) N°141/2000 of the European Parliament and of the Council of 16 December 1999 on orphan medicinal products. 2000 https://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=OJ:L:2000:018:0001:0005:en:PDF .
Nguengang Wakap S, Lambert DM, Olry A, Rodwell C, Gueydan C, Lanneau V, et al. Estimating cumulative point prevalence of rare diseases: analysis of the Orphanet database. Eur J Hum Genet. 2020;28:165–73.
doi: 10.1038/s41431-019-0508-0 pubmed: 31527858
Zanello G, Chan CH, Pearce DA. Recommendations from the IRDiRC Working Group on methodologies to assess the impact of diagnoses and therapies on rare disease patients. Orphanet J Rare Dis. 2022;17:181.
doi: 10.1186/s13023-022-02337-2 pubmed: 35526001 pmcid: 9078009
Recommendations from The Rare 2030 - Foresight Study The Future Of Rare Diseases Starts Today - February 2021 - Rare2030.Eu/Recommendations
RD-ACTION Work Package 5 - Milestone 24 - Specifications for an integrated coding application with Orphacodes. 2016 http://www.rd-action.eu/wp-content/uploads/2016/11/Milestone-24_05.10.2016.pdf .
Solve-RD - solving the unsolved rare diseases. https://solve-rd.eu .
Zurek B, Ellwanger K, Vissers LELM, Schüle R, Synofzik M, Töpf A, et al. Solve-RD: systematic pan-European data sharing and collaborative analysis to solve rare diseases. Eur J Hum Genet. 2021;29:1325–31.
doi: 10.1038/s41431-021-00859-0 pubmed: 34075208 pmcid: 8440542
Matalonga L, Hernández-Ferrer C, Piscia D, Solve-RD SNV-indel working group, Schüle R, Synofzik M, et al. Solving patients with rare diseases through programmatic reanalysis of genome-phenome data. Eur J Hum Genet. 2021;29:1337–47.
doi: 10.1038/s41431-021-00852-7 pubmed: 34075210 pmcid: 8440686
RD-Code consensus document on codification of suspected/undiagnosed rare diseases. 2021. http://www.rd-code.eu/wp-content/uploads/2022/02/D5.2_RDCODE_VF2021_FV.pdf .
BTumiene H, Graessner IM, Mathijssen AM, Pereira F, Schaefer M, Scarpa J-Y, Blay, et al. European Reference Networks: challenges and opportunities. J Community Genet. 2021;12:217–29. https://doi.org/10.1007/s12687-021-00521-8 .
doi: 10.1007/s12687-021-00521-8
Fujiwara T, Yamamoto Y, Kim JD, Buske O, Takagi T. PubCaseFinder: A Case-Report-Based, Phenotype-Driven Differential-Diagnosis System for Rare Diseases. Am J Hum Genet. 2018;103:389.
doi: 10.1016/j.ajhg.2018.08.003 pubmed: 30173820 pmcid: 6128307
Hartley T, Lemire G, Kernohan KD, Howley HE, Adams DR, Boycott KM. New Diagnostic Approaches for Undiagnosed Rare Genetic Diseases. Annu Rev Genomics Hum Genet. 2020;21:351–72. https://doi.org/10.1146/annurev-genom-083118-015345 .
doi: 10.1146/annurev-genom-083118-015345 pubmed: 32283948
Jacobsen JOB, Kelly C, Cipriani V, Research Consortium GE, Mungall CJ, Reese J, et al. Phenotype-driven approaches to enhance variant prioritization and diagnosis of rare disease. Hum Mutat. 2022;43:1071–81. https://doi.org/10.1002/humu.24380 .
doi: 10.1002/humu.24380 pubmed: 35391505 pmcid: 9288531
Kelly C, Szabo A, Pontikos N, Arno G, Robinson PN, Jacobsen JOB, et al. Phenotype-aware prioritisation of rare Mendelian disease variants. Trends Genet. 2022;38:1271–83. https://doi.org/10.1016/j.tig.2022.07.002 .
doi: 10.1016/j.tig.2022.07.002 pubmed: 35934592 pmcid: 9950798
Dingemans AJM, Hinne M, Truijen KMG, Goltstein L, van Reeuwijk J, de Leeuw N, et al. PhenoScore quantifies phenotypic variation for rare genetic diseases by combining facial analysis with other clinical features using a machine-learning framework. Nat Genet. 2023. https://doi.org/10.1038/s41588-023-01469-w . Online ahead of print.
Birgmeier J, Haeussler M, Deisseroth CA, Steinberg EH, Jagadeesh KA, Ratner AJ, et al. AMELIE speeds Mendelian diagnosis by matching patient phenotype and genotype to primary literature. Sci Transl Med. 2020;12:eaau9113.
doi: 10.1126/scitranslmed.aau9113 pubmed: 32434849 pmcid: 9366928
Li Q, Zhao K, Bustamante CD, Ma X, Wong WH. Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis. Genet Med. 2019;21:2126–34. https://doi.org/10.1038/s41436-019-0439-8 .
doi: 10.1038/s41436-019-0439-8 pubmed: 30675030 pmcid: 6752318
Robinson PN, Ravanmehr V, Jacobsen JOB, Danis D, Zhang XA, Carmody LC, et al. Interpretable clinical genomics with a likelihood ratio paradigm. Am J Hum Genet. 2020;107:403–17.
doi: 10.1016/j.ajhg.2020.06.021 pubmed: 32755546 pmcid: 7477017
Zhao M, Havrilla JM, Fang L, Chen Y, Peng J, Liu C, et al. Phen2Gene: rapid phenotype-driven gene prioritization for rare diseases. NAR Genomics Bioinforma. 2020;2:lqaa032 https://doi.org/10.1093/nargab/lqaa032 .
doi: 10.1093/nargab/lqaa032
Zhai W, Huang X, Shen N, Zhu S. Phen2Disease: a phenotype-driven model for disease and gene prioritization by bidirectional maximum matching semantic similarities. Brief Bioinforma. 2023;24:bbad172 https://doi.org/10.1093/bib/bbad172 .
doi: 10.1093/bib/bbad172
Laurie S, Fernandez-Callejo M, Marco-Sola S, Trotta JR, Camps J, Chacón A, et al. From Wet-Lab to Variations: Concordance and Speed of Bioinformatics Pipelines for Whole Genome and Whole Exome Sequencing. Hum Mutat. 2016;37:1263–71. https://doi.org/10.1002/humu.23114 .
doi: 10.1002/humu.23114 pubmed: 27604516 pmcid: 5129537
Robinson PN, Köhler S, Bauer S, Seelow D, Horn D, Mundlos S. The Human Phenotype Ontology: A Tool for Annotating and Analyzing Human Hereditary Disease. Am J Hum Genet. 2008;83:610–5.
doi: 10.1016/j.ajhg.2008.09.017 pubmed: 18950739 pmcid: 2668030
What Is The Orphanet Rare Disease Ontology (ORDO)? https://www.orphadata.com/docs/WhatIsORDO.pdf .
What Is Hoom (The Hpo-Ordo Ontological Module)? https://www.orphadata.com/docs/WhatIsHOOM.pdf .
Lappalainen I, Almeida-King J, Kumanduri V, Senf A, Spalding JD, Ur-Rehman S, et al. The European Genome-phenome Archive of human data consented for biomedical research. Nat Genet. 2015;47:692–5. https://doi.org/10.1038/ng.3312
doi: 10.1038/ng.3312 pubmed: 26111507 pmcid: 5426533
van der Velde KJ, Imhann F, Charbon B, Pang C, van Enckevort D, Slofstra M, et al. MOLGENIS research: advanced bioinformatics data software for non-bioinformaticians. Bioinformatics. 2019;35:1076–8. https://doi.org/10.1093/bioinformatics/bty742 .
doi: 10.1093/bioinformatics/bty742 pubmed: 30165396
The Global Alliance for Genomics and Health. https://www.ga4gh.org .
Phenopackets - Concepts and Technology. http://phenopackets.org/ .
Resnik P. Using Information Content to Evaluate Semantic Similarity in a Taxonomy. Proceedings of the 14th International Joint Conference on Artificial Intelligence (1995) https://arxiv.org/pdf/cmp-lg/9511007.pdf .
Pesquita C, Faria D, Falcão AO, Lord P, Couto FM. Semantic similarity in biomedical ontologies. PLoS Comput Biol 2009;5:e1000443.
doi: 10.1371/journal.pcbi.1000443 pubmed: 19649320 pmcid: 2712090
Köhler S, Schulz MH, Krawitz P, Bauer S, Dölken S, Ott CE, et al. Clinical Diagnostics in Human Genetics with Semantic Similarity Searches in Ontologies. Am J Hum Genet. 85, 457–64.
Köhler S. Improved ontology-based similarity calculations using a study-wise annotation model. Database (Oxford). 2018;2018:bay026.
Bauer S, Köhler S, Schulz MH, Robinson PN. Bayesian ontology querying for accurate and noise-tolerant semantic searches. Bioinformatics. 2012;28:2502–8.
doi: 10.1093/bioinformatics/bts471 pubmed: 22843981 pmcid: 3463114
Smedley D, Oellrich A, Köhler S, Ruef B, Westerfield M, Robinson P, et al. PhenoDigm: analyzing curated annotations to associate animal models with human diseases. Database (Oxford). 2013;2013:bat025.
Köhler S, Havrylenko S, Adaptation of BOQA algorithm to its use in the ontology of unsolved rare diseases. Solve-RD D1.10 Deliverable https://solve-rd.eu/wp-content/uploads/2021/11/D1.10-Adaptation-of-BOQA-algorithm-to-its-use-in-the-on-tology-of-unsolved-RD_public.pdf .
Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, et al. ACMG Laboratory Quality Assurance Committee. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med 2015;17:405–24. https://doi.org/10.1038/gim.2015.30 .
doi: 10.1038/gim.2015.30 pubmed: 25741868 pmcid: 4544753
Reid E, Kloos M, Ashley-Koch A, Hughes L, Bevan S, Svenson IK. et al. A Kinesin Heavy Chain (KIF5A) Mutation in Hereditary Spastic Paraplegia (SPG10). Am J Hum Genet.2002;71:1189 https://doi.org/10.1086/344210 .
doi: 10.1086/344210 pubmed: 12355402 pmcid: 385095
Maekawa M, Sudo K, Kanno T, Li SS. Molecular characterization of genetic mutation in human lactate dehydrogenase-A (M) deficiency. Biochem Biophys Res Commun. 1990;168:677–82. https://doi.org/10.1016/0006-291x(90)92374-9 .
doi: 10.1016/0006-291x(90)92374-9 pubmed: 2334430
Sakai N, Inui K, Fujii N, Fukushima H, Nishimoto J, Yanagihara I, et al. Krabbe disease: isolation and characterization of a full-length cDNA for human galactocerebrosidase. Biochem Biophys Res Commun. 1994;198:485–91. https://doi.org/10.1006/bbrc.1994.1071 .
doi: 10.1006/bbrc.1994.1071 pubmed: 8297359
Daud D, Griffin H, Douroudis K, Kleinle S, Eglon G, Pyle A, et al. Whole exome sequencing and the clinician: we need clinical skills and functional validation in variant filtering. J Neurol 2015;262:1673–7. https://doi.org/10.1007/s00415-015-7755-y .
doi: 10.1007/s00415-015-7755-y pubmed: 25957632 pmcid: 4503877
Johnson JO, Mandrioli J, Benatar M, Abramzon Y, Van Deerlin VM, Trojanowski, et al. Exome sequencing reveals VCP mutations as a cause of familial ALS. Neuron. 2010;68:857–64. https://doi.org/10.1016/j.neuron.2010.11.036 .
doi: 10.1016/j.neuron.2010.11.036 pubmed: 21145000 pmcid: 3032425
Smedley D, Smith KR, Martin A, Thomas EA, Mcdonagh EM, Cipriani V, et al. The 100,000 genomes pilot on rare disease diagnosis in healthcare—A preliminary report. N. Engl J Med. 2021;385:1868–80.
doi: 10.1056/NEJMoa2035790 pubmed: 34758253
Franz M, Lopes CT, Huck G, Dong Y, Sumer O, Bader GD. Cytoscape.js: a graph theory library for visualisation and analysis. Bioinformatics 2016;32:309–11. https://doi.org/10.1093/bioinformatics/btv557 .
doi: 10.1093/bioinformatics/btv557 pubmed: 26415722

Auteurs

David Lagorce (D)

INSERM, US14 - Orphanet, Plateforme Maladies Rares, 75014, Paris, France. david.lagorce@inserm.fr.

Emeline Lebreton (E)

INSERM, US14 - Orphanet, Plateforme Maladies Rares, 75014, Paris, France.

Leslie Matalonga (L)

CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Baldiri Reixac 4, Barcelona, 08028, Spain.

Oscar Hongnat (O)

INSERM, US14 - Orphanet, Plateforme Maladies Rares, 75014, Paris, France.

Maroua Chahdil (M)

INSERM, US14 - Orphanet, Plateforme Maladies Rares, 75014, Paris, France.

Davide Piscia (D)

CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Baldiri Reixac 4, Barcelona, 08028, Spain.

Ida Paramonov (I)

CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Baldiri Reixac 4, Barcelona, 08028, Spain.

Kornelia Ellwanger (K)

Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany.
Centre for Rare Diseases, University of Tübingen, Tübingen, Germany.

Sebastian Köhler (S)

Ada Health GmbH, Berlin, Germany.

Peter Robinson (P)

The Jackson Laboratory for Genomic Medicine, Farmington, CT, 06032, USA.

Holm Graessner (H)

Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany.
Centre for Rare Diseases, University of Tübingen, Tübingen, Germany.

Sergi Beltran (S)

CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Baldiri Reixac 4, Barcelona, 08028, Spain.

Caterina Lucano (C)

INSERM, US14 - Orphanet, Plateforme Maladies Rares, 75014, Paris, France.

Marc Hanauer (M)

INSERM, US14 - Orphanet, Plateforme Maladies Rares, 75014, Paris, France.

Ana Rath (A)

INSERM, US14 - Orphanet, Plateforme Maladies Rares, 75014, Paris, France.

Classifications MeSH