Phenotypic signatures in clinical data enable systematic identification of patients for genetic testing.


Journal

Nature medicine
ISSN: 1546-170X
Titre abrégé: Nat Med
Pays: United States
ID NLM: 9502015

Informations de publication

Date de publication:
06 2021
Historique:
received: 12 08 2020
accepted: 16 04 2021
pubmed: 5 6 2021
medline: 20 8 2021
entrez: 4 6 2021
Statut: ppublish

Résumé

Around 5% of the population is affected by a rare genetic disease, yet most endure years of uncertainty before receiving a genetic test. A common feature of genetic diseases is the presence of multiple rare phenotypes that often span organ systems. Here, we use diagnostic billing information from longitudinal clinical data in the electronic health records (EHRs) of 2,286 patients who received a chromosomal microarray test, and 9,144 matched controls, to build a model to predict who should receive a genetic test. The model achieved high prediction accuracies in a held-out test sample (area under the receiver operating characteristic curve (AUROC), 0.97; area under the precision-recall curve (AUPRC), 0.92), in an independent hospital system (AUROC, 0.95; AUPRC, 0.62), and in an independent set of 172,265 patients in which cases were broadly defined as having an interaction with a genetics provider (AUROC, 0.9; AUPRC, 0.63). Patients carrying a putative pathogenic copy number variant were also accurately identified by the model. Compared with current approaches for genetic test determination, our model could identify more patients for testing while also increasing the proportion of those tested who have a genetic disease. We demonstrate that phenotypic patterns representative of a wide range of genetic diseases can be captured from EHRs to systematize decision-making for genetic testing, with the potential to speed up diagnosis, improve care and reduce costs.

Identifiants

pubmed: 34083811
doi: 10.1038/s41591-021-01356-z
pii: 10.1038/s41591-021-01356-z
pmc: PMC8981189
mid: NIHMS1724490
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

1097-1104

Subventions

Organisme : NCATS NIH HHS
ID : UL1 TR000445
Pays : United States
Organisme : NHLBI NIH HHS
ID : U19 HL065962
Pays : United States
Organisme : NLM NIH HHS
ID : R01 LM010685
Pays : United States
Organisme : NCRR NIH HHS
ID : S10 RR025141
Pays : United States
Organisme : NICHD NIH HHS
ID : R01 HD074711
Pays : United States
Organisme : NIGMS NIH HHS
ID : RC2 GM092618
Pays : United States
Organisme : NIGMS NIH HHS
ID : P50 GM115305
Pays : United States
Organisme : NIMH NIH HHS
ID : R01 MH111776
Pays : United States
Organisme : NHGRI NIH HHS
ID : U01 HG006378
Pays : United States
Organisme : NHGRI NIH HHS
ID : L30 HG009068
Pays : United States
Organisme : NHGRI NIH HHS
ID : U01 HG004798
Pays : United States
Organisme : NHGRI NIH HHS
ID : U01 HG009086
Pays : United States
Organisme : NIMH NIH HHS
ID : R01 MH113362
Pays : United States
Organisme : NCATS NIH HHS
ID : UL1 TR002243
Pays : United States
Organisme : NCRR NIH HHS
ID : UL1 RR024975
Pays : United States
Organisme : Wellcome Trust
Pays : United Kingdom
Organisme : NINDS NIH HHS
ID : R01 NS032830
Pays : United States

Références

Nguengang Wakap, S. et al. Estimating cumulative point prevalence of rare diseases: analysis of the Orphanet database. Eur. J. Hum. Genet. 28, 165–173 (2020).
doi: 10.1038/s41431-019-0508-0
Ferreira, C. R. The burden of rare diseases. Am. J. Med. Genet. A 179, 885–892 (2019).
doi: 10.1002/ajmg.a.61124
Rosenthal, E. T., Biesecker, L. G. & Biesecker, B. B. Parental attitudes toward a diagnosis in children with unidentified multiple congenital anomaly syndromes. Am. J. Med. Genet. 103, 106–114 (2001).
doi: 10.1002/ajmg.1527
About Rare Diseases (Orphanet, accessed June 2020); https://www.orpha.net/consor/cgi-bin/Education_AboutRareDiseases.php?lng=EN
About Rare Diseases (EURORDIS Rare Diseases Europe, accessed June 2020); https://www.eurordis.org/about-rare-diseases
Suther, S. & Kiros, G.-E. Barriers to the use of genetic testing: a study of racial and ethnic disparities. Genet. Med. 11, 655–662 (2009).
doi: 10.1097/GIM.0b013e3181ab22aa
Noll, A. et al. Barriers to Lynch syndrome testing and preoperative result availability in early-onset colorectal cancer: a national physician survey study. Clin. Transl. Gastroenterol. 9, 185 (2018).
doi: 10.1038/s41424-018-0047-y
Moreno-de-Luca, D. et al. Clinical genetic testing in autism spectrum disorder in a large community-based population sample. JAMA Psychiatry 77, 979–981 (2020).
doi: 10.1001/jamapsychiatry.2020.0950
OMIM: Online Mendelian Inheritance in Man (Johns Hopkins University, accessed June 2020); https://omim.org
McKusick, V. A. Mendelian Inheritance in Man and its online version, OMIM. Am. J. Hum. Genet. 80, 588–604 (2007).
doi: 10.1086/514346
Cooper, D. N., Krawczak, M., Polychronakos, C., Tyler-Smith, C. & Kehrer-Sawatzki, H. Where genotype is not predictive of phenotype: towards an understanding of the molecular basis of reduced penetrance in human inherited disease. Hum. Genet. 132, 1077–1130 (2013).
doi: 10.1007/s00439-013-1331-2
Girirajan, S. et al. Phenotypic heterogeneity of genomic disorders and rare copy-number variants. N. Engl. J. Med. 367, 1321–1331 (2012).
doi: 10.1056/NEJMoa1200395
Posey, J. E. et al. Resolution of disease phenotypes resulting from multilocus genomic variation. N. Engl. J. Med. 376, 21–31 (2017).
doi: 10.1056/NEJMoa1516767
Goldstein, B. A., Navar, A. M., Pencina, M. J. & Ioannidis, J. P. A. Opportunities and challenges in developing risk prediction models with electronic health records data: a systematic review. J. Am. Med. Inform. Assoc. 24, 198–208 (2017).
doi: 10.1093/jamia/ocw042
Bastarache, L. et al. Improving the phenotype risk score as a scalable approach to identifying patients with Mendelian disease. J. Am. Med. Inform. Assoc. 26, 1437–1447 (2019).
doi: 10.1093/jamia/ocz179
Bastarache, L. et al. Phenotype risk scores identify patients with unrecognized Mendelian disease patterns. Science 359, 1233–1239 (2018).
doi: 10.1126/science.aal4043
Denny, J. C. et al. Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data. Nat. Biotechnol. 31, 1102–1111 (2013).
doi: 10.1038/nbt.2749
Firth, H. V. et al. DECIPHER: Database of Chromosomal Imbalance and Phenotype in Humans Using Ensembl Resources. Am. J. Hum. Genet. 84, 524–533 (2009).
doi: 10.1016/j.ajhg.2009.03.010
Corbett-Davies, S. & Goel, S. The measure and mismeasure of fairness: a critical review of fair machine learning. Preprint at arXiv https://arxiv.org/abs/1808.00023 (2018).
Lundberg, S. M. et al. From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2, 56–67 (2020).
doi: 10.1038/s42256-019-0138-9
McInnes, L., Healy, J. & Melville, J. UMAP: uniform manifold approximation and projection for Dimension reduction. Preprint at arXiv https://arxiv.org/abs/1802.03426 (2018).
Brokamp, C. et al. Material community deprivation and hospital utilization during the first year of life: an urban population-based cohort study. Ann. Epidemiol. 30, 37–43 (2019).
doi: 10.1016/j.annepidem.2018.11.008
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
doi: 10.1023/A:1010933404324
Wang, K. et al. PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. Genome Res. 17, 1665–1674 (2007).
doi: 10.1101/gr.6861907
Diskin, S. J. et al. Adjustment of genomic waves in signal intensities from whole-genome SNP genotyping platforms. Nucleic Acids Res. 36, e126 (2008).
doi: 10.1093/nar/gkn556
Amemiya, H. M., Kundaje, A. & Boyle, A. P. The ENCODE blacklist: identification of problematic regions of the genome. Sci. Rep. 9, 9354 (2019).
doi: 10.1038/s41598-019-45839-z

Auteurs

Theodore J Morley (TJ)

Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA.
Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA.

Lide Han (L)

Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA.
Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA.

Victor M Castro (VM)

Center for Quantitative Health, Division of Clinical Research, Massachusetts General Hospital, Boston, MA, USA.

Jonathan Morra (J)

Zefr, Los Angeles, CA, USA.

Roy H Perlis (RH)

Center for Quantitative Health, Division of Clinical Research, Massachusetts General Hospital, Boston, MA, USA.

Nancy J Cox (NJ)

Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA.
Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA.

Lisa Bastarache (L)

Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA.
Center for Precision Medicine, Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA.

Douglas M Ruderfer (DM)

Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA. douglas.ruderfer@vanderbilt.edu.
Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA. douglas.ruderfer@vanderbilt.edu.
Center for Precision Medicine, Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA. douglas.ruderfer@vanderbilt.edu.
Department of Psychiatry and Behavioral Sciences, Vanderbilt University Medical Center, Nashville, TN, USA. douglas.ruderfer@vanderbilt.edu.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH