Phenotyping coronavirus disease 2019 during a global health pandemic: Lessons learned from the characterization of an early cohort.
Controlled terminologies and vocabularies
Data management
Phenomics
Phenotype
Journal
Journal of biomedical informatics
ISSN: 1532-0480
Titre abrégé: J Biomed Inform
Pays: United States
ID NLM: 100970413
Informations de publication
Date de publication:
05 2021
05 2021
Historique:
received:
16
10
2020
revised:
09
02
2021
accepted:
03
04
2021
pubmed:
11
4
2021
medline:
10
7
2021
entrez:
10
4
2021
Statut:
ppublish
Résumé
From the start of the coronavirus disease 2019 (COVID-19) pandemic, researchers have looked to electronic health record (EHR) data as a way to study possible risk factors and outcomes. To ensure the validity and accuracy of research using these data, investigators need to be confident that the phenotypes they construct are reliable and accurate, reflecting the healthcare settings from which they are ascertained. We developed a COVID-19 registry at a single academic medical center and used data from March 1 to June 5, 2020 to assess differences in population-level characteristics in pandemic and non-pandemic years respectively. Median EHR length, previously shown to impact phenotype performance in type 2 diabetes, was significantly shorter in the SARS-CoV-2 positive group relative to a 2019 influenza tested group (median 3.1 years vs 8.7; Wilcoxon rank sum P = 1.3e-52). Using three phenotyping methods of increasing complexity (billing codes alone and domain-specific algorithms provided by an EHR vendor and clinical experts), common medical comorbidities were abstracted from COVID-19 EHRs, defined by the presence of a positive laboratory test (positive predictive value 100%, recall 93%). After combining performance data across phenotyping methods, we observed significantly lower false negative rates for those records billed for a comprehensive care visit (p = 4e-11) and those with complete demographics data recorded (p = 7e-5). In an early COVID-19 cohort, we found that phenotyping performance of nine common comorbidities was influenced by median EHR length, consistent with previous studies, as well as by data density, which can be measured using portable metrics including CPT codes. Here we present those challenges and potential solutions to creating deeply phenotyped, acute COVID-19 cohorts.
Identifiants
pubmed: 33838341
pii: S1532-0464(21)00106-4
doi: 10.1016/j.jbi.2021.103777
pmc: PMC8026248
pii:
doi:
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Langues
eng
Sous-ensembles de citation
IM
Pagination
103777Subventions
Organisme : NIGMS NIH HHS
ID : R01 GM139891
Pays : United States
Organisme : NLM NIH HHS
ID : R01 LM010685
Pays : United States
Organisme : NHLBI NIH HHS
ID : R01 HL140074
Pays : United States
Organisme : NHGRI NIH HHS
ID : U01 HG011166
Pays : United States
Organisme : NIGMS NIH HHS
ID : R25 GM062459
Pays : United States
Organisme : NHLBI NIH HHS
ID : R01 HL133786
Pays : United States
Organisme : NHGRI NIH HHS
ID : U01 HG011181
Pays : United States
Informations de copyright
Copyright © 2021 Elsevier Inc. All rights reserved.
Références
N Engl J Med. 2020 May 7;382(19):1787-1799
pubmed: 32187464
J Am Med Inform Assoc. 2018 Mar 1;25(3):289-294
pubmed: 29040596
J Biomed Inform. 2014 Dec;52:28-35
pubmed: 24534443
J Am Med Inform Assoc. 2013 Jan 1;20(1):117-21
pubmed: 22955496
N Engl J Med. 2020 Mar 5;382(10):929-936
pubmed: 32004427
J Am Med Inform Assoc. 2016 Nov;23(6):1046-1052
pubmed: 27026615
J Biomed Inform. 2020 Apr;104:103362
pubmed: 31866434
Appl Clin Inform. 2020 Mar;11(2):265-275
pubmed: 32268390
J Am Med Inform Assoc. 2020 Jun 1;27(6):853-859
pubmed: 32208481
J Am Med Inform Assoc. 2020 Jan 1;27(1):119-126
pubmed: 31722396
Nat Biotechnol. 2013 Dec;31(12):1102-10
pubmed: 24270849
N Engl J Med. 2020 Apr 30;382(18):1708-1720
pubmed: 32109013
J Am Med Inform Assoc. 2020 Nov 1;27(11):1675-1687
pubmed: 32974638
Lancet. 2020 Mar 28;395(10229):1054-1062
pubmed: 32171076
Genet Med. 2013 Oct;15(10):761-71
pubmed: 23743551
NPJ Digit Med. 2019 Sep 6;2:88
pubmed: 31508498
J Am Med Inform Assoc. 2021 Mar 1;28(3):427-443
pubmed: 32805036
J Am Med Inform Assoc. 2016 Mar;23(2):407-12
pubmed: 26174867
Sci Transl Med. 2011 Apr 20;3(79):79re1
pubmed: 21508311
J Biomed Inform. 2008 Feb;41(1):1-14
pubmed: 17625974
J Am Med Inform Assoc. 2016 Apr;23(e1):e20-7
pubmed: 26338219
J Am Med Inform Assoc. 2012 Mar-Apr;19(2):219-24
pubmed: 22249968
NPJ Digit Med. 2020 Aug 19;3:109
pubmed: 32864472
AMIA Annu Symp Proc. 2020 Mar 04;2019:755-764
pubmed: 32308871
Int J Med Inform. 2013 Apr;82(4):239-47
pubmed: 22762862
Clin Pharmacol Ther. 2008 Sep;84(3):362-9
pubmed: 18500243
J Biomed Inform. 2020 May;105:103433
pubmed: 32335224
PLoS One. 2017 Jul 7;12(7):e0175508
pubmed: 28686612
Am J Hum Genet. 2011 Oct 7;89(4):529-42
pubmed: 21981779
J Am Med Inform Assoc. 2013 Dec;20(e2):e206-11
pubmed: 24302669
Genet Med. 2019 Jul;21(7):1585-1593
pubmed: 30514889
J Biomed Inform. 2010 Oct;43(5):747-51
pubmed: 20363365