Privacy-protecting, reliable response data discovery using COVID-19 patient observations.
COVID-19
common data elements
electronic health record
observational study
regression analysis
Journal
Journal of the American Medical Informatics Association : JAMIA
ISSN: 1527-974X
Titre abrégé: J Am Med Inform Assoc
Pays: England
ID NLM: 9430800
Informations de publication
Date de publication:
30 07 2021
30 07 2021
Historique:
received:
06
10
2020
revised:
28
12
2020
accepted:
17
03
2021
pubmed:
30
5
2021
medline:
19
8
2021
entrez:
29
5
2021
Statut:
ppublish
Résumé
To utilize, in an individual and institutional privacy-preserving manner, electronic health record (EHR) data from 202 hospitals by analyzing answers to COVID-19-related questions and posting these answers online. We developed a distributed, federated network of 12 health systems that harmonized their EHRs and submitted aggregate answers to consortia questions posted at https://www.covid19questions.org. Our consortium developed processes and implemented distributed algorithms to produce answers to a variety of questions. We were able to generate counts, descriptive statistics, and build a multivariate, iterative regression model without centralizing individual-level data. Our public website contains answers to various clinical questions, a web form for users to ask questions in natural language, and a list of items that are currently pending responses. The results show, for example, that patients who were taking angiotensin-converting enzyme inhibitors and angiotensin II receptor blockers, within the year before admission, had lower unadjusted in-hospital mortality rates. We also showed that, when adjusted for, age, sex, and ethnicity were not significantly associated with mortality. We demonstrated that it is possible to answer questions about COVID-19 using EHR data from systems that have different policies and must follow various regulations, without moving data out of their health systems. We present an alternative or a complement to centralized COVID-19 registries of EHR data. We can use multivariate distributed logistic regression on observations recorded in the process of care to generate results without transferring individual-level data outside the health systems.
Identifiants
pubmed: 34051088
pii: 6288530
doi: 10.1093/jamia/ocab054
pmc: PMC8194878
doi:
Types de publication
Journal Article
Multicenter Study
Observational Study
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
1765-1776Subventions
Organisme : NIGMS NIH HHS
ID : R01 GM118609
Pays : United States
Organisme : NIH HHS
ID : R01GM118609
Pays : United States
Organisme : NIH HHS
ID : T15LM011271
Pays : United States
Commentaires et corrections
Type : UpdateOf
Type : ErratumIn
Informations de copyright
© The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association.