Towards cross-application model-agnostic federated cohort discovery.

clinical trials cohort discovery electronic health records interoperability

Journal

Journal of the American Medical Informatics Association : JAMIA
ISSN: 1527-974X
Titre abrégé: J Am Med Inform Assoc
Pays: England
ID NLM: 9430800

Informations de publication

Date de publication:
07 Aug 2024
Historique:
received: 13 05 2024
revised: 16 07 2024
accepted: 25 07 2024
medline: 7 8 2024
pubmed: 7 8 2024
entrez: 7 8 2024
Statut: aheadofprint

Résumé

To demonstrate that 2 popular cohort discovery tools, Leaf and the Shared Health Research Information Network (SHRINE), are readily interoperable. Specifically, we adapted Leaf to interoperate and function as a node in a federated data network that uses SHRINE and dynamically generate queries for heterogeneous data models. SHRINE queries are designed to run on the Informatics for Integrating Biology & the Bedside (i2b2) data model. We created functionality in Leaf to interoperate with a SHRINE data network and dynamically translate SHRINE queries to other data models. We randomly selected 500 past queries from the SHRINE-based national Evolve to Next-Gen Accrual to Clinical Trials (ENACT) network for evaluation, and an additional 100 queries to refine and debug Leaf's translation functionality. We created a script for Leaf to convert the terms in the SHRINE queries into equivalent structured query language (SQL) concepts, which were then executed on 2 other data models. 91.1% of the generated queries for non-i2b2 models returned counts within 5% (or ±5 patients for counts under 100) of i2b2, with 91.3% recall. Of the 8.9% of queries that exceeded the 5% margin, 77 of 89 (86.5%) were due to errors introduced by the Python script or the extract-transform-load process, which are easily fixed in a production deployment. The remaining errors were due to Leaf's translation function, which was later fixed. Our results support that cohort discovery applications such as Leaf and SHRINE can interoperate in federated data networks with heterogeneous data models.

Identifiants

pubmed: 39110920
pii: 7729230
doi: 10.1093/jamia/ocae211
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : NCATS NIH HHS
ID : U24 TR004111
Pays : United States

Informations de copyright

© The Author(s) 2024. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Auteurs

Nicholas J Dobbins (NJ)

Department of Biomedical Informatics & Medical Education, University of Washington, Seattle, WA 98195, United States.
Biomedical Informatics & Data Science, Department of Medicine, Johns Hopkins University, Baltimore, MD 21218, United States.

Michele Morris (M)

Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15260, United States.

Eugene Sadhu (E)

Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15260, United States.

Douglas MacFadden (D)

Department of Biomedical Informatics, Harvard University, Boston, MA 02138, United States.

Marc-Danie Nazaire (MD)

Department of Biomedical Informatics, Harvard University, Boston, MA 02138, United States.

William Simons (W)

Department of Biomedical Informatics, Harvard University, Boston, MA 02138, United States.

Griffin Weber (G)

Department of Biomedical Informatics, Harvard University, Boston, MA 02138, United States.

Shawn Murphy (S)

Department of Biomedical Informatics, Harvard University, Boston, MA 02138, United States.

Shyam Visweswaran (S)

Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15260, United States.

Classifications MeSH