Representing and utilizing clinical textual data for real world studies: An OHDSI approach.


Journal

Journal of biomedical informatics
ISSN: 1532-0480
Abbreviated title: J Biomed Inform
Country: United States
NLM ID: 100970413

Publication information

Publication date:
06 2023
History:
received: 25 05 2022
revised: 21 01 2023
accepted: 13 03 2023
pmc-release: 01 06 2024
medline: 5 6 2023
pubmed: 20 3 2023
entrez: 19 3 2023
Status: ppublish

Abstract

Clinical documentation in electronic health records contains crucial narratives and details about patients and their care. Natural language processing (NLP) can unlock the information conveyed in clinical notes and reports, and thus plays a critical role in real-world studies. The NLP Working Group at the Observational Health Data Sciences and Informatics (OHDSI) consortium was established to develop methods and tools to promote the use of textual data and NLP in real-world observational studies. In this paper, we describe a framework for representing and utilizing textual data in real-world evidence generation, including representations of information from clinical text in the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM), the workflow and tools that were developed to extract, transform and load (ETL) data from clinical notes into tables in OMOP CDM, as well as current applications and specific use cases of the proposed OHDSI NLP solution at large consortia and individual institutions with English textual data. Challenges faced and lessons learned during the process are also discussed to provide valuable insights for researchers who are planning to implement NLP solutions in real-world studies.

Identifiers

pubmed: 36935011
pii: S1532-0464(23)00064-3
doi: 10.1016/j.jbi.2023.104343
pmc: PMC10428170
mid: NIHMS1897646

Publication types

Journal Article
Research Support, Non-U.S. Gov't
Research Support, N.I.H., Extramural

Languages

eng

Citation subsets

IM

Pagination

104343

Grants

Agency: NIA NIH HHS
ID: P30 AG059307
Country: United States
Agency: NLM NIH HHS
ID: R00 LM013001
Country: United States
Agency: NCATS NIH HHS
ID: U01 TR002062
Country: United States
Agency: NCCIH NIH HHS
ID: R01 AT009457
Country: United States
Agency: NCI NIH HHS
ID: P30 CA008748
Country: United States
Agency: NLM NIH HHS
ID: R01 LM006910
Country: United States
Agency: NCATS NIH HHS
ID: UL1 TR002494
Country: United States

Copyright information

Copyright © 2023 Elsevier Inc. All rights reserved.

Conflict of interest statement

Declaration of Competing Interest: Dr. Hua Xu and The University of Texas Health Science Center at Houston have research-related financial interests at Melax Technologies Inc. Dr. Xiaoyan Wang has related financial interests at Sema4 Mount Sinai Genomics Inc.

Authors

Vipina K Keloth (VK)

Section of Biomedical Informatics and Data Science, Yale School of Medicine, Yale University, New Haven, CT, USA.

Juan M Banda (JM)

Department of Computer Science, Georgia State University, Atlanta, GA, USA.

Michael Gurley (M)

Lurie Cancer Center, Northwestern University, Chicago, Illinois, USA.

Paul M Heider (PM)

Biomedical Informatics Center, Medical University of South Carolina, Charleston, SC, USA.

Georgina Kennedy (G)

Ingham Institute for Applied Medical Research, Sydney, Australia.

Hongfang Liu (H)

Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, USA.

Feifan Liu (F)

Department of Population and Quantitative Health Sciences, University of Massachusetts Chan Medical School, Worcester, MA, USA.

Timothy Miller (T)

Computational Health Informatics Program, Boston Children's Hospital, and Department of Pediatrics, Harvard Medical School, Boston, MA, USA.

Karthik Natarajan (K)

Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA.

Olga V Patterson (O)

VA Informatics and Computing Infrastructure, Department of Veterans Affairs Salt Lake City Health Care System, Salt Lake City, Utah, USA; Division of Epidemiology, Department of Internal Medicine, School of Medicine, University of Utah, Salt Lake City, Utah, USA; Verily Life Sciences, Mountain View, CA, USA.

Yifan Peng (Y)

Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA.

Kalpana Raja (K)

Section of Biomedical Informatics and Data Science, Yale School of Medicine, Yale University, New Haven, CT, USA.

Ruth M Reeves (RM)

TN Valley Healthcare System, U.S. Department of Veterans Affairs, Nashville, TN, USA; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA.

Masoud Rouhizadeh (M)

Department of Pharmaceutical Outcomes & Policy, University of Florida, Gainesville, FL, USA; Biomedical Informatics and Data Science, Johns Hopkins University, Baltimore, MD, USA.

Jianlin Shi (J)

VA Informatics and Computing Infrastructure, Department of Veterans Affairs Salt Lake City Health Care System, Salt Lake City, Utah, USA; Division of Epidemiology, Department of Internal Medicine, School of Medicine, University of Utah, Salt Lake City, Utah, USA; Department of Biomedical Informatics, University of Utah, Salt Lake City, USA.

Xiaoyan Wang (X)

Sema4 Mount Sinai Genomics Incorporation, Stamford, CT, USA.

Yanshan Wang (Y)

Department of Health Information Management, Department of Biomedical Informatics, and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA, USA.

Wei-Qi Wei (WQ)

Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA.

Andrew E Williams (AE)

School of Medicine, Tufts University, Boston, MA, USA.

Rui Zhang (R)

Institute for Health Informatics, and Department of Pharmaceutical Care & Health Systems, University of Minnesota, Minneapolis, MN, USA.

Rimma Belenkaya (R)

Memorial Sloan Kettering Cancer Center, New York, NY, USA.

Christian Reich (C)

Real World Solutions, IQVIA, Durham, NC, USA.

Clair Blacketer (C)

Janssen Pharmaceutical Research and Development LLC, Titusville, NJ, USA; Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands.

Patrick Ryan (P)

Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA; Janssen Pharmaceutical Research and Development LLC, Titusville, NJ, USA.

George Hripcsak (G)

Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA.

Noémie Elhadad (N)

Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA. Electronic address: noemie.elhadad@columbia.edu.

Hua Xu (H)

Section of Biomedical Informatics and Data Science, Yale School of Medicine, Yale University, New Haven, CT, USA. Electronic address: hua.xu@yale.edu.
