Representing and utilizing clinical textual data for real world studies: An OHDSI approach.
Electronic health records
Natural language processing
Real-world study
Journal
Journal of biomedical informatics
ISSN: 1532-0480
Abbreviated title: J Biomed Inform
Country: United States
NLM ID: 100970413
Publication information
Publication date:
06 2023
History:
received: 25 05 2022
revised: 21 01 2023
accepted: 13 03 2023
pmc-release: 01 06 2024
medline: 5 6 2023
pubmed: 20 3 2023
entrez: 19 3 2023
Status: ppublish
Abstract
Clinical documentation in electronic health records contains crucial narratives and details about patients and their care. Natural language processing (NLP) can unlock the information conveyed in clinical notes and reports, and thus plays a critical role in real-world studies. The NLP Working Group at the Observational Health Data Sciences and Informatics (OHDSI) consortium was established to develop methods and tools to promote the use of textual data and NLP in real-world observational studies. In this paper, we describe a framework for representing and utilizing textual data in real-world evidence generation, including representations of information from clinical text in the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM), the workflow and tools that were developed to extract, transform and load (ETL) data from clinical notes into tables in OMOP CDM, as well as current applications and specific use cases of the proposed OHDSI NLP solution at large consortia and individual institutions with English textual data. Challenges faced and lessons learned during the process are also discussed to provide valuable insights for researchers who are planning to implement NLP solutions in real-world studies.
Identifiers
pubmed: 36935011
pii: S1532-0464(23)00064-3
doi: 10.1016/j.jbi.2023.104343
pmc: PMC10428170
mid: NIHMS1897646
Publication types
Journal Article
Research Support, Non-U.S. Gov't
Research Support, N.I.H., Extramural
Languages
eng
Citation subsets
IM
Pagination
104343
Grants
Agency: NIA NIH HHS
ID: P30 AG059307
Country: United States
Agency: NLM NIH HHS
ID: R00 LM013001
Country: United States
Agency: NCATS NIH HHS
ID: U01 TR002062
Country: United States
Agency: NCCIH NIH HHS
ID: R01 AT009457
Country: United States
Agency: NCI NIH HHS
ID: P30 CA008748
Country: United States
Agency: NLM NIH HHS
ID: R01 LM006910
Country: United States
Agency: NCATS NIH HHS
ID: UL1 TR002494
Country: United States
Copyright information
Copyright © 2023 Elsevier Inc. All rights reserved.
Conflict of interest statement
Declaration of Competing Interest: Dr. Hua Xu and The University of Texas Health Science Center at Houston have research-related financial interests at Melax Technologies Inc. Dr. Xiaoyan Wang has related financial interests at Sema4 Mount Sinai Genomics Inc.