Natural Language Processing Can Automate Extraction of Barrett's Esophagus Endoscopy Quality Metrics.

Barrett’s esophagus natural language processing quality improvement

Journal

medRxiv : the preprint server for health sciences
Titre abrégé: medRxiv
Pays: United States
ID NLM: 101767986

Informations de publication

Date de publication:
13 Jul 2023
Historique:
medline: 7 8 2023
pubmed: 7 8 2023
entrez: 7 8 2023
Statut: epublish

Résumé

To develop an automated natural language processing (NLP) method for extracting high-fidelity Barrett's Esophagus (BE) endoscopic surveillance and treatment data from the electronic health record (EHR). Patients who underwent BE-related endoscopies between 2016 and 2020 at a single medical center were randomly assigned to a development or validation set. Those not aged 40 to 80 and those without confirmed BE were excluded. For each patient, free text pathology reports and structured procedure data were obtained. Gastroenterologists assigned ground truth labels. An NLP method leveraging MetaMap Lite generated endoscopy-level diagnosis and treatment data. Performance metrics were assessed for this data. The NLP methodology was then adapted to label key endoscopic eradication therapy (EET)-related endoscopy events and thereby facilitate calculation of patient-level pre-EET diagnosis, endotherapy time, and time to CE-IM. 99 patients (377 endoscopies) and 115 patients (399 endoscopies) were included in the development and validation sets respectively. When assigning high-fidelity labels to the validation set, NLP achieved high performance (recall: 0.976, precision: 0.970, accuracy: 0.985, and F1-score: 0.972). 77 patients initiated EET and underwent 554 endoscopies. Key EET-related clinical event labels had high accuracy (EET start: 0.974, CE-D: 1.00, and CE-IM: 1.00), facilitating extraction of pre-treatment diagnosis, endotherapy time, and time to CE-IM. High-fidelity BE endoscopic surveillance and treatment data can be extracted from routine EHR data using our automated, transparent NLP method. This method produces high-level clinical datasets for clinical research and quality metric assessment.

Identifiants

pubmed: 37546941
doi: 10.1101/2023.07.11.23292529
pmc: PMC10403813
pii:
doi:

Types de publication

Preprint

Langues

eng

Déclaration de conflit d'intérêts

Potential Competing Interests: The authors declare no conflicts of interest.

Références

JAMA. 2022 Aug 16;328(7):663-671
pubmed: 35972481
Clin Gastroenterol Hepatol. 2014 Jul;12(7):1130-6
pubmed: 24316106
Gastrointest Endosc. 2019 Sep;90(3):335-359.e2
pubmed: 31439127
Gastrointest Endosc. 2012 Jun;75(6):1233-9.e14
pubmed: 22482913
J Clin Gastroenterol. 2019 Jan;53(1):e25-e30
pubmed: 28906424
PLoS One. 2020 May 12;15(5):e0232840
pubmed: 32396579
Gastroenterology. 2015 Nov;149(6):1599-606
pubmed: 26296479
J Urol. 2013 Jul;190(1):17-8
pubmed: 23608038
J Med Syst. 2020 Jul 31;44(9):151
pubmed: 32737597
Dig Dis Sci. 2018 Jul;63(7):1794-1800
pubmed: 29696479
Gastroenterology. 2022 Feb;162(2):468-481
pubmed: 34757142
Gastrointest Endosc. 2017 Jul;86(1):1-17.e3
pubmed: 28576294
Diagnostics (Basel). 2022 Jul 15;12(7):
pubmed: 35885630
BMC Med Res Methodol. 2022 Aug 15;22(1):227
pubmed: 35971057
Am J Gastroenterol. 2014 Dec;109(12):1844-9
pubmed: 24935271
PLoS One. 2015 Oct 20;10(10):e0138649
pubmed: 26484762
Endoscopy. 2020 Jan;52(1):17-28
pubmed: 31529444
Proc AMIA Symp. 2001;:17-21
pubmed: 11825149
J Am Med Inform Assoc. 2011 Dec;18 Suppl 1:i150-6
pubmed: 21946240
Am J Gastroenterol. 2015 Apr;110(4):543-52
pubmed: 25756240
J Am Med Inform Assoc. 2017 Jul 1;24(4):841-844
pubmed: 28130331
CA Cancer J Clin. 2022 Jan;72(1):7-33
pubmed: 35020204
Clin Gastroenterol Hepatol. 2023 May;21(5):1198-1204
pubmed: 36115659
Clin Gastroenterol Hepatol. 2022 Dec;20(12):2696-2706.e1
pubmed: 35788412
J Am Med Inform Assoc. 2010 Sep-Oct;17(5):507-13
pubmed: 20819853
Am J Gastroenterol. 2016 Jan;111(1):30-50; quiz 51
pubmed: 26526079
Gastrointest Endosc. 2015 Sep;82(3):512-9
pubmed: 25910665
BMC Med Res Methodol. 2008 May 09;8:29
pubmed: 18471298
PLoS One. 2008 Aug 25;3(8):e3049
pubmed: 18725958
AMIA Annu Symp Proc. 2023 Apr 29;2022:972-981
pubmed: 37128372
J Med Internet Res. 2022 Mar 23;24(3):e27210
pubmed: 35319481
Gastrointest Endosc. 2020 Oct;92(4):886-890
pubmed: 32417299
Am J Surg Pathol. 2018 Mar;42(3):376-381
pubmed: 29135518
Clin Gastroenterol Hepatol. 2013 Jun;11(6):689-94
pubmed: 23313839

Auteurs

Ali Soroush (A)

Division of Digestive and Liver Diseases, Department of Medicine, Columbia University Irving Medical Center, New York, NY, USA.

Courtney J Diamond (CJ)

Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA.

Haley M Zylberberg (HM)

Division of Digestive and Liver Diseases, Department of Medicine, Columbia University Irving Medical Center, New York, NY, USA.

Benjamin May (B)

Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY, USA.

Nicholas Tatonetti (N)

Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA.
Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA.
Cedars-Sinai Cancer, Cedars-Sinai Medical Center, Los Angeles, CA, USA.

Julian A Abrams (JA)

Division of Digestive and Liver Diseases, Department of Medicine, Columbia University Irving Medical Center, New York, NY, USA.
Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY, USA.

Chunhua Weng (C)

Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA.

Classifications MeSH