Use GPT-J Prompt Generation with RoBERTa for NER Models on Diagnosis Extraction of Periodontal Diagnosis from Electronic Dental Records.


Journal

AMIA ... Annual Symposium proceedings. AMIA Symposium
ISSN: 1942-597X
Titre abrégé: AMIA Annu Symp Proc
Pays: United States
ID NLM: 101209213

Informations de publication

Date de publication:
2023
Historique:
medline: 15 1 2024
pubmed: 15 1 2024
entrez: 15 1 2024
Statut: epublish

Résumé

This study explored the usability of prompt generation on named entity recognition (NER) tasks and the performance in different settings of the prompt. The prompt generation by GPT-J models was utilized to directly test the gold standard as well as to generate the seed and further fed to the RoBERTa model with the spaCy package. In the direct test, a lower ratio of negative examples with higher numbers of examples in prompt achieved the best results with a F1 score of 0.72. The performance revealed consistency, 0.92-0.97 in the F1 score, in all settings after training with the RoBERTa model. The study highlighted the importance of seed quality rather than quantity in feeding NER models. This research reports on an efficient and accurate way to mine clinical notes for periodontal diagnoses, allowing researchers to easily and quickly build a NER model with the prompt generation approach.

Identifiants

pubmed: 38222409
pii: 776
pmc: PMC10785852

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

904-912

Informations de copyright

©2023 AMIA - All rights reserved.

Auteurs

Yao-Shun Chuang (YS)

School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, Texas, USA.

Xiaoqian Jiang (X)

School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, Texas, USA.

Chun-Teh Lee (CT)

Department of Periodontics and Dental Hygiene, The University of Texas Health Science Center at Houston School of Dentistry, Houston, Texas, USA.

Ryan Brandon (R)

Department of Oral Health Sciences, Temple University Kornberg School of Dentistry, Philadelphia, Pennsylvania, USA.

Duong Tran (D)

Diagnostic and Biomedical Sciences, The University of Texas Health Science Center at Houston School of Dentistry, Houston, Texas, USA.

Oluwabunmi Tokede (O)

Oral Healthcare Quality and Safety, The University of Texas Health Science Center at Houston School of Dentistry, Houston, Texas, USA.
Diagnostic and Biomedical Sciences, The University of Texas Health Science Center at Houston School of Dentistry, Houston, Texas, USA.

Muhammad F Walji (MF)

Oral Healthcare Quality and Safety, The University of Texas Health Science Center at Houston School of Dentistry, Houston, Texas, USA.
Diagnostic and Biomedical Sciences, The University of Texas Health Science Center at Houston School of Dentistry, Houston, Texas, USA.

Classifications MeSH