Text Classification of Cancer Clinical Trial Eligibility Criteria.


Journal

AMIA ... Annual Symposium proceedings. AMIA Symposium
ISSN: 1942-597X
Titre abrégé: AMIA Annu Symp Proc
Pays: United States
ID NLM: 101209213

Informations de publication

Date de publication:
2023
Historique:
medline: 15 1 2024
pubmed: 15 1 2024
entrez: 15 1 2024
Statut: epublish

Résumé

Automatic identification of clinical trials for which a patient is eligible is complicated by the fact that trial eligibility are stated in natural language. A potential solution to this problem is to employ text classification methods for common types of eligibility criteria. In this study, we focus on seven common exclusion criteria in cancer trials: prior malignancy, human immunodeficiency virus, hepatitis B, hepatitis C, psychiatric illness, drug/substance abuse, and autoimmune illness. Our dataset consists of 764 phase III cancer trials with these exclusions annotated at the trial level. We experiment with common transformer models as well as a new pre-trained clinical trial BERT model. Our results demonstrate the feasibility of automatically classifying common exclusion criteria. Additionally, we demonstrate the value of a pre-trained language model specifically for clinical trials, which yield the highest average performance across all criteria.

Identifiants

pubmed: 38222417
pii: 1347
pmc: PMC10785908

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

1304-1313

Informations de copyright

©2023 AMIA - All rights reserved.

Auteurs

Yumeng Yang (Y)

School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA.

Soumya Jayaraj (S)

School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA.

Ethan Ludmir (E)

Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.
Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.

Kirk Roberts (K)

School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA.

Classifications MeSH