Zero-Shot Clinical Acronym Expansion via Latent Meaning Cells.

clinical acronyms representation learning variational inference

Journal

Proceedings of machine learning research
ISSN: 2640-3498
Titre abrégé: Proc Mach Learn Res
Pays: United States
ID NLM: 101735789

Informations de publication

Date de publication:
Dec 2020
Historique:
entrez: 18 11 2021
pubmed: 19 11 2021
medline: 19 11 2021
Statut: ppublish

Résumé

We introduce Latent Meaning Cells, a deep latent variable model which learns contextualized representations of words by combining local lexical context and metadata. Metadata can refer to granular context, such as section type, or to more global context, such as unique document ids. Reliance on metadata for contextualized representation learning is apropos in the clinical domain where text is semi-structured and expresses high variation in topics. We evaluate the LMC model on the task of zero-shot clinical acronym expansion across three datasets. The LMC significantly outperforms a diverse set of baselines at a fraction of the pre-training cost and learns clinically coherent representations. We demonstrate that not only is metadata itself very helpful for the task, but that the LMC inference algorithm provides an additional large benefit.

Identifiants

pubmed: 34790898
pmc: PMC8594244
mid: NIHMS1747886

Types de publication

Journal Article

Langues

eng

Pagination

12-40

Subventions

Organisme : NIGMS NIH HHS
ID : R01 GM114355
Pays : United States
Organisme : NLM NIH HHS
ID : T15 LM007079
Pays : United States
Organisme : NCATS NIH HHS
ID : U01 TR002062
Pays : United States

Références

N Engl J Med. 1968 Mar 14;278(11):593-600
pubmed: 5637758
J AHIMA. 2013 Mar;84(3):44-5
pubmed: 23556403
Bioinformatics. 2020 Feb 15;36(4):1234-1240
pubmed: 31501885
Sci Data. 2016 May 24;3:160035
pubmed: 27219127
Yearb Med Inform. 2016 Nov 10;(1):224-233
pubmed: 27830255
AMIA Annu Symp Proc. 2017 Feb 10;2016:560-569
pubmed: 28269852
J Am Med Inform Assoc. 2014 Mar-Apr;21(2):299-307
pubmed: 23813539
AMIA Annu Symp Proc. 2006;:399-403
pubmed: 17238371
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D267-70
pubmed: 14681409
Yearb Med Inform. 2008;:128-44
pubmed: 18660887

Auteurs

Griffin Adams (G)

Columbia University, New York, NY, US.

Mert Ketenci (M)

Columbia University, New York, NY, US.

Shreyas Bhave (S)

Columbia University, New York, NY, US.

Adler Perotte (A)

Columbia University, New York, NY, US.

Noémie Elhadad (N)

Columbia University, New York, NY, US.

Classifications MeSH