Automatic Labeled Dialogue Generation for Nursing Record Systems.

dialogue systems machine learning natural language understanding nursing record systems

Journal

Journal of personalized medicine

ISSN: 2075-4426

Titre abrégé: J Pers Med

Pays: Switzerland

ID NLM: 101602269

Informations de publication

Date de publication:
16 Jul 2020

Historique:

received: 26 05 2020

revised: 29 06 2020

accepted: 09 07 2020

entrez: 26 7 2020

pubmed: 28 7 2020

medline: 28 7 2020

Statut: epublish

Résumé

The integration of digital voice assistants in nursing residences is becoming increasingly important to facilitate nursing productivity with documentation. A key idea behind this system is training natural language understanding (NLU) modules that enable the machine to classify the purpose of the user utterance (intent) and extract pieces of valuable information present in the utterance (entity). One of the main obstacles when creating robust NLU is the lack of sufficient labeled data, which generally relies on human labeling. This process is cost-intensive and time-consuming, particularly in the high-level nursing care domain, which requires abstract knowledge. In this paper, we propose an automatic dialogue labeling framework of NLU tasks, specifically for nursing record systems. First, we apply data augmentation techniques to create a collection of variant sample utterances. The individual evaluation result strongly shows a stratification rate, with regard to both fluency and accuracy in utterances. We also investigate the possibility of applying deep generative models for our augmented dataset. The preliminary character-based model based on long short-term memory (LSTM) obtains an accuracy of 90% and generates various reasonable texts with BLEU scores of 0.76. Secondly, we introduce an idea for intent and entity labeling by using feature embeddings and semantic similarity-based clustering. We also empirically evaluate different embedding methods for learning good representations that are most suitable to use with our data and clustering tasks. Experimental results show that fastText embeddings produce strong performances both for intent labeling and on entity labeling, which achieves an accuracy level of 0.79 and 0.78 f1-scores and 0.67 and 0.61 silhouette scores, respectively.

Identifiants

DOI: 10.3390/jpm10030062 PMID: 32708593 PMC: PMC7564988

pubmed: 32708593

pii: jpm10030062

doi: 10.3390/jpm10030062

pmc: PMC7564988

pii:

doi:

Types de publication

Journal Article

Langues

eng

Références

Acad Emerg Med. 2010 Oct;17(10):1086-92

pubmed: 21040110

J Am Med Inform Assoc. 2009 Jul-Aug;16(4):580-4

pubmed: 19390101

Database (Oxford). 2016 Oct 24;2016:

pubmed: 27777244

BMC Med. 2019 Oct 29;17(1):195

pubmed: 31665002

Neural Comput. 1997 Nov 15;9(8):1735-80

pubmed: 9377276

J Biomed Inform. 2018 Nov;87:12-20

pubmed: 30217670

J Med Syst. 2014 Jun;38(6):56

pubmed: 24827759

Proc Conf Empir Methods Nat Lang Process. 2016 Nov;2016:856-865

pubmed: 28004040

JMIR Med Inform. 2015 Apr 27;3(2):e19

pubmed: 25917752

J Biomed Inform. 2017 May;69:230-250

pubmed: 28433825

Sensors (Basel). 2019 Aug 29;19(17):

pubmed: 31470554

AMIA Annu Symp Proc. 2014 Nov 14;2014:1815-24

pubmed: 25954454

J Am Med Inform Assoc. 2015 Sep;22(5):967-79

pubmed: 26063745

J Am Med Inform Assoc. 2013 Sep-Oct;20(5):876-81

pubmed: 23043124

BMC Med Inform Decis Mak. 2017 Jul 5;17(Suppl 2):67

pubmed: 28699566

J Nurs Adm. 2014 Feb;44(2):79-86

pubmed: 24451445

J Formos Med Assoc. 2008 Feb;107(2):119-28

pubmed: 18285244

Automatic Labeled Dialogue Generation for Nursing Record Systems.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Références

Auteurs

Tittaya Mairittha (T)

Nattaya Mairittha (N)

Sozo Inoue (S)

Classifications MeSH