Automatic Labeled Dialogue Generation for Nursing Record Systems.
dialogue systems
machine learning
natural language understanding
nursing record systems
Journal
Journal of personalized medicine
ISSN: 2075-4426
Titre abrégé: J Pers Med
Pays: Switzerland
ID NLM: 101602269
Informations de publication
Date de publication:
16 Jul 2020
16 Jul 2020
Historique:
received:
26
05
2020
revised:
29
06
2020
accepted:
09
07
2020
entrez:
26
7
2020
pubmed:
28
7
2020
medline:
28
7
2020
Statut:
epublish
Résumé
The integration of digital voice assistants in nursing residences is becoming increasingly important to facilitate nursing productivity with documentation. A key idea behind this system is training natural language understanding (NLU) modules that enable the machine to classify the purpose of the user utterance (intent) and extract pieces of valuable information present in the utterance (entity). One of the main obstacles when creating robust NLU is the lack of sufficient labeled data, which generally relies on human labeling. This process is cost-intensive and time-consuming, particularly in the high-level nursing care domain, which requires abstract knowledge. In this paper, we propose an automatic dialogue labeling framework of NLU tasks, specifically for nursing record systems. First, we apply data augmentation techniques to create a collection of variant sample utterances. The individual evaluation result strongly shows a stratification rate, with regard to both fluency and accuracy in utterances. We also investigate the possibility of applying deep generative models for our augmented dataset. The preliminary character-based model based on long short-term memory (LSTM) obtains an accuracy of 90% and generates various reasonable texts with BLEU scores of 0.76. Secondly, we introduce an idea for intent and entity labeling by using feature embeddings and semantic similarity-based clustering. We also empirically evaluate different embedding methods for learning good representations that are most suitable to use with our data and clustering tasks. Experimental results show that fastText embeddings produce strong performances both for intent labeling and on entity labeling, which achieves an accuracy level of 0.79 and 0.78 f1-scores and 0.67 and 0.61 silhouette scores, respectively.
Identifiants
pubmed: 32708593
pii: jpm10030062
doi: 10.3390/jpm10030062
pmc: PMC7564988
pii:
doi:
Types de publication
Journal Article
Langues
eng
Références
Acad Emerg Med. 2010 Oct;17(10):1086-92
pubmed: 21040110
J Am Med Inform Assoc. 2009 Jul-Aug;16(4):580-4
pubmed: 19390101
Database (Oxford). 2016 Oct 24;2016:
pubmed: 27777244
BMC Med. 2019 Oct 29;17(1):195
pubmed: 31665002
Neural Comput. 1997 Nov 15;9(8):1735-80
pubmed: 9377276
J Biomed Inform. 2018 Nov;87:12-20
pubmed: 30217670
J Med Syst. 2014 Jun;38(6):56
pubmed: 24827759
Proc Conf Empir Methods Nat Lang Process. 2016 Nov;2016:856-865
pubmed: 28004040
JMIR Med Inform. 2015 Apr 27;3(2):e19
pubmed: 25917752
J Biomed Inform. 2017 May;69:230-250
pubmed: 28433825
Sensors (Basel). 2019 Aug 29;19(17):
pubmed: 31470554
AMIA Annu Symp Proc. 2014 Nov 14;2014:1815-24
pubmed: 25954454
J Am Med Inform Assoc. 2015 Sep;22(5):967-79
pubmed: 26063745
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):876-81
pubmed: 23043124
BMC Med Inform Decis Mak. 2017 Jul 5;17(Suppl 2):67
pubmed: 28699566
J Nurs Adm. 2014 Feb;44(2):79-86
pubmed: 24451445
J Formos Med Assoc. 2008 Feb;107(2):119-28
pubmed: 18285244