Automatic Labeled Dialogue Generation for Nursing Record Systems.

dialogue systems machine learning natural language understanding nursing record systems

Journal

Journal of personalized medicine
ISSN: 2075-4426
Titre abrégé: J Pers Med
Pays: Switzerland
ID NLM: 101602269

Informations de publication

Date de publication:
16 Jul 2020
Historique:
received: 26 05 2020
revised: 29 06 2020
accepted: 09 07 2020
entrez: 26 7 2020
pubmed: 28 7 2020
medline: 28 7 2020
Statut: epublish

Résumé

The integration of digital voice assistants in nursing residences is becoming increasingly important to facilitate nursing productivity with documentation. A key idea behind this system is training natural language understanding (NLU) modules that enable the machine to classify the purpose of the user utterance (intent) and extract pieces of valuable information present in the utterance (entity). One of the main obstacles when creating robust NLU is the lack of sufficient labeled data, which generally relies on human labeling. This process is cost-intensive and time-consuming, particularly in the high-level nursing care domain, which requires abstract knowledge. In this paper, we propose an automatic dialogue labeling framework of NLU tasks, specifically for nursing record systems. First, we apply data augmentation techniques to create a collection of variant sample utterances. The individual evaluation result strongly shows a stratification rate, with regard to both fluency and accuracy in utterances. We also investigate the possibility of applying deep generative models for our augmented dataset. The preliminary character-based model based on long short-term memory (LSTM) obtains an accuracy of 90% and generates various reasonable texts with BLEU scores of 0.76. Secondly, we introduce an idea for intent and entity labeling by using feature embeddings and semantic similarity-based clustering. We also empirically evaluate different embedding methods for learning good representations that are most suitable to use with our data and clustering tasks. Experimental results show that fastText embeddings produce strong performances both for intent labeling and on entity labeling, which achieves an accuracy level of 0.79 and 0.78 f1-scores and 0.67 and 0.61 silhouette scores, respectively.

Identifiants

pubmed: 32708593
pii: jpm10030062
doi: 10.3390/jpm10030062
pmc: PMC7564988
pii:
doi:

Types de publication

Journal Article

Langues

eng

Références

Acad Emerg Med. 2010 Oct;17(10):1086-92
pubmed: 21040110
J Am Med Inform Assoc. 2009 Jul-Aug;16(4):580-4
pubmed: 19390101
Database (Oxford). 2016 Oct 24;2016:
pubmed: 27777244
BMC Med. 2019 Oct 29;17(1):195
pubmed: 31665002
Neural Comput. 1997 Nov 15;9(8):1735-80
pubmed: 9377276
J Biomed Inform. 2018 Nov;87:12-20
pubmed: 30217670
J Med Syst. 2014 Jun;38(6):56
pubmed: 24827759
Proc Conf Empir Methods Nat Lang Process. 2016 Nov;2016:856-865
pubmed: 28004040
JMIR Med Inform. 2015 Apr 27;3(2):e19
pubmed: 25917752
J Biomed Inform. 2017 May;69:230-250
pubmed: 28433825
Sensors (Basel). 2019 Aug 29;19(17):
pubmed: 31470554
AMIA Annu Symp Proc. 2014 Nov 14;2014:1815-24
pubmed: 25954454
J Am Med Inform Assoc. 2015 Sep;22(5):967-79
pubmed: 26063745
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):876-81
pubmed: 23043124
BMC Med Inform Decis Mak. 2017 Jul 5;17(Suppl 2):67
pubmed: 28699566
J Nurs Adm. 2014 Feb;44(2):79-86
pubmed: 24451445
J Formos Med Assoc. 2008 Feb;107(2):119-28
pubmed: 18285244

Auteurs

Tittaya Mairittha (T)

Graduate School of Engineering, Kyushu Institute of Technology, 1-1 Sensui-cho, Tobata-ku, Kitakyushu-shi, Fukuoka 804-8550, Japan.

Nattaya Mairittha (N)

Graduate School of Engineering, Kyushu Institute of Technology, 1-1 Sensui-cho, Tobata-ku, Kitakyushu-shi, Fukuoka 804-8550, Japan.

Sozo Inoue (S)

Graduate School of Engineering, Kyushu Institute of Technology, 1-1 Sensui-cho, Tobata-ku, Kitakyushu-shi, Fukuoka 804-8550, Japan.

Classifications MeSH