Examining spoken words and acoustic features of therapy sessions to understand family caregivers' anxiety and quality of life.
Caregiver
Chatbot
Communication
Machine learning
Quality of life
Journal
International journal of medical informatics
ISSN: 1872-8243
Abbreviated title: Int J Med Inform
Country: Ireland
NLM ID: 9711057
Publication information
Publication date:
04 2022
History:
received: 2021-06-15
revised: 2022-02-07
accepted: 2022-02-08
pubmed: 2022-02-21
medline: 2022-04-05
entrez: 2022-02-20
Status: ppublish
Abstract
Abstract sections
BACKGROUND
Speech and language cues are considered significant data sources that can reveal insights into one's behavior and well-being. The goal of this study is to evaluate how different machine learning (ML) classifiers, trained on both the spoken words and the acoustic features of live conversations between family caregivers and a therapist, correlate with anxiety and quality of life (QoL) as assessed by validated instruments.
METHODS
The dataset comprised 124 audio-recorded and professionally transcribed discussions between family caregivers of hospice patients and a therapist about the challenges they faced in their caregiving role, together with standardized assessments of self-reported QoL and anxiety. We custom-built an Automated Speech Recognition (ASR) system, trained it on older adult voices, and created a logistic regression-based classifier that incorporated audio-based features. The classification process automated the QoL scoring and displayed the score in real time, replacing hand-coding of self-reported assessments with a classifier identified through machine learning.
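As a rough illustration of what "a logistic regression-based classifier that incorporated audio-based features" could look like, the sketch below concatenates word counts from a transcript with numeric acoustic values and fits a logistic regression by plain gradient descent. It is not the authors' system: the vocabulary, the two acoustic values per session, the labels, and every function name are hypothetical and chosen only to keep the example self-contained.

```python
import math

def featurize(transcript, acoustic, vocabulary):
    """Concatenate word counts over a fixed vocabulary with acoustic values."""
    words = transcript.lower().split()
    return [float(words.count(w)) for w in vocabulary] + list(acoustic)

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def score(x, weights, bias):
    """Predicted probability of the positive class for one feature vector."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)

def train_logistic(rows, labels, epochs=500, lr=0.1):
    """Fit weights and bias by batch gradient descent on the log loss."""
    n_features = len(rows[0])
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for x, y in zip(rows, labels):
            error = score(x, weights, bias) - y
            for j, xi in enumerate(x):
                grad_w[j] += error * xi
            grad_b += error
        weights = [w - lr * g / len(rows) for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / len(rows)
    return weights, bias

def predict(x, weights, bias):
    return 1 if score(x, weights, bias) >= 0.5 else 0

# Hypothetical toy data: (transcript, two made-up acoustic values, label).
# Label 1 stands for "higher anxiety"; none of this comes from the study.
VOCAB = ["worried", "tired", "calm", "fine"]
SESSIONS = [
    ("i am worried and tired", (0.9, 0.8), 1),
    ("so worried all night", (0.8, 0.7), 1),
    ("i feel calm today", (0.2, 0.3), 0),
    ("things are fine and calm", (0.1, 0.2), 0),
]

X = [featurize(text, acoustic, VOCAB) for text, acoustic, _ in SESSIONS]
y = [label for _, _, label in SESSIONS]
weights, bias = train_logistic(X, y)
predictions = [predict(x, weights, bias) for x in X]
```

The point of the sketch is only the feature layout: text-derived and audio-derived values sit side by side in one vector, so a single linear model can weigh both. A real pipeline would obtain the transcript from the ASR system and extract acoustic features from the audio signal.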
FINDINGS
Of the 124 audio files and their transcripts, 87 transcripts (70%) were selected as the training set, and the remaining 30% were held out for evaluation. For anxiety, the classifier that added the dimension of sound and automated speech-to-text transcription outperformed the prior classifier trained only on human-rendered transcriptions. Specifically, precision improved from 86% to 92%, accuracy from 81% to 89%, and recall from 78% to 88%.
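For readers less familiar with the reported metrics, the following sketch shows the arithmetic behind the 70/30 split (124 sessions, 87 used for training) and the standard definitions of precision, recall, and accuracy in terms of confusion-matrix counts. The counts passed to the metric functions are hypothetical and are not taken from the study; the function names are likewise illustrative.

```python
def split_sizes(n_items, train_fraction):
    """Return (train, held-out) sizes, rounding the training share."""
    n_train = round(n_items * train_fraction)
    return n_train, n_items - n_train

def precision(tp, fp):
    """Share of predicted positives that are truly positive."""
    return tp / (tp + fp)

def recall(tp, fn):
    """Share of true positives that the classifier found."""
    return tp / (tp + fn)

def accuracy(tp, tn, fp, fn):
    """Share of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)

# 124 sessions with a 70% training share gives 87 training and 37 held-out.
n_train, n_test = split_sizes(124, 0.70)

# Hypothetical confusion-matrix counts, for illustration only.
tp, fp, fn, tn = 8, 2, 2, 8
example_precision = precision(tp, fp)
example_recall = recall(tp, fn)
example_accuracy = accuracy(tp, tn, fp, fn)
```

Precision and recall both focus on the positive class, while accuracy counts correct predictions across both classes, which is why the three figures in the abstract can differ from one another.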
INTERPRETATION
ML techniques can be used to develop classifiers that indicate improvements in QoL measures with a reasonable degree of accuracy. Examining the content, the sound of the voice, and the context of the conversation provides insights into additional factors affecting anxiety and QoL that could be addressed in tailored therapy and in the design of conversational agents serving as therapy chatbots.
Identifiers
pubmed: 35183870
pii: S1386-5056(22)00030-2
doi: 10.1016/j.ijmedinf.2022.104716
pmc: PMC8902633
mid: NIHMS1783401
Publication types
Journal Article
Research Support, N.I.H., Extramural
Languages
eng
Citation subsets
IM
Pagination
104716
Grants
Agency: NINR NIH HHS
ID: R01 NR012213
Country: United States
Copyright information
Copyright © 2022 Elsevier B.V. All rights reserved.