Wav2DDK: Analytical and Clinical Validation of an Automated Diadochokinetic Rate Estimation Algorithm on Remotely Collected Speech.


Journal

Journal of speech, language, and hearing research : JSLHR
ISSN: 1558-9102
Titre abrégé: J Speech Lang Hear Res
Pays: United States
ID NLM: 9705610

Informations de publication

Date de publication:
17 08 2023
Historique:
pmc-release: 01 02 2024
medline: 18 8 2023
pubmed: 9 8 2023
entrez: 9 8 2023
Statut: ppublish

Résumé

Oral diadochokinesis is a useful task in assessment of speech motor function in the context of neurological disease. Remote collection of speech tasks provides a convenient alternative to in-clinic visits, but scoring these assessments can be a laborious process for clinicians. This work describes Wav2DDK, an automated algorithm for estimating the diadochokinetic (DDK) rate on remotely collected audio from healthy participants and participants with amyotrophic lateral sclerosis (ALS). Wav2DDK was developed using a corpus of 970 DDK assessments from healthy and ALS speakers where ground truth DDK rates were provided manually by trained annotators. The clinical utility of the algorithm was demonstrated on a corpus of 7,919 assessments collected longitudinally from 26 healthy controls and 82 ALS speakers. Corpora were collected via the participants' own mobile device, and instructions for speech elicitation were provided via a mobile app. DDK rate was estimated by parsing the character transcript from a deep neural network transformer acoustic model trained on healthy and ALS speech. Algorithm estimated DDK rates are highly accurate, achieving .98 correlation with manual annotation, and an average error of only 0.071 syllables per second. The rate exactly matched ground truth for 83% of files and was within 0.5 syllables per second for 95% of files. Estimated rates achieve a high test-retest reliability ( We demonstrate a system for automated DDK estimation that increases efficiency of calculation beyond manual annotation. Thorough analytical and clinical validation demonstrates that the algorithm is not only highly accurate, but also provides a convenient, clinically relevant metric for tracking longitudinal decline in ALS, serving to promote participation and diversity of participants in clinical research. https://doi.org/10.23641/asha.23787033.

Identifiants

pubmed: 37556308
doi: 10.1044/2023_JSLHR-22-00282
pmc: PMC10555468
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural

Langues

eng

Sous-ensembles de citation

IM

Pagination

3166-3181

Subventions

Organisme : NIDCD NIH HHS
ID : R01 DC006859
Pays : United States
Organisme : NIDCD NIH HHS
ID : R21 DC019475
Pays : United States

Références

Int J Lang Commun Disord. 2017 May;52(3):301-310
pubmed: 27432555
NPJ Digit Med. 2021 Oct 28;4(1):153
pubmed: 34711924
J Speech Hear Res. 1972 Dec;15(4):763-70
pubmed: 4652397
J Neurolinguistics. 2007 Jan;20(1):50-64
pubmed: 21253440
Behav Neurol. 2015;2015:183027
pubmed: 26136624
J Psycholinguist Res. 2017 Aug;46(4):897-904
pubmed: 28025805
J Commun Disord. 2014 Mar-Apr;48:27-37
pubmed: 24630145
Folia Phoniatr Logop. 2003 Sep-Oct;55(5):241-59
pubmed: 12931058
PLoS One. 2016 May 05;11(5):e0154971
pubmed: 27148967
J Speech Lang Hear Res. 2020 Oct 16;63(10):3453-3460
pubmed: 32955982
Int J Lang Commun Disord. 2003 Oct-Dec;38(4):417-28
pubmed: 14578054
J Speech Lang Hear Res. 2018 Nov 8;61(11):2757-2771
pubmed: 30383220
J Neurol Sci. 1999 Oct 31;169(1-2):13-21
pubmed: 10540002
IEEE EMBS Int Conf Biomed Health Inform. 2019;2019:1-4
pubmed: 32864624
Asian J Psychiatr. 2015 Jun;15:51-5
pubmed: 26013669
Front Neurol. 2021 Feb 12;12:626780
pubmed: 33643204
NPJ Digit Med. 2020 Apr 14;3:55
pubmed: 32337371
Clin Linguist Phon. 2004 Jan-Feb;18(1):57-84
pubmed: 15053268
Int J Lang Commun Disord. 2022 Sep;57(5):1085-1097
pubmed: 35703470
J Speech Lang Hear Res. 2009 Oct;52(5):1334-52
pubmed: 19717656
J Speech Lang Hear Res. 2022 Mar 8;65(3):940-953
pubmed: 35171700
Lang Speech. 2018 Mar;61(1):113-134
pubmed: 28610466
J Speech Lang Hear Res. 2022 Feb 9;65(2):574-623
pubmed: 34958599
NPJ Digit Med. 2020 Oct 13;3:132
pubmed: 33083567
J Speech Lang Hear Res. 2019 Jan 22;63(1):59-73
pubmed: 31940257
IEEE Trans Neural Syst Rehabil Eng. 2020 Jan;28(1):32-41
pubmed: 31545738
Amyotroph Lateral Scler Frontotemporal Degener. 2019 Feb;20(1-2):61-67
pubmed: 30486680
J Acoust Soc Am. 2020 Feb;147(2):839
pubmed: 32113309

Auteurs

Prad Kadambi (P)

School of Electrical, Computer and Energy Engineering, Arizona State University, Tempe.
Aural Analytics Inc., Tempe, AZ.

Gabriela M Stegmann (GM)

Aural Analytics Inc., Tempe, AZ.

Julie Liss (J)

School of Speech and Hearing Science, Arizona State University, Tempe.
Aural Analytics Inc., Tempe, AZ.

Visar Berisha (V)

School of Electrical, Computer and Energy Engineering, Arizona State University, Tempe.
School of Speech and Hearing Science, Arizona State University, Tempe.
Aural Analytics Inc., Tempe, AZ.

Shira Hahn (S)

Aural Analytics Inc., Tempe, AZ.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH