Lightweight Deep Learning Model for Assessment of Substitution Voicing and Speech after Laryngeal Carcinoma Surgery.

Keywords: convolutional neural networks; deep learning; laryngeal carcinoma; substitution voicing; voice analysis

Journal

Cancers
ISSN: 2072-6694
Abbreviated title: Cancers (Basel)
Country: Switzerland
NLM ID: 101526829

Publication information

Publication date:
11 May 2022
History:
received: 11 04 2022
revised: 03 05 2022
accepted: 04 05 2022
entrez: 28 5 2022
pubmed: 29 5 2022
medline: 29 5 2022
Status: epublish

Abstract

Laryngeal carcinoma is the most common malignant tumor of the upper respiratory tract. Total laryngectomy completely and permanently separates the upper and lower airways, which causes loss of voice and leaves the patient unable to communicate verbally in the postoperative period. This paper aims to apply recent deep learning methods to objectively classify, extract, and measure substitution voicing after laryngeal oncosurgery from the audio signal. We propose using well-known convolutional neural networks (CNNs), originally applied to image classification, to analyze the voice audio signal. Our approach uses a Mel-frequency spectrogram (MFCC) as the input to the deep neural network architecture. A database of digital speech recordings from 367 male subjects (279 normal speech samples and 88 pathological speech samples) was used. Our approach achieved the best true-positive rate among the compared state-of-the-art approaches, with an overall accuracy of 89.47%.
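
The abstract reports both a true-positive rate and an overall accuracy on an imbalanced dataset (279 normal versus 88 pathological samples). The following minimal sketch shows how these two quantities are usually computed from a binary confusion matrix, and what accuracy a trivial majority-class predictor would reach on the stated class sizes. Only the class sizes come from the abstract; the function names and the confusion-matrix counts in the example are hypothetical and are not results from the paper.

```python
def binary_metrics(tp, fn, fp, tn):
    """Return accuracy, true-positive rate and true-negative rate
    for a binary confusion matrix given as raw counts."""
    total = tp + fn + fp + tn
    return {
        "accuracy": (tp + tn) / total,
        "true_positive_rate": tp / (tp + fn),  # also called sensitivity or recall
        "true_negative_rate": tn / (tn + fp),  # also called specificity
    }


def majority_baseline(n_normal, n_pathological):
    """Accuracy of a classifier that always predicts the larger class."""
    return max(n_normal, n_pathological) / (n_normal + n_pathological)


# Class sizes stated in the abstract.
N_NORMAL, N_PATHOLOGICAL = 279, 88
baseline = majority_baseline(N_NORMAL, N_PATHOLOGICAL)
print(f"majority-class baseline accuracy: {baseline:.4f}")

# Hypothetical confusion matrix for illustration only (NOT the paper's results).
example = binary_metrics(tp=20, fn=5, fp=3, tn=72)
print(example)
```

Because roughly three quarters of the samples are normal speech, accuracy alone can look high even for a weak classifier, which is presumably why the abstract reports the true-positive rate alongside it.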

Identifiers

pubmed: 35625971
pii: cancers14102366
doi: 10.3390/cancers14102366
pmc: PMC9139213

Publication types

Journal Article

Languages

eng

Authors

Rytis Maskeliūnas (R)

Faculty of Informatics, Kaunas University of Technology, 51368 Kaunas, Lithuania.

Audrius Kulikajevas (A)

Faculty of Informatics, Kaunas University of Technology, 51368 Kaunas, Lithuania.

Robertas Damaševičius (R)

Faculty of Informatics, Kaunas University of Technology, 51368 Kaunas, Lithuania.

Kipras Pribuišis (K)

Department of Otorhinolaryngology, Lithuanian University of Health Sciences, 50061 Kaunas, Lithuania.

Nora Ulozaitė-Stanienė (N)

Department of Otorhinolaryngology, Lithuanian University of Health Sciences, 50061 Kaunas, Lithuania.

Virgilijus Uloza (V)

Department of Otorhinolaryngology, Lithuanian University of Health Sciences, 50061 Kaunas, Lithuania.

MeSH classifications