Automatic Speech Recognition Method Based on Deep Learning Approaches for Uzbek Language.
CTC-attention
Uzbek language
convolutional neural network
deep learning
end-to-end speech recognition
hidden Markov model
transformers
Journal
Sensors (Basel, Switzerland)
ISSN: 1424-8220
Titre abrégé: Sensors (Basel)
Pays: Switzerland
ID NLM: 101204366
Informations de publication
Date de publication:
12 May 2022
12 May 2022
Historique:
received:
12
04
2022
revised:
01
05
2022
accepted:
10
05
2022
entrez:
28
5
2022
pubmed:
29
5
2022
medline:
1
6
2022
Statut:
epublish
Résumé
Communication has been an important aspect of human life, civilization, and globalization for thousands of years. Biometric analysis, education, security, healthcare, and smart cities are only a few examples of speech recognition applications. Most studies have mainly concentrated on English, Spanish, Japanese, or Chinese, disregarding other low-resource languages, such as Uzbek, leaving their analysis open. In this paper, we propose an End-To-End Deep Neural Network-Hidden Markov Model speech recognition model and a hybrid Connectionist Temporal Classification (CTC)-attention network for the Uzbek language and its dialects. The proposed approach reduces training time and improves speech recognition accuracy by effectively using CTC objective function in attention model training. We evaluated the linguistic and lay-native speaker performances on the Uzbek language dataset, which was collected as a part of this study. Experimental results show that the proposed model achieved a word error rate of 14.3% using 207 h of recordings as an Uzbek language training dataset.
Identifiants
pubmed: 35632092
pii: s22103683
doi: 10.3390/s22103683
pmc: PMC9147241
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Subventions
Organisme : This work was supported by the GRRC program of Gyeonggi province. [GRRC-Gachon2020(B02), AI-based Medical Information Analysis]
ID : GRRC-Gachon2020(B02)
Références
PLoS One. 2015 Dec 11;10(12):e0144610
pubmed: 26656189
IEEE Rev Biomed Eng. 2021;14:342-356
pubmed: 32746367
Sensors (Basel). 2022 Jan 01;22(1):
pubmed: 35009863
Sensors (Basel). 2022 Mar 09;22(6):
pubmed: 35336276