Automatic Speech Recognition Method Based on Deep Learning Approaches for Uzbek Language.

CTC-attention Uzbek language convolutional neural network deep learning end-to-end speech recognition hidden Markov model transformers

Journal

Sensors (Basel, Switzerland)
ISSN: 1424-8220
Titre abrégé: Sensors (Basel)
Pays: Switzerland
ID NLM: 101204366

Informations de publication

Date de publication:
12 May 2022
Historique:
received: 12 04 2022
revised: 01 05 2022
accepted: 10 05 2022
entrez: 28 5 2022
pubmed: 29 5 2022
medline: 1 6 2022
Statut: epublish

Résumé

Communication has been an important aspect of human life, civilization, and globalization for thousands of years. Biometric analysis, education, security, healthcare, and smart cities are only a few examples of speech recognition applications. Most studies have mainly concentrated on English, Spanish, Japanese, or Chinese, disregarding other low-resource languages, such as Uzbek, leaving their analysis open. In this paper, we propose an End-To-End Deep Neural Network-Hidden Markov Model speech recognition model and a hybrid Connectionist Temporal Classification (CTC)-attention network for the Uzbek language and its dialects. The proposed approach reduces training time and improves speech recognition accuracy by effectively using CTC objective function in attention model training. We evaluated the linguistic and lay-native speaker performances on the Uzbek language dataset, which was collected as a part of this study. Experimental results show that the proposed model achieved a word error rate of 14.3% using 207 h of recordings as an Uzbek language training dataset.

Identifiants

pubmed: 35632092
pii: s22103683
doi: 10.3390/s22103683
pmc: PMC9147241
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : This work was supported by the GRRC program of Gyeonggi province. [GRRC-Gachon2020(B02), AI-based Medical Information Analysis]
ID : GRRC-Gachon2020(B02)

Références

PLoS One. 2015 Dec 11;10(12):e0144610
pubmed: 26656189
IEEE Rev Biomed Eng. 2021;14:342-356
pubmed: 32746367
Sensors (Basel). 2022 Jan 01;22(1):
pubmed: 35009863
Sensors (Basel). 2022 Mar 09;22(6):
pubmed: 35336276

Auteurs

Abdinabi Mukhamadiyev (A)

Department of Computer Engineering, Gachon University, Sujeong-gu, Seongnam-si 13120, Korea.

Ilyos Khujayarov (I)

Department of Information Technologies, Samarkand Branch of Tashkent University of Information Technologies Named after Muhammad al-Khwarizmi, Tashkent 140100, Uzbekistan.

Oybek Djuraev (O)

Department of Hardware and Software of Control Systems in Telecommunication, Tashkent University of Information Technologies Named after Muhammad al-Khwarizmi, Tashkent 100084, Uzbekistan.

Jinsoo Cho (J)

Department of Computer Engineering, Gachon University, Sujeong-gu, Seongnam-si 13120, Korea.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH