Cochlea-inspired speech recognition interface.


Journal

Medical & biological engineering & computing
ISSN: 1741-0444
Titre abrégé: Med Biol Eng Comput
Pays: United States
ID NLM: 7704869

Informations de publication

Date de publication:
Jun 2019
Historique:
received: 04 10 2018
accepted: 14 02 2019
pubmed: 5 3 2019
medline: 16 11 2019
entrez: 5 3 2019
Statut: ppublish

Résumé

Automatic speech recognition (ASR) technology provides a natural interface for human-machine interaction. Typical ASR systems can achieve high performance in quiet environments but, unlike humans, perform poorly in real-world situations. To better simulate the human auditory periphery and improve the performance in realistic noisy scenarios, we propose two models of speech recognition front-ends based on a biophysical cochlear model. The first front-end is based on the method of signal reconstruction from a basilar membrane response. When applied to noisy speech, this method results in improved signal quality. This method can be used as a preprocessing step in a standard ASR system and can also be used as a noise reduction technique for other applications. The second front-end we propose is based on the construction of speech recognition coefficients directly from a basilar membrane response. Experimental results using a continuous-density hidden Markov model (HMM) recognizer demonstrate significant improvement in performance compared to standard Mel-frequency cepstral coefficients (MFCC) in various types of noisy conditions. Graphical Abstract Speech recognition model based on cochlear front-end.

Identifiants

pubmed: 30830542
doi: 10.1007/s11517-019-01963-6
pii: 10.1007/s11517-019-01963-6
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

1393-1403

Subventions

Organisme : Hrvatska Zaklada za Znanost
ID : UIP-2014-09-3875

Références

J Acoust Soc Am. 1999 Oct;106(4 Pt 1):2040-50
pubmed: 10530027
Physiol Rev. 2001 Jul;81(3):1305-52
pubmed: 11427697
J Assoc Res Otolaryngol. 2003 Dec;4(4):478-94
pubmed: 14716508
Hear Res. 2006 May;215(1-2):84-96
pubmed: 16678986
J Acoust Soc Am. 2011 May;129(5):EL204-9
pubmed: 21568376
J Acoust Soc Am. 1990 Apr;87(4):1738-52
pubmed: 2341679
Med Biol Eng Comput. 2016 Jun;54(6):915-26
pubmed: 26753778
J Acoust Soc Am. 1993 Jun;93(6):3320-32
pubmed: 8326060
J Acoust Soc Am. 1996 Apr;99(4 Pt 1):2244-55
pubmed: 8730071

Auteurs

Mladen Russo (M)

Laboratory for Smart Environment Technologies, FESB - University of Split, Split, Croatia. mrusso@fesb.hr.

Maja Stella (M)

Laboratory for Smart Environment Technologies, FESB - University of Split, Split, Croatia.

Marjan Sikora (M)

Laboratory for Smart Environment Technologies, FESB - University of Split, Split, Croatia.

Matko Šarić (M)

Laboratory for Smart Environment Technologies, FESB - University of Split, Split, Croatia.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH