Cochlea-inspired speech recognition interface.
Biophysical cochlear model
Noise robustness
Speech recognition interface
Journal
Medical & biological engineering & computing
ISSN: 1741-0444
Abbreviated title: Med Biol Eng Comput
Country: United States
NLM ID: 7704869
Publication information
Publication date: Jun 2019
History:
received: 2018-10-04
accepted: 2019-02-14
pubmed: 2019-03-05
medline: 2019-11-16
entrez: 2019-03-05
Status: ppublish
Abstract
Automatic speech recognition (ASR) technology provides a natural interface for human-machine interaction. Typical ASR systems can achieve high performance in quiet environments but, unlike humans, perform poorly in real-world situations. To better simulate the human auditory periphery and improve performance in realistic noisy scenarios, we propose two models of speech recognition front-ends based on a biophysical cochlear model. The first front-end is based on the method of signal reconstruction from a basilar membrane response. When applied to noisy speech, this method results in improved signal quality. It can be used as a preprocessing step in a standard ASR system and also as a noise reduction technique for other applications. The second front-end we propose is based on the construction of speech recognition coefficients directly from a basilar membrane response. Experimental results using a continuous-density hidden Markov model (HMM) recognizer demonstrate significant improvement in performance compared to standard Mel-frequency cepstral coefficients (MFCC) in various types of noisy conditions.
Graphical abstract: Speech recognition model based on cochlear front-end.
Identifiers
pubmed: 30830542
doi: 10.1007/s11517-019-01963-6
pii: 10.1007/s11517-019-01963-6
Publication types
Journal Article
Languages
eng
Citation subsets
IM
Pagination
1393-1403
Grants
Agency: Hrvatska Zaklada za Znanost
ID: UIP-2014-09-3875