Speech signal enhancement in cocktail party scenarios by deep learning based virtual sensing of head-mounted microphones.
Artificial intelligence
Beamformer
Cochlear implant
Hearing aid
Neural network
Selective hearing
Journal
Hearing research
ISSN: 1878-5891
Titre abrégé: Hear Res
Pays: Netherlands
ID NLM: 7900445
Informations de publication
Date de publication:
01 09 2021
01 09 2021
Historique:
received:
08
02
2021
revised:
31
05
2021
accepted:
07
06
2021
pubmed:
29
6
2021
medline:
4
2
2022
entrez:
28
6
2021
Statut:
ppublish
Résumé
The cocktail party effect refers to the human sense of hearing's ability to pay attention to a single conversation while filtering out all other background noise. To mimic this human hearing ability for people with hearing loss, scientists integrate beamforming algorithms into the signal processing path of hearing aids or implants' audio processors. Although these algorithms' performance strongly depends on the number and spatial arrangement of the microphones, most devices are equipped with a small number of microphones mounted close to each other on the audio processor housing. We measured and evaluated the impact of the number and spatial arrangement of hearing aid or head-mounted microphones on the performance of the established Minimum Variance Distortionless Response beamformer in cocktail party scenarios. The measurements revealed that the optimal microphone placement exploits monaural cues (pinna-effect), is close to the target signal, and creates a large distance spread due to its spatial arrangement. However, this microphone placement is impractical for hearing aid or implant users, as it includes microphone positions such as on the forehead. To overcome microphones' placement at impractical positions, we propose a deep virtual sensing estimation of the corresponding audio signals. The results of objective measures and a subjective listening test with 20 participants showed that the virtually sensed microphone signals significantly improved the speech quality, especially in cocktail party scenarios with low signal-to-noise ratios. Subjective speech quality was assessed using a 3-alternative forced choice procedure to determine which of the presented speech mixtures was most pleasant to understand. Hearing aid and cochlear implant (CI) users might benefit from the presented approach using virtually sensed microphone signals, especially in noisy environments.
Identifiants
pubmed: 34182232
pii: S0378-5955(21)00128-3
doi: 10.1016/j.heares.2021.108294
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
108294Informations de copyright
Copyright © 2021 The Author(s). Published by Elsevier B.V. All rights reserved.
Déclaration de conflit d'intérêts
Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.