Localizing concurrent sound sources with binaural microphones: A simulation study.

HRTF ILD ITD Reverberations Robotics Sound localization Sparseness

Journal

Hearing research
ISSN: 1878-5891
Titre abrégé: Hear Res
Pays: Netherlands
ID NLM: 7900445

Informations de publication

Date de publication:
11 2023
Historique:
received: 30 05 2023
revised: 04 09 2023
accepted: 09 09 2023
medline: 6 11 2023
pubmed: 26 9 2023
entrez: 25 9 2023
Statut: ppublish

Résumé

The human auditory system can localize multiple sound sources using time, intensity, and frequency cues in the sound received by the two ears. Being able to spatially segregate the sources helps perception in a challenging condition when multiple sounds coexist. This study used model simulations to explore an algorithm for localizing multiple sources in azimuth with binaural (i.e., two) microphones. The algorithm relies on the "sparseness" property of daily signals in the time-frequency domain, and sound coming from different locations carrying unique spatial features will form clusters. Based on an interaural normalization procedure, the model generated spiral patterns for sound sources in the frontal hemifield. The model itself was created using broadband noise for better accuracy, because speech typically has sporadic energy at high frequencies. The model at an arbitrary frequency can be used to predict locations of speech and music that occurred alone or concurrently, and a classification algorithm was applied to measure the localization error. Under anechoic conditions, averaged errors in azimuth increased from 4.5° to 19° with RMS errors ranging from 6.4° to 26.7° as model frequency increased from 300 to 3000 Hz. The low-frequency model performance using short speech sound was notably better than the generalized cross-correlation model. Two types of room reverberations were then introduced to simulate difficult listening conditions. Model performance under reverberation was more resilient at low frequencies than at high frequencies. Overall, our study presented a spiral model for rapidly predicting horizontal locations of concurrent sound that is suitable for real-world scenarios.

Identifiants

pubmed: 37748242
pii: S0378-5955(23)00196-X
doi: 10.1016/j.heares.2023.108884
pii:
doi:

Types de publication

Journal Article Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

108884

Informations de copyright

Copyright © 2023. Published by Elsevier B.V.

Auteurs

Jakeh Orr (J)

School of Science and Engineering, Saint Louis University, Saint Louis, MO 63105, USA. Electronic address: Jakeh.orr@slu.edu.

William Ebel (W)

School of Science and Engineering, Saint Louis University, Saint Louis, MO 63105, USA.

Yan Gai (Y)

School of Science and Engineering, Saint Louis University, Saint Louis, MO 63105, USA.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH