Fischer, Tim; Caversaccio, Marco; Wimmer, Wilhelm (2021). Speech Signal Enhancement in Cocktail Party Scenarios by Deep Learning based Virtual Sensing of Head-Mounted Microphones. Hearing research, 408, p. 108294. Elsevier 10.1016/j.heares.2021.108294
|
Text
1-s2.0-S0378595521001283-main.pdf - Accepted Version Available under License Creative Commons: Attribution-Noncommercial-No Derivative Works (CC-BY-NC-ND). Download (1MB) | Preview |
|
|
Text
1-s2.0-S0378595521001283-main.pdf - Published Version Available under License Creative Commons: Attribution-Noncommercial-No Derivative Works (CC-BY-NC-ND). Download (1MB) | Preview |
The cocktail party effect refers to the human sense of hearing’s ability to pay attention to a single conversation while filtering out all other background noise. To mimic this human hearing ability for people with hearing loss, scientists integrate beamforming algorithms into the signal processing path of hearing aids or implants’ audio processors. Although these algorithms’ performance strongly depends on the number and spatial arrangement of the microphones, most devices are equipped with a small number of microphones mounted close to each other on the audio processor housing. We measured and evaluated the impact of the number and spatial arrangement of hearing aid or head-mounted microphones on the performance of the established Minimum Variance Distortionless Response beamformer in cocktail party scenarios. The measurements revealed that the optimal microphone placement exploits monaural cues (pinna-effect), is close to the target signal, and creates a large distance spread due to its spatial arrangement. However, this microphone placement is impractical for hearing aid or implant users, as it includes microphone positions such as on the forehead. To overcome microphones’ placement at impractical positions, we propose a deep virtual sensing estimation of the corresponding audio signals. The results of objective measures and a subjective listening test with 20 participants showed that the virtually sensed microphone signals significantly improved the speech quality, especially in cocktail party scenarios with low signal-to-noise ratios. Subjective speech quality was assessed using a 3-alternative forced choice procedure to determine which of the presented speech mixtures was most pleasant to understand. Hearing aid and cochlear implant (CI) users might benefit from the presented approach using virtually sensed microphone signals, especially in noisy environments.
Item Type: |
Journal Article (Original Article) |
---|---|
Division/Institute: |
04 Faculty of Medicine > Department of Head Organs and Neurology (DKNS) > Clinic of Ear, Nose and Throat Disorders (ENT) 10 Strategic Research Centers > ARTORG Center for Biomedical Engineering Research > ARTORG Center - Hearing Research Laboratory |
Graduate School: |
Graduate School for Cellular and Biomedical Sciences (GCB) |
UniBE Contributor: |
Fischer, Tim Alois, Caversaccio, Marco, Wimmer, Wilhelm |
Subjects: |
600 Technology > 610 Medicine & health 500 Science > 570 Life sciences; biology |
ISSN: |
0378-5955 |
Publisher: |
Elsevier |
Language: |
English |
Submitter: |
Wilhelm Wimmer |
Date Deposited: |
28 Jun 2021 16:03 |
Last Modified: |
05 Dec 2022 15:51 |
Publisher DOI: |
10.1016/j.heares.2021.108294 |
BORIS DOI: |
10.48350/157003 |
URI: |
https://boris.unibe.ch/id/eprint/157003 |