Localization of sound sources with cochlear implants
Localization of sound sources with cochlear implants
Disciplines
Computer Sciences (20%); Clinical Medicine (30%); Physics, Astronomy (20%); Psychology (30%)
Keywords
-
Cochlear Implant,
Hearing,
Localization,
Hrtf,
Pinna,
Learning
Electrical stimulation of the cochlear nerve by means of Cochlear Implant (CI) systems has been shown to successfully substitute basic functions of the inner ear: it enables deafened people to understand speech. Current CI systems, however, can not replace all functions of the normal auditory system. While localization of sound sources in the left/right dimension has been shown to be possible by means of CI stimulation at both ears, localization in the front/back and up/down dimensions (within the so-called sagittal planes) still remains difficult because of insufficient transmission of relevant cues by current CI systems. In normal hearing, localization in the sagittal planes is achieved by the evaluation of spectral colouring of the incoming sound induced by the external ear (pinna) at high frequencies (pinna cue or spectral cue). Current CI systems, however, do not transmit pinna cues due to the placement of the microphone behind the pinna and the restricted bandwidth of the signal analysis stage of the processor. While these technical limitations can be handled, the following basic questions remain: a) How many frequency channels are required to transmit pinna cues relevant for sound localization in the sagittal planes? b) To what extent can CI listeners learn to localize sound sources in the sagittal planes using high-frequency pinna cues that are mapped to tonotopic places with lower characteristic frequencies, as usually covered by CI electrodes? In this project these questions are studied by conducting psychoacoustic experiments with CI and normal hearing subjects, the latter being presented with acoustic simulations of the perception of a cochlear implant listener. The ultimate goal of the project is to enable CI listeners to localize sound sources in the sagittal planes. First, a series of experiments determines the effects of parameters, such as the number and tonotopic range of the stimulating electrodes, on the coding of pinna cues and the speech signal. In a second stage, the ability of both subject groups to learn localization in the sagittal planes is studied using pinna cues that are frequency-shifted to match the tonotopic range associated with the basal electrodes in CI listeners. The subjects learn to associate auditory stimuli with spatial positions by means of audio-visual stimulus presentation. CI listeners are expected to benefit from the coding of pinna cues with respect to localization of sound sources, in particular regarding front/back discrimination.
Current cochlear implants (CIs) allow implantees some ability to localize sounds in the left/right dimension. However, localization in the vertical planes, involving front/back and up/down localization, is more complicated because of insufficient transmission of relevant cues. In normal hearing (NH), localization in the vertical planes is achieved by the evaluation of spectral coloring of the incoming sound induced by the external ear at high frequencies. In this project we addressed several basic questions on the feasibility of providing spectral localization cues with CIs, with the ultimate aim to allow CI listeners to localize in the vertical planes. First, we developed methods to train listeners with new localization cues, involving a virtual audio-visual environment. Next, we studied the ability of bilateral CI listeners to localize in 3-D space with their clinical devices. As expected, vertical-plane localization was much worse than in NH listeners and relied solely on absolute level cues instead of spectral cues. Next, we explored the frequency channels and spectral range required for high speech intelligibility in quiet and background with a 12-channel CI. At least eight channels and frequencies up to about 3 kHz were found to be sufficient, allowing to encode spectral localization cues via the remaining four channels. Next, we addressed the question if CI listeners are able to discriminate between different spectral shapes, which is required for vertical-plane localization. All six CI listeners were sensitive to spectral peaks and notches imposed on a constant-loudness background, but the sensitivity declined dramatically when listeners were prevented from using absolute level cues. This is in contrast to NH listeners, who are sensitive even without absolute level cues. Given that NH listeners receive both spectral and temporal cues, the latter having been excluded in our CI study, we speculate on the importance of temporal cues. Next, we studied how many frequency channels are required for robust vertical-plane sound localization after an audio-visual training. NH subjects, listening to acoustic CI simulations, showed robust localization in a configuration with only four channels within the frequency range containing spectral localization cues. In the last project stage, we hypothesized that the auditory localization mechanisms can adapt to a warping of spectral localization information towards lower frequencies. This is important because in electric hearing the upper limit of the stimulation range is considerably lower than in acoustic hearing. Fifteen NH listeners completed a localization training experiment, involving two-hour daily training over a period of three weeks. After an initial strong increase of the localization error relative to the baseline performance without frequency-warping, the subjects` localization performance improved steadily, although not reaching the baseline performance at the end of the training period. The combination of results from the different project stages shows that it may be possible to enable vertical-plane localization in recipients of future CI systems by properly encoding spectral localization cues. While our results indicate that directional features can be mapped to the limited range of spectral channels available in electric hearing, one major challenge towards this goal will be to improve the CI listeners` spectral shape discriminate ability, possibly by means of including temporal cues.
Research Output
- 403 Citations
- 11 Publications
-
2009
Title Effects of center frequency and rate on the sensitivity to interaural delay in high-frequency click trains DOI 10.1121/1.3120413 Type Journal Article Author Majdak P Journal The Journal of the Acoustical Society of America Pages 3903-3913 Link Publication -
2009
Title Fast multipole boundary element method to calculate head-related transfer functions for a wide frequency range DOI 10.1121/1.3177264 Type Journal Article Author Kreuzer W Journal The Journal of the Acoustical Society of America Pages 1280-1290 Link Publication -
2009
Title Enhancing sensitivity to interaural time differences at high modulation rates by introducing temporal jitter DOI 10.1121/1.3206584 Type Journal Article Author Goupell M Journal The Journal of the Acoustical Society of America Pages 2511-2521 Link Publication -
2008
Title Effects of upper-frequency boundary and spectral warping on speech intelligibility in electrical stimulation DOI 10.1121/1.2831738 Type Journal Article Author Goupell M Journal The Journal of the Acoustical Society of America Pages 2295-2309 Link Publication -
2008
Title Current-level discrimination and spectral profile analysis in multi-channel electrical stimulation DOI 10.1121/1.2981638 Type Journal Article Author Goupell M Journal The Journal of the Acoustical Society of America Pages 3142-3157 Link Publication -
2011
Title Two-Dimensional Localization of Virtual Sound Sources in Cochlear-Implant Listeners DOI 10.1097/aud.0b013e3181f4dfe9 Type Journal Article Author Majdak P Journal Ear and Hearing Pages 198-208 Link Publication -
2010
Title 3-D localization of virtual sound sources: Effects of visual environment, pointing method, and training DOI 10.3758/app.72.2.454 Type Journal Article Author Majdak P Journal Attention, Perception, & Psychophysics Pages 454-469 Link Publication -
2010
Title Median-plane sound localization as a function of the number of spectral channels using a channel vocoder DOI 10.1121/1.3283014 Type Journal Article Author Goupell M Journal The Journal of the Acoustical Society of America Pages 990-1001 Link Publication -
2010
Title Interaural fluctuations and the detection of interaural incoherence. IV. The effect of compression on stimulus statistics DOI 10.1121/1.3505104 Type Journal Article Author Goupell M Journal The Journal of the Acoustical Society of America Pages 3691-3702 Link Publication -
2013
Title Effect of long-term training on sound localization performance with spectrally warped and band-limited head-related transfer functions DOI 10.1121/1.4816543 Type Journal Article Author Majdak P Journal The Journal of the Acoustical Society of America Pages 2148-2159 -
2010
Title Binaurally-Coherent Jitter Improves Neural and Perceptual ITD Sensitivity in Normal and Electric Hearing DOI 10.1007/978-1-4419-5686-6_29 Type Book Chapter Author Goupell M Publisher Springer Nature Pages 303-313