Effect of Noise Reduction Gain Errors on Simulated Cochlear Implant Speech Intelligibility

Abigail A Kressner, Tobias May, Torsten Dau, Abigail A Kressner, Tobias May, Torsten Dau

Abstract

It has been suggested that the most important factor for obtaining high speech intelligibility in noise with cochlear implant (CI) recipients is to preserve the low-frequency amplitude modulations of speech across time and frequency by, for example, minimizing the amount of noise in the gaps between speech segments. In contrast, it has also been argued that the transient parts of the speech signal, such as speech onsets, provide the most important information for speech intelligibility. The present study investigated the relative impact of these two factors on the potential benefit of noise reduction for CI recipients by systematically introducing noise estimation errors within speech segments, speech gaps, and the transitions between them. The introduction of these noise estimation errors directly induces errors in the noise reduction gains within each of these regions. Speech intelligibility in both stationary and modulated noise was then measured using a CI simulation tested on normal-hearing listeners. The results suggest that minimizing noise in the speech gaps can improve intelligibility, at least in modulated noise. However, significantly larger improvements were obtained when both the noise in the gaps was minimized and the speech transients were preserved. These results imply that the ability to identify the boundaries between speech segments and speech gaps may be one of the most important factors for a noise reduction algorithm because knowing the boundaries makes it possible to minimize the noise in the gaps as well as enhance the low-frequency amplitude modulations of the speech.

Keywords: cochlear implant; noise reduction; sound coding; speech intelligibility.

Figures

Figure 1.
Figure 1.
(a) Electrodogram showing stimulation levels above threshold for a clean sentence. Speech segments, transitions, and gaps are identified by the white, light gray, and dark gray shading, respectively. (b to j) Electrodograms showing unthresholded levels for the same sentence mixed with speech-shaped noise at 0 dB and then denoised using the indicated gain matrix, where G^gG^tG^s indicates the use of nonideal gains in the gap, transition, and speech regions, respectively; G^gG^tGs indicates the use of nonideal gains in the gap and transition regions but ideal gains in the speech regions, and so forth.
Figure 2.
Figure 2.
Individual SRTs for listeners who heard (a) stationary noise and (b) modulated noise. The condition labels along the abscissa are defined in the text, as well as in the caption of Figure 1. SRTs = speech reception thresholds; UN = unprocessed noisy speech.
Figure 3.
Figure 3.
(a) SRT and (b) SRT improvements relative to the reference condition (UN). Means are marked with stars. Boxplots show the 25th, 50th, and 75th percentiles, together with whiskers that extend to cover all data points not considered outliers. Outliers are marked with circles. SRT improvements that were not significantly different from one another (α = .05) are grouped via colored, horizontal lines at the bottom of the plot. The condition labels along the abscissa are defined in the text, as well as in the caption of Figure 1. SRTs = speech reception thresholds; UN = unprocessed noisy speech.

References

    1. Bates D., Mächler M., Bolker B., Walker S. (2015) Fitting linear mixed-effects models using lme4. Journal of Statistical Software 67(1): 1–48.
    1. Bernstein J. G., Grant K. W. (2009) Auditory and auditory-visual intelligibility of speech in fluctuating maskers for normal-hearing and hearing-impaired listeners. The Journal of the Acoustical Society of America 125(5): 3358–3372.
    1. Bhattacharya A., Vandali A., Zeng F.-G. (2011) Combined spectral and temporal enhancement to improve cochlear-implant speech perception. The Journal of the Acoustical Society of America 130(5): 2951–2960.
    1. Champoux F., Lepore F., Gagné J.-P., Théoret H. (2009) Visual stimuli can impair auditory processing in cochlear implant users. Neuropsychologia 47(1): 17–22.
    1. Cohen I. (2003) Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging. IEEE Speech Audio Process 11(5): 466–475.
    1. Dorman M. F., Liss J., Wang S., Berisha V., Ludwig C., Natale S. C. (2016) Experiments on auditory-visual perception of sentences by users of unilateral, bimodal, and bilateral cochlear implants. Journal of Speech, Language, and Hearing Research 59(6): 1505–1519.
    1. Duquesnoy A. (1983) Effect of a single interfering noise or speech source upon the binaural sentence intelligibility of aged persons. The Journal of the Acoustical Society of America 74(3): 739–743.
    1. Festen J. M., Plomp R. (1990) Effects of fluctuating noise and interfering speech on the speech-reception threshold for impaired and normal hearing. The Journal of the Acoustical Society of America 88(4): 1725–1736.
    1. Fu Q.-J., Nogaki G. (2005) Noise susceptibility of cochlear implant users: The role of spectral resolution and smearing. Journal of the Association for Research in Otolaryngology 6(1): 19–27.
    1. Geurts L., Wouters J. (1999) Enhancing the speech envelope of continuous interleaved sampling processors for cochlear implants. The Journal of the Acoustical Society of America 105(4): 2476–2484.
    1. Goehring T., Bolner F., Monaghan J. J., van Dijk B., Zarowski A., Bleeck S. (2017) Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users. Hearing Research 344: 183–194.
    1. Hersbach A. A., Grayden D. B., Fallon J. B., McDermott H. J. (2013) A beamformer post-filter for cochlear implant noise reduction. The Journal of the Acoustical Society of America 133(4): 2412–2420.
    1. Hochberg I., Boothroyd A., Weiss M., Hellman S. (1992) Effects of noise and noise suppression on speech perception by cochlear implant users. Ear and Hearing 13(4): 263–271.
    1. Holden L. K., Vandali A. E., Skinner M. W., Fourakis M. S., Holden T. A. (2005) Speech recognition with the advanced combination encoder and transient emphasis spectral maxima strategies in nucleus 24 recipients. Journal of Speech, Language, and Hearing Research 48(3): 681–701.
    1. Holube I., Fredelake S., Vlaming M., Kollmeier B. (2010) Development and analysis of an international speech test signal (ISTS). International Journal of Audiology 49(12): 891–903.
    1. Hu Y., Loizou P. C. (2007) A comparative intelligibility study of single-microphone noise reduction algorithms. The Journal of the Acoustical Society of America 122(3): 1777–1786.
    1. Koning R., Wouters J. (2012) The potential of onset enhancement for increased speech intelligibility in auditory prostheses. The Journal of the Acoustical Society of America 132(4): 2569–2581.
    1. Koning R., Wouters J. (2016) Speech onset enhancement improves intelligibility in adverse listening conditions for cochlear implant users. Hearing Research 342: 13–22.
    1. Kressner A. A., May T., Høegh R. M. T., Juhl K. A., Bentsen T., Dau T. (2017) Investigating the effects of noise-estimation errors in simulated cochlear implant speech intelligibility. Proceedings of the International Symposium on Auditory and Audiological Research 6: 295–302.
    1. Kuznetsova A., Brockhoff P., Christensen R. (2017) LmerTest package: Tests in linear mixed effects models. Journal of Statistical Software 82(13): 1–26.
    1. Kuznetsova A., Christensen R. H., Bavay C., Brockhoff P. B. (2015) Automated mixed ANOVA modeling of sensory and consumer data. Food Quality and Preference 40: 31–38.
    1. Lenth, R. (2018). Emmeans: Estimated marginal means, aka least-squares means. Retrieved from .
    1. Mauger S. J., Arora K., Dawson P. W. (2012) Cochlear implant optimized noise reduction. Journal of Neural Engineering 9(6): 1–9.
    1. Mauger S. J., Warren C. D., Knight M. R., Goorevich M., Nel E. (2014) Clinical evaluation of the Nucleus 6 cochlear implant system: Performance improvements with SmartSound iQ. International Journal of Audiology 53(8): 564–576.
    1. Nelson P. B., Jin S.-H., Carney A. E., Nelson D. A. (2003) Understanding speech in modulated interference: Cochlear implant users and normal-hearing listeners. The Journal of the Acoustical Society of America 113(2): 961–968.
    1. Nielsen J. B., Dau T. (2011) The Danish hearing in noise test. International Journal of Audiology 50(3): 202–208.
    1. Qazi O. U., van Dijk B., Moonen M., Wouters J. (2013) Understanding the effect of noise on electrical stimulation sequences in cochlear implants and its impact on speech intelligibility. Hearing Research 299: 79–87.
    1. Qin M. K., Oxenham A. J. (2003) Effects of simulated cochlear-implant processing on speech reception in fluctuating maskers. The Journal of the Acoustical Society of America 114(1): 446–454.
    1. Schorr E. A., Fox N. A., van Wassenhove V., Knudsen E. I. (2005) Auditory-visual fusion in speech perception in children with cochlear implants. Proceedings of the National Academy of Sciences of the United States of America 102(51): 18748–18750.
    1. Searle S. R., Speed F. M., Milliken G. A. (1980) Population marginal means in the linear model: An alternative to least squares means. The American Statistician 34(4): 216–221.
    1. Spriet A., Van Deun L., Eftaxiadis K., Laneau J., Moonen M., Van Dijk B., Wouters J. (2007) Speech understanding in background noise with the two-microphone adaptive beamformer beam™ in the nucleus freedom™ cochlear implant system. Ear and Hearing 28(1): 62–72.
    1. Vandali A. E. (2001) Emphasis of short-duration acoustic speech cues for cochlear implant users. The Journal of the Acoustical Society of America 109(5): 2049–2061.
    1. Yund E. W., Woods D. L. (2010) Content and procedural learning in repeated sentence tests of speech perception. Ear and Hearing 31(6): 769–778.

Source: PubMed

3
Suscribir