Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users

Speech understanding in noisy environments is still one of the major challenges for cochlear implant (CI) users in everyday life. We evaluated a speech enhancement algorithm based on neural networks (NNSE) for improving speech intelligibility in noise for CI users. The algorithm decomposes the noisy...

Full description

Saved in:

Bibliographic Details
Published in	Hearing research Vol. 344; pp. 183 - 194
Main Authors	Goehring, Tobias, Bolner, Federico, Monaghan, Jessica J.M., van Dijk, Bas, Zarowski, Andrzej, Bleeck, Stefan
Format	Journal Article
Language	English
Published	Netherlands Elsevier B.V 01.02.2017 Elsevier/North-Holland Biomedical Press
Subjects	Acoustic Stimulation Acoustics Adult Aged Aged, 80 and over Algorithms Audiometry, Speech Cochlear Implantation - instrumentation Cochlear Implants Comprehension Electric Stimulation Humans Machine learning Middle Aged Neural networks Neural Networks (Computer) Noise - adverse effects Noise reduction Perceptual Masking Persons With Hearing Impairments - psychology Persons With Hearing Impairments - rehabilitation Prosthesis Design Research Paper Signal Processing, Computer-Assisted Sound Spectrography Speech enhancement Speech Intelligibility Speech Perception Young Adult Speech enhancement Cochlear implants Noise reduction Neural networks Machine learning
Online Access	Get full text
ISSN	0378-5955 1878-5891 1878-5891
DOI	10.1016/j.heares.2016.11.012

Cover

More Information
Summary:	Speech understanding in noisy environments is still one of the major challenges for cochlear implant (CI) users in everyday life. We evaluated a speech enhancement algorithm based on neural networks (NNSE) for improving speech intelligibility in noise for CI users. The algorithm decomposes the noisy speech signal into time-frequency units, extracts a set of auditory-inspired features and feeds them to the neural network to produce an estimation of which frequency channels contain more perceptually important information (higher signal-to-noise ratio, SNR). This estimate is used to attenuate noise-dominated and retain speech-dominated CI channels for electrical stimulation, as in traditional n-of-m CI coding strategies. The proposed algorithm was evaluated by measuring the speech-in-noise performance of 14 CI users using three types of background noise. Two NNSE algorithms were compared: a speaker-dependent algorithm, that was trained on the target speaker used for testing, and a speaker-independent algorithm, that was trained on different speakers. Significant improvements in the intelligibility of speech in stationary and fluctuating noises were found relative to the unprocessed condition for the speaker-dependent algorithm in all noise types and for the speaker-independent algorithm in 2 out of 3 noise types. The NNSE algorithms used noise-specific neural networks that generalized to novel segments of the same noise type and worked over a range of SNRs. The proposed algorithm has the potential to improve the intelligibility of speech in noise for CI users while meeting the requirements of low computational complexity and processing delay for application in CI devices. •An algorithm for improving speech understanding in noise for cochlear implant users is evaluated.•Significant improvements were found for stationary and non-stationary noise types.•It generalizes to a novel speaker and works over a range of signal-to-noise ratios.•The small algorithmic delay makes it suitable for real-time application.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 These authors contributed equally to this work.
ISSN:	0378-5955 1878-5891 1878-5891
DOI:	10.1016/j.heares.2016.11.012