1 Department of Electronic Systems, The Faculty of Engineering and Science (ENG), Aalborg University, VBN2 Multimedia Information and Signal Processing, The Faculty of Engineering and Science (ENG), Aalborg University, VBN3 The Faculty of Engineering and Science (TECH), Aalborg University, VBN4 Royal Institute of Technology5 Signal and Information Processing, The Faculty of Engineering and Science (ENG), Aalborg University, VBN6 Royal Institute of Technology
In this letter the focus is on linear filtering of speech before degradation due to additive background noise. The goal is to design the filter such that the speech intelligibility index (SII) is maximized when the speech is played back in a known noisy environment. Moreover, a power constraint is taken into account to prevent uncomfortable playback levels and deal with loudspeaker constraints. Previous methods use linear approximations of the SII in order to find a closed-form solution. However, as we show, these linear approximations introduce errors in low SNR regions and are therefore suboptimal. In this work we propose a nonlinear approximation of the SII which is accurate for all SNRs. Experiments show large intelligibility improvements with the proposed method over the unprocessed noisy speech and better performance than one state-of-the art method.
I E E E Signal Processing Letters, 2013, Vol 20, Issue 3, p. 225-228