We have evaluated objectively the comparative performance of five noise reduction algorithms. These algorithms were based on the Short-Time Spectral Amplitude (STSA) estimation, subspace projections, wavelet packets with auditory masking, and time-frequency decompositions using matching pursuits. Speech stimuli corrupted by speech-shaped noise and multi-talker babble at 5 different Signal-to-Noise Ratios (SNRs) were used to test the performance of the noise reduction algorithms. Noise reduction performance was quantified using two different methods. In the first method, the Perceptual Evaluation of Speech Quality (PESQ) measure was computed twice - once between the original and noisy speech and the other between the original and enhanced speech. The difference between these two PESQ values indicated the performance of the noise reduction algorithm. The second method was based on the "phase reversed noise" technique where the noise reduction algorithm was tested twice, once with speech + noise and then with speech + phase reversed noise. The PESQ and SNR gain measures were then computed on the combined output. The results obtained from this study indicate that the STSA based algorithm performs better in terms of the amount of noise reduction, while the wavelet packet based algorithm performs better in terms of minimizing the speech distortion introduced by the noise reduction process.
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.