The reproduction of speech over loudspeakers in a reverberant environment is often encountered in daily life, as for example, in a train station or during a telephone conference. Spatial reverberation degrades intelligibility. This study proposes two perceptually motivated preprocessing approaches that are applied to the dry speech before being played into a reverberant environment. In the first algorithm, which assumes prior knowledge of the room impulse response, the amount of overlap-masking due to successive phonemes is reduced. In the second algorithm, emphasizing onsets is combined with overlap-masking. A speech intelligibility model is used to find the best parameters for these algorithms by minimizing the predicted speech reception thresholds. Listening tests show that this preprocessing method can indeed improve speech intelligibility in reverberant environments. In listening tests, Speech Reception Thresholds improved up to 6 dB.
https://www.aes.org/e-lib/browse.cfm?elib=18541
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.
Learn more about the AES E-Library
Start a discussion about this paper!