Quantifying the Effect of Room Response on Automatic Speech Recognition Systems
It has been demonstrated that the acoustic environment has an impact on timbre and speech intelligibility. Automatic speech recognition is an established area that suffers from the negative effects of mismatch between different room impulse responses (RIR.) To better understand the changes imparted by the RIR, we have created synthetic responses to simulate utterances recorded in different locations. Using speech recognition techniques to quantify our results, we then looked for trends in performance to connect with impulse response changes.
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is temporarily free for AES members.