AES E-Library

AES E-Library

A Practical Approach to Robust Speech Recognition Using Two Microphones in Driving Environments

Document Thumbnail

Now that the technologies related to the automatic speech recognition have been mature enough and applicable to our everyday life, people have started considering speech as the most desirable human-device interaction means and utilized speech recognition in vehicles. Nonetheless, it is still challenging to recognize speech correctly in driving environments for at least two reasons. One is that the speech signal is corrupted by innumerable noise sources such as the engine sound, road friction, music from the radio, even worse the mixture of spoken words by passengers, etc. Another is that the recognition device may be put at any place like cup holder, passenger seat or dashboard. In this paper we propose a robust speech recognition front-end that removes the probable ambient noise in a driving car regardless of where the recognition device is. The proposed method finds the direction of speech and enhances the speech signal by first detecting the existence of speech utterance using only two microphones. This front-end is designed with practical consideration so that its implementation in the mobile device showed higher recognition accuracy, shorter processing latency and lower computing power consumption than any other top-tier methods.

Authors:
Affiliation:
AES Convention: Paper Number:
Publication Date:
Subject:
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=17514

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Learn more about the AES E-Library

E-Library Location:

Start a discussion about this paper!


AES - Audio Engineering Society