Research interest in technologies for supporting people in their own homes is steadily increasing. In this context, this paper proposes a speech-interfaced system for recognizing home automation commands and distress calls. The robustness of the system is increased by employing Power Normalized Cepstral Coefficients as features and by using an adaptive algorithm to reduce known sources of interference. In addition, the mismatch introduced by vocal effort variability is reduced by employing a vocal effort classifier and multiple acoustic models. Performance has been evaluated on ITAAL, a recently proposed corpus of home automation commands and distress calls in Italian. The results confirm that the adopted solutions are effective in a distorted acoustic scenario.
https://www.aes.org/e-lib/browse.cfm?elib=17235