A method of automatic recognition of regular voice, raised voice and scream used for audio surveillance system is presented. The algorithm for detection of voice activity in a noisy environment is discussed. Signal features used for sound classification, based on energy, spectral shape and tonality are introduced. Sound feature vectors are processed by a fuzzy classifier. The method is employed in an audio surveillance system working in real-time both in an indoor and outdoor environment. Achieved results of classifying real sound signals are presented and discussed.
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.