A real-time circuit is described that automatically discriminates between speech and music signals. By means of a fuzzy feature combiner, the circuit outputs an estimate of the probability that the input is speech. The discriminator is tested with speakers of both sexes in various languages (English, Danish, Dutch, French, German, and Japanese) against various types of music (pop, opera, romantic, baroque, and various solo musical instruments). The discriminator is found to be extremely reliable: the false-alarm probability (inferring speech when the input is music) is virtually zero.
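The abstract does not say which features the circuit measures or how the fuzzy combiner weights them. As a rough illustration only, a fuzzy feature combiner can be sketched as mapping each feature onto a membership value in [0, 1] and then merging those values into a single speech score. Everything in the sketch below is an assumption made for illustration: the function names, the feature names `feature_a` and `feature_b`, the ramp-shaped membership function, the numeric ranges, and the use of a plain average as the combination rule. None of it is taken from the paper.

```python
def ramp(x, lo, hi):
    """Piecewise-linear fuzzy membership (hypothetical choice).

    Returns 0.0 at or below lo, 1.0 at or above hi, and a linear
    interpolation in between.
    """
    if hi <= lo:
        raise ValueError("hi must be greater than lo")
    return min(1.0, max(0.0, (x - lo) / (hi - lo)))


def combine_speech_score(features, ranges):
    """Average the per-feature memberships into one score in [0, 1].

    features: dict mapping feature name -> measured value
    ranges:   dict mapping feature name -> (lo, hi) ramp limits
    Averaging is an assumed combination rule, not the paper's.
    """
    memberships = [ramp(features[name], *ranges[name]) for name in ranges]
    return sum(memberships) / len(memberships)


# Hypothetical feature names and ranges, chosen only to exercise the code.
example_ranges = {"feature_a": (0.0, 1.0), "feature_b": (2.0, 4.0)}
example_features = {"feature_a": 0.5, "feature_b": 4.0}
example_score = combine_speech_score(example_features, example_ranges)
```

A score near 1 would be read as "probably speech" and a score near 0 as "probably music". A real design would choose the features, membership shapes, and combination rule from measured data.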
https://www.aes.org/e-lib/browse.cfm?elib=12093