This paper proposes a Gaussian mixture model (GMM)-based music discrimination system for mobile broadcasting receivers. The objective of the system is automatically archiving music signals from audio broadcasting programs that are normally mixed with human voices, acoustic noises, commercial advertisements, and so on. To enhance the robustness of the system performance and to sharply cut the starting/ending-point of the recording, we also introduce a post-processing module whose features consist of signal duration, energy dynamics, and local variation of feature statistics. Experimental results to various input signals verify the superiority of the proposed system.
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.