A Non-linear Rhythm-Based Style Classifciation for Broadcast Speech-Music Discrimination

Batlle, Eloi and Guaus, Enric

AES E-Library

A Non-linear Rhythm-Based Style Classifciation for Broadcast Speech-Music Discrimination

Speech-Music discriminators are usually designed under some rigid constrains. This paper deals with a more general Speech-Music Discriminator successfully used in AIDA project. The system is based on a Hidden Markov Model style classification process in which the styles are grouped into two major categories: Speech or Music. The goals of this sub-system are (1)the expandible possibilities with the addition of some new styles (like "phone female voice"), (2)the use of new rhytmical descriptors in combination with other typical ones and (3)the robustness of our speech/music discriminator in many different environments by using some Mathematical Morphology and non-linear post-processing techniques. The techniques used in our system allow a fast track in changes between styles and, thus, typical confusions in commercials can be easily cleaned. The accuracy of this system can be up to a 94.3% in broadcast radio environment.

Author (s): Batlle, Eloi; Guaus, Enric;
Affiliation: Audiovisual Institute, Pompeu Fabra University, Barcelona, Spain (See document for exact affiliation information.)
AES Convention: 116 Paper Number:6013
Publication Date: 2004-05-06
Session subject: Audio Archiving, Storage, and Restoration: Content Management

DOI:

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Type: Convention Paper

AES Conventions

AES Conferences

AES Training & Development

AES Inside Track

Journal of the AES

AES E-library

Special Publications

AES Sections are active around the world and provide a means for members to meet locally.

AES Student Website

AES Educational Foundation

Student Sections

See the committee’s accomplishments in diversity & inclusion

AES Statement of solidarity

Richard C. Heyser Memorial Lecture Series

AES E-Library

A Non-linear Rhythm-Based Style Classifciation for Broadcast Speech-Music Discrimination

Choose your country of residence from this list:

AES E-Library

Login Institutions

A Non-linear Rhythm-Based Style Classifciation for Broadcast Speech-Music Discrimination

Choose your country of residence from this list: