AES E-Library

AES E-Library

An Investigation of Temporal Feature Integration for a Low-Latency Classification with Application to Speech/Music/Mix Classification

Document Thumbnail

In this paper we propose several methodologies for the use of feature integration and evaluate them in a low-latency classification framework. These general methodologies are based on three key aspects that will be assessed in this study: the selection of the features that have to be temporally integrated, the choice of the integration techniques, i.e., how the temporal information is extracted, and the size of the integration window. The experiments carried out for the speech/music/mix classification task show that the different methodologies have a significant impact on the global performance. Compared to the state of the art procedures, the methodologies we proposed achieved the best performance, even with the low-latency constraints.

Authors:
Affiliations:
AES Convention: Paper Number:
Publication Date:
Subject:
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=17503

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Learn more about the AES E-Library

E-Library Location:

Start a discussion about this paper!


AES - Audio Engineering Society