AES E-Library

Temporal Segmentation and Pre-analysis for Non-linear Time-scaling of Audio.

We present a method for segmenting note events for use with non-linear time-scaling algorithms. It greatly reduces the artefacts that arise from rhythmic distortions and from soft note transitions being treated as percussive transients. The proposed algorithm isolates percussive transients as a subset of note onsets, leading to a more meaningful segmentation. A subband-based hybrid onset detection algorithm forms the basis of this segmentation scheme. Its key elements are a new frequency content distance measure, automatic threshold setting, and subband result validation. At the subband recombining stage, the algorithm differentiates note onsets, which may appear in one or more subbands, from percussive transients, which appear in multiple subbands.
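The subband recombining idea in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: the function name `classify_onsets`, the grouping tolerance, and the rule "at least `min_bands_for_transient` subbands means percussive transient" are all assumptions made for illustration, since the abstract does not state how many subbands are required or how onsets are matched across subbands.

```python
from typing import List, Tuple


def classify_onsets(
    subband_onsets: List[List[float]],
    tolerance: float = 0.02,            # assumed grouping window in seconds
    min_bands_for_transient: int = 3,   # assumed threshold, not from the paper
) -> List[Tuple[float, str]]:
    """Merge per-subband onset times and label each merged event.

    subband_onsets[b] holds the onset times (seconds) detected in subband b.
    Onsets from different subbands that fall within `tolerance` of the
    earliest onset in a group are treated as one event.
    """
    # Flatten to (time, band) pairs, sorted by time.
    events = sorted(
        (t, band) for band, times in enumerate(subband_onsets) for t in times
    )

    results: List[Tuple[float, str]] = []
    i = 0
    while i < len(events):
        start = events[i][0]
        bands = {events[i][1]}
        j = i + 1
        # Collect every onset close enough to the start of this group.
        while j < len(events) and events[j][0] - start <= tolerance:
            bands.add(events[j][1])
            j += 1
        # Hypothetical rule: many subbands firing together suggests a
        # percussive transient; otherwise treat the event as a note onset.
        if len(bands) >= min_bands_for_transient:
            label = "percussive_transient"
        else:
            label = "note_onset"
        results.append((start, label))
        i = j
    return results


if __name__ == "__main__":
    # Four subbands; all fire near 0.10 s, single bands fire at 0.50 s and 0.90 s.
    detected = [[0.10, 0.50], [0.11, 0.90], [0.105], [0.10]]
    for time, label in classify_onsets(detected):
        print(f"{time:.3f} s  {label}")
```

A segmenter built on this would then protect only the events labelled as percussive transients from time-scaling, while treating softer note onsets as ordinary segment boundaries.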

Authors:
Affiliation:
AES Convention: Paper Number:
Publication Date:
Subject:
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=12493


AES - Audio Engineering Society