Journal Forum

Sound Board: High-Resolution Audio - October 2015

Synchronized Swept-Sine: Theory, Application, and Implementation - October 2015

Effect of Microphone Number and Positioning on the Average of Frequency Responses in Cinema Calibration - October 2015
1 comment

Access Journal Forum

AES E-Library

Tonic-Independent Stroke Transcription of the Mridangam

Document Thumbnail

In this paper, we use a data-driven approach for the tonic-independent transcription of strokes of the mridangam, a South Indian hand drum. We obtain feature vectors that encode tonic-invariance by computing the magnitude spectrum of the constant-Q transform of the audio signal. Then we use Non-negative Matrix Factorization (NMF) to obtain a low-dimensional feature space where mridangam strokes are separable. We make the resulting feature sequence event-synchronous using short-term statistics of feature vectors between onsets, before classifying into a predefined set of stroke labels using Support Vector Machines (SVM). The proposed approach is both more accurate and flexible compared to that of tonic-specific approaches.

AES Conference:
Paper Number:
Publication Date:

Click to purchase paper or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $20 for non-members, $5 for AES members and is free for E-Library subscribers.

Learn more about the AES E-Library

E-Library Location:

Start a discussion about this paper!

Facebook   Twitter   LinkedIn   Google+   YouTube   RSS News Feeds  
AES - Audio Engineering Society