AES Store

Journal Forum

Perceptual Effects of Dynamic Range Compression in Popular Music Recordings - January 2014
4 comments

Accurate Calculation of Radiation and Diffraction from Loudspeaker Enclosures at Low Frequency - June 2013
9 comments

New Measurement Techniques for Portable Listening Devices: Technical Report - October 2013
1 comment

Access Journal Forum

AES E-Library

Scalable, Content-Based Audio Identification by Multiple Independent Psychoacoustic Matching

A software system for content-based identification of audio recordings is presented. The system transforms its input using a perceptual model of the human auditory system, making its output robust to lossy compression and to other distortions. In order to make use of both the instantaneous pattern of a recording's perceptual features and the information contained in the evolution of these features over time, the system first matches fragments of the input against a database of fragments of known recordings. In a subsequent step, these matches at the fragment level are assembled in order to identify a single recording that matches consistently over time. In a small-scale test the system has matched all queries successfully against a database of 100 000 commercially released recordings.

Authors:
Affiliations:
JAES Volume 52 Issue 4 pp. 366-377; April 2004
Publication Date:

Click to purchase paper or login as an AES member. If your company or school subscribes to the AES Journal then you can look for this paper in the institutional version of the Online Journal. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $20 for non-members, $5 for AES members and is free for E-Library subscribers.

Learn more about the AES E-Library

E-Library Location:

Start a discussion about this paper!


 
Facebook   Twitter   LinkedIn   Google+   YouTube   RSS News Feeds  
AES - Audio Engineering Society