Perceptual Optimization of Audio Representations Based on Time-Frequency Masking Data for Maximally-Compact Stimuli

Necciari, Thibaud; Balazs, Peter; Kronland-Martinet, Richard; Ystad, Sølvi; Laback, Bernhard; Savel, Sophie; Meunier, Sabine

AES E-Library

Perceptual Optimization of Audio Representations Based on Time-Frequency Masking Data for Maximally-Compact Stimuli

Many audio applications use time-frequency representations such as the Gabor and wavelet transforms. For such applications, it is often required that the signal representation matches human auditory perception and allows reconstruction. On that purpose, this paper presents the results of psychoacoustical experiment on auditory time-frequency masking using stimuli with maximal concentration in the time-frequency plane. These new data constitute a crucial basis for the prediction of auditory masking in the time-frequency representations of sound signals. An algorithm that removes the inaudible components in the wavelet transform of a sound while causing no audible difference to the original sound after re-synthesis is proposed. Preliminary results are promising, although further development is required.

Authors: Necciari, Thibaud; Balazs, Peter; Kronland-Martinet, Richard; Ystad, Sølvi; Laback, Bernhard; Savel, Sophie; Meunier, Sabine
Affiliations: Acoustics Research Institute, Austrian Academy of Sciences, Vienna, Austria; Laboratoire de Mécanique et d'Acoustique, Marseille, France(See document for exact affiliation information.)
AES Conference: 45th International Conference: Applications of Time-Frequency Processing in Audio (March 2012)
Paper Number: 3-1
Publication Date: March 1, 2012 Import into BibTeX
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=16173

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Learn more about the AES E-Library

E-Library Location: (CD 45thPapers) /conf/45/aes45-000006.pdf

Start a discussion about this paper!

AES E-Library

Perceptual Optimization of Audio Representations Based on Time-Frequency Masking Data for Maximally-Compact Stimuli

ABOUT AES

Contact Us