Synthesis of Spatially Extended Virtual Source with Time-Frequency Decomposition of Mono Signals

Pihlajamäki, Tapani; Santala, Olli; Pulkki, Ville

AES E-Library

Auditory displays, driven by nonauditory data, are often used to present a sound scene to a listener. Typically, the sound field places sound objects at different locations, but the scene becomes aurally richer if the perceived sonic objects have a spatial extent (size), an approach called volumetric virtual coding. Previous research in virtual-world Directional Audio Coding has shown that spatial extent can be synthesized from monophonic sources by applying a time-frequency-space decomposition, i.e., randomly distributing time-frequency bins of the source signal. However, this technique does not guarantee a stable perceived size, and the timbre can degrade. This study explores how to optimize volumetric coding in terms of timbral and spatial perception. The suggested approach for most types of audio uses an STFT window size of 1024 samples and then distributes the frequency bands from lowest to highest using the Halton sequence. The results from two formal listening experiments are presented.
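The distribution step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the Halton base, the number of loudspeakers, or how sequence values are mapped to directions, so the base of 2, the eight-loudspeaker layout, the use of individual STFT bins as "bands", and the helper names `halton` and `assign_bins` are all assumptions made here for illustration.

```python
def halton(index, base=2):
    """Return element `index` (1-based) of the Halton sequence in [0, 1)."""
    result, fraction = 0.0, 1.0
    while index > 0:
        fraction /= base
        result += fraction * (index % base)
        index //= base
    return result


def assign_bins(num_bins, num_speakers, base=2):
    """Map frequency bins, lowest to highest, to loudspeaker indices.

    Assumption: the Halton value in [0, 1) is scaled linearly onto the
    loudspeaker indices 0 .. num_speakers - 1.
    """
    return [min(int(halton(k + 1, base) * num_speakers), num_speakers - 1)
            for k in range(num_bins)]


WINDOW = 1024               # STFT window size suggested in the abstract
NUM_BINS = WINDOW // 2 + 1  # non-redundant bins of a real-valued FFT
NUM_SPEAKERS = 8            # hypothetical loudspeaker count

# mapping[k] is the loudspeaker that would reproduce frequency bin k.
mapping = assign_bins(NUM_BINS, NUM_SPEAKERS)
```

Because the Halton sequence is a deterministic low-discrepancy sequence, neighboring bins land on well-separated loudspeakers and the assignment is identical on every run, which is one plausible reason to prefer it over a purely random distribution.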

Authors: Pihlajamäki, Tapani; Santala, Olli; Pulkki, Ville
Affiliation: Aalto University, Department of Signal Processing and Acoustics, Helsinki, Finland
JAES Volume 62 Issue 7/8 pp. 467-484; July 2014
Publication Date: August 22, 2014
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=17339

This paper is Open Access, which means you can download it for free.

E-Library Location: (CD JAES62) /jaes62/7/pg467.pdf

DOI: https://doi.org/10.17743/jaes.2014.0031
