AES E-Library

AES E-Library

Spatial Audio Compression with Adaptive Singular Value Decomposition Using Reconstructed Frames

Document Thumbnail

MPEG-H 3D Audio is the current standard for the compression of higher-order ambisonics data. It uses singular value decomposition (SVD) to spatially decorrelate higher-order ambisonics data, followed by the modified discrete cosine transform to exploit temporal decorrelation. Prominent and ambient sound components are then separately encoded (e.g., using the standard core audio codec) and sent to the decoder. Significant improvements in bitrate and audio quality have been gained in earlier work over MPEG-H by applying the SVD operation in the frequency domain rather than the ambisonics domain. In this work, we provide additional compression gains by adaptively calculating and extending the set of SVD basis vectors, at negligible increase in side information cost, using information attained from the previously reconstructed frame. Objective and subjective results provide evidence for higher compression gains when compared to existing methods.

Authors:
Affiliation:
AES Conference:
Paper Number:
Publication Date:
Subject:
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=21858

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Learn more about the AES E-Library

E-Library Location:

Start a discussion about this paper!


AES - Audio Engineering Society