For object-based audio an appropriate definition of metadata is needed to ensure flexible playback in any reproduction scenario and to allow for interactivity. Important use-cases for object-based audio and audio interactivity are described and metadata requirements are derived. A metadata scheme is defined that allows for enhanced audio rendering techniques such as content-dependent processing, automatic scene scaling and enhanced level control. Also, a metadata preprocessing logic is proposed that prepares rendering and playout and allows for user interaction with the audio content of an object-based scene. In addition, the paper points out how the metadata can be transported efficiently in a bitstream. The proposed metadata scheme has been adopted and integrated into the currently finalized MPEG-H 3D Audio standard.
https://www.aes.org/e-lib/browse.cfm?elib=17420
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.
Learn more about the AES E-Library
Start a discussion about this paper!