In this paper we describe an efficient scheme for compression and flexible spatial rendering of audio signals. The method is based on Binaural Cue Coding (BCC), which was recently introduced for efficient compression of multi-channel audio signals. The encoder input consists of several monophonic signals without directional spatial cues, such as separate sound source signals. The signal transmitted to the decoder consists of the mono sum-signal of all input signals plus a low-bit-rate (e.g., 2 kb/s) set of BCC parameters. The mono signal can be encoded with any conventional audio or speech coder. Using the BCC parameters and the mono signal, the BCC synthesizer can flexibly render a spatial image by determining the perceived direction of the audio content of each of the encoder input signals. We provide the results of an audio quality assessment using headphones, which is a more critical scenario than loudspeaker playback.
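The signal flow described in the abstract (several mono inputs, one transmitted sum signal plus compact side information, direction chosen at the receiver) can be illustrated with a toy sketch. This is not the paper's algorithm: the abstract does not specify the BCC parameters, so the sketch assumes the side information is simply each source's share of the power in each time block, computed over the full band rather than in subbands, and it renders with level differences only. The names `encode`, `render`, `shares`, `pans`, and `block` are hypothetical and chosen for illustration.

```python
import math

def encode(sources, block=4):
    """Downmix equal-length mono sources to one sum signal and compute,
    per time block, each source's share of the summed source powers.
    (Assumed stand-in for the BCC side information; full-band only.)"""
    n = len(sources[0])
    mono = [sum(s[t] for s in sources) for t in range(n)]
    shares = []
    for start in range(0, n, block):
        powers = [sum(x * x for x in s[start:start + block]) for s in sources]
        total = sum(powers)
        if total > 0:
            shares.append([p / total for p in powers])
        else:
            # Silent block: split evenly so the gains stay well defined.
            shares.append([1.0 / len(sources)] * len(sources))
    return mono, shares

def render(mono, shares, pans, block=4):
    """Render stereo from the mono sum. pans[i] in [0, 1] is the position
    the receiver picks for source i (0 = left, 1 = right), applied with
    constant-power panning weighted by the per-block power shares."""
    left, right = [], []
    for b, share in enumerate(shares):
        gl2 = sum(w * math.cos(p * math.pi / 2) ** 2 for w, p in zip(share, pans))
        gr2 = sum(w * math.sin(p * math.pi / 2) ** 2 for w, p in zip(share, pans))
        gl, gr = math.sqrt(gl2), math.sqrt(gr2)
        for x in mono[b * block:(b + 1) * block]:
            left.append(gl * x)
            right.append(gr * x)
    return left, right

# Two sources that are active in different blocks.
src_a = [1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0]
src_b = [0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0]
mono, shares = encode([src_a, src_b])
# The receiver decides where each source goes: a hard left, b hard right.
left, right = render(mono, shares, pans=[0.0, 1.0])
```

Because the pan positions are supplied only at the rendering stage, the same transmitted data can be re-rendered with different source directions, which is the flexibility the abstract refers to. In this sketch the left and right gains in each block have squared values that sum to one, so the rendered power matches that of the mono signal.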