Decomposing an arbitrary audio signal into direct and diffuse components is useful for applications such as spatial audio coding, spatial format conversion, binaural rendering, and spatial audio enhancement. This paper describes direct-diffuse decomposition methods for multichannel signals using a linear system of pairwise correlation estimates. The expected value of a correlation coefficient is analytically derived from a signal model with known direct and diffuse energy levels. It is shown that a linear system can be constructed from pairwise correlation coefficients to derive estimates of the Direct Energy Fraction (DEF) for each channel of a multichannel signal. Two direct-diffuse decomposition methods are described that utilize the DEF estimates within a time-frequency analysis-synthesis framework.
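As a rough illustration of how pairwise correlations could yield a linear system, the sketch below assumes a simple signal model that the abstract does not spell out: each channel carries a scaled copy of one fully coherent direct source plus a diffuse component that is uncorrelated with everything else. Under that assumption the expected correlation magnitude between channels i and j is sqrt(DEF_i * DEF_j), so its logarithm is linear in the log-DEF values. The function names, the log-domain formulation, and the closed-form least-squares solve are all illustrative choices, not the paper's stated method.

```python
import math


def direct_energy_fraction(direct, diffuse):
    """DEF of one channel: direct energy divided by total energy."""
    return direct / (direct + diffuse)


def expected_correlations(energies):
    """Expected |rho_ij| for a list of (direct, diffuse) energy pairs.

    Assumption (not from the abstract): one coherent direct source shared
    by all channels, and mutually uncorrelated diffuse components, which
    gives |rho_ij| = sqrt(DEF_i * DEF_j).
    """
    phi = [direct_energy_fraction(d, f) for d, f in energies]
    n = len(phi)
    return [
        [1.0 if i == j else math.sqrt(phi[i] * phi[j]) for j in range(n)]
        for i in range(n)
    ]


def estimate_def(rho, floor=1e-12):
    """Estimate per-channel DEF from a matrix of pairwise correlations.

    Taking logs turns |rho_ij| = sqrt(DEF_i * DEF_j) into the linear system
    log|rho_ij| = 0.5 * (x_i + x_j), with x_i = log(DEF_i). For the complete
    set of channel pairs, the least-squares solution has the closed form
    used below. At least three channels are needed for a unique solution.
    """
    n = len(rho)
    if n < 3:
        raise ValueError("need at least three channels")
    # Log-magnitudes of the off-diagonal correlations, floored to avoid log(0).
    y = [[math.log(max(abs(rho[i][j]), floor)) for j in range(n)] for i in range(n)]
    # S_i: sum of log-correlations between channel i and every other channel.
    row_sums = [sum(y[i][j] for j in range(n) if j != i) for i in range(n)]
    # T: sum of all x_i, recovered from sum_i S_i = (n - 1) * T.
    total = sum(row_sums) / (n - 1)
    # Solve S_i = 0.5 * ((n - 2) * x_i + T) for x_i.
    log_phi = [(2.0 * s - total) / (n - 2) for s in row_sums]
    # A DEF is an energy fraction, so clamp estimates to at most 1.
    return [min(1.0, math.exp(v)) for v in log_phi]


# Hypothetical (direct, diffuse) energies for a four-channel signal.
energies = [(0.8, 0.2), (0.5, 0.5), (0.1, 0.9), (0.3, 0.1)]
estimates = estimate_def(expected_correlations(energies))
```

In a time-frequency framework such as the one the abstract mentions, the correlation matrix would presumably be estimated per frequency band and time frame from the analysis coefficients rather than computed from known energies, and the resulting estimates would be noisy, which is where a least-squares fit over all channel pairs becomes useful.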