Spatial reproduction of the voices of conference participants can greatly enhance the performance of a life-size videoconferencing system in terms of qualities such as speech intelligibility, speaker identification, and, more generally, the naturalness of a conference. A technique well suited to accurate spatial sound reproduction, including depth, is Wave Field Synthesis (WFS). This paper presents results of research on combining WFS with 2D video projection. The work includes subjective experiments on sound localization, on the correspondence between perceived auditory and visual source directions, and on speaker identification in situations with multiple speakers, as well as speech intelligibility tests and an investigation of the applicability of Distributed Mode Loudspeakers in WFS.