Journal of the AES

2019 September - Volume 67 Number 9

Papers

Nonlinear Distortion Reduction in Sound Zones by Constraining Individual Loudspeaker Control Effort

Authors: Ma, Xiaohui; Hegarty, Patrick J.; Jørgensen, Kristoffer F.; Larsen, Jakob Juul
Affiliation: Dynaudio A/S, Skanderborg, Denmark; Dynaudio A/S, Skanderborg, Denmark; Dynaudio A/S, Skanderborg, Denmark; Department of Engineering, Aarhus University, Denmark
Page: 641

A personal sound zone system uses loudspeaker arrays to render different audio content to multiple listening groups within the same physical space, giving each group a concurrent, interference-free listening experience. The zones in which a program is intended to be heard are called bright zones. Nonlinear distortion in loudspeaker drivers can cause audible artifacts and degrade the acoustic contrast, especially at high driving levels. The distortion can be reduced by constraining the total control effort, but artifacts can remain when one or several loudspeaker drivers still have high control effort. To reduce nonlinear distortion, the authors applied individually control effort constrained acoustic contrast control (ICECACC), in which a control effort constraint is imposed on each individual loudspeaker driver. Simulations and experiments were performed on a two-zone setup, with one bright and one dark zone, using either ICECACC or acoustic contrast control (ACC) and a two-tone stimulus that generates both harmonic and intermodulation distortion. Frequency-resolved measurements show that ICECACC and ACC give nearly identical acoustic contrast at the two fundamental frequencies, but ICECACC produces less nonlinear distortion than ACC. Experiments using a multitone stimulus and identical total control efforts also showed less nonlinear distortion with ICECACC than with ACC; however, this was achieved at the expense of contrast. The results show that a compromise can be made between acoustic contrast and nonlinear distortion.

Directional Loudness of Low-Frequency Noises Actually Presented Over Loudspeakers And Virtually Presented Over Headphones

Authors: Berthomieu, Gauthier; Koehl, Vincent; Paquier, Mathieu
Affiliation: Univ Brest, Lab-STICC, CNRS, Brest, France; Univ Brest, Lab-STICC, CNRS, Brest, France; Univ Brest, Lab-STICC, CNRS, Brest, France
Page: 655

The direction of a sound source relative to the listener significantly affects the loudness of the sounds it produces, especially in the horizontal plane, where interaural time difference (ITD) is the main localization cue. There is growing awareness of this phenomenon, known as directional loudness sensitivity (DLS), which has to be taken into account in audio reproduction systems, especially multichannel ones. This effect has only been studied for sounds generated and presented directly over headphones, which is not a natural listening condition. The present study investigates this effect for low-frequency noises originating from real sources. Twenty subjects assessed the loudness of stimuli presented both over loudspeakers arranged at various locations within a listening room and over headphones, after being recorded with a dummy head and virtually reproduced. Results show that the DLS is in agreement with the previously revealed ITD effect. Moreover, the DLS was higher when stimuli were reproduced over headphones than over loudspeakers, specifically when frontal sources were located at a short distance from the listeners. One hypothesis for this effect relies on visual cues that were available to the listeners only when sounds were reproduced over loudspeakers, providing information about the source distance. Listeners were also aware of whether sounds were reproduced over loudspeakers or headphones, which may have led to different loudness assessments and thus to DLS differences.

Generalized Metrics for Constant Directivity

Open Access

Authors: Sridhar, Rahulram; Tylka, Joseph G.; Choueiri, Edgar Y.
Affiliation: 3D Audio and Applied Acoustics Laboratory, Princeton University, Princeton, NJ, USA; 3D Audio and Applied Acoustics Laboratory, Princeton University, Princeton, NJ, USA; 3D Audio and Applied Acoustics Laboratory, Princeton University, Princeton, NJ, USA
Page: 666

Many applications in audio, for example sound reinforcement and selective microphone beams, benefit from transducer arrays whose directional characteristics do not vary with frequency. The coverage angle should be constant over a usable frequency range. Metrics are proposed for quantifying the extent to which a transducer’s polar radiation (or sensitivity) pattern is invariant with frequency. As there is currently no established measure of this quality (often called “controlled” or “constant directivity”), this paper proposes five metrics, each based on commonly used criteria for constant directivity: 1) a Fourier analysis of sensitivity contour lines (i.e., lines of constant sensitivity over frequency and angle), 2) the average of spectral distortions within a specified angular listening window, 3) the solid angle of the frontal region with distortions below a specified threshold, 4) the standard deviation of the directivity index, and 5) cross-correlations of polar responses. These metrics are computed for ten loudspeakers, which are ranked from most constant-directive to least, according to each metric. The resulting values and rankings are compared, and the suitability of each metric for comparing transducers in different applications is assessed. For critical listening applications in reflective or dynamic listening environments, metric 1 appears most suitable, while for such applications in acoustically treated and static environments, metrics 2 and 3 may be preferable. Furthermore, for high-amplitude applications (e.g., live sound) in reflective or noisy environments, metrics 4 and 5 appear most suitable.
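As an illustration of metric 4 only, the standard deviation of the directivity index (DI) across frequency can be sketched as below. This is a minimal sketch, not the paper's implementation: the function names, the loudspeaker labels, the DI values, and the use of a population standard deviation over unweighted frequency bands are all assumptions, since the abstract does not give the frequency range or any weighting.

```python
import statistics

def directivity_index_spread(di_db):
    """Population standard deviation (in dB) of DI values sampled over frequency.

    A smaller value means the DI changes less with frequency.
    Assumption: unweighted bands; the paper may define this differently.
    """
    return statistics.pstdev(di_db)

def rank_by_constancy(di_by_speaker):
    """Return speaker names sorted from smallest to largest DI spread."""
    return sorted(di_by_speaker, key=lambda name: directivity_index_spread(di_by_speaker[name]))

# Hypothetical DI values in dB for two made-up loudspeakers.
example = {
    "speaker_a": [4.0, 6.0, 8.0],   # DI rises with frequency
    "speaker_b": [6.0, 6.0, 6.0],   # DI is flat over frequency
}
ranking = rank_by_constancy(example)
```

Under this sketch, a loudspeaker whose DI is flat over frequency ranks ahead of one whose DI rises with frequency, which matches the "most constant-directive to least" ordering the abstract describes.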

Engineering Reports

A New Decoder for CD-4 (Quadradisc) Phonograph Records

Author: Brice, Richard
Affiliation: Pspatial Audio, France
Page: 679

CD-4 (or Compatible Discrete 4 Channel) was a short-lived, four-channel, surround-sound system for phonograph records. Developed by JVC in Japan, the system was adopted in America around 1972 by RCA, where it was known as RCA Quadradisc. Unlike matrix quadraphonic systems, CD-4 took a more radical approach. The baseband signals, which modulate the groove, are the sums of the front and back signals, (LF + LB) and (RF + RB). The difference signals, used to separate back from front in the decoder, are FM encoded on a pair of ultrasonic (30 kHz) subcarriers recorded above this baseband signal. The development of a new, software-based decoder for CD-4 phonograph records is described in this report. A relatively complete understanding of the original hardware decoders is necessary, and this analysis is new. A special phono cartridge with an extended frequency response up to 45 kHz is required, and this must be fitted with a Shibata or line-contact stylus to track the high-frequency subcarrier modulation. In addition, wide-bandwidth preamplifiers, correct cable types, and low crosstalk are all required to recover subcarrier signals of sufficient quality and amplitude for successful decoding. A different approach to the output matrix, based on Ambisonics theory, is described, which increases the reliability of successfully decoding worn and damaged CD-4 media.
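The sum-and-difference scheme in this abstract can be illustrated with a minimal sketch. It assumes the difference signals are (LF - LB) and (RF - RB), which the abstract implies but does not state, and it leaves out the FM modulation of the 30 kHz subcarriers and any other processing a real decoder needs. The function names are hypothetical, and this is not the decoder or the Ambisonics-based output matrix described in the report.

```python
def cd4_matrix_encode(lf, lb, rf, rb):
    """Form per-side sum (baseband) and difference (subcarrier payload) samples.

    Assumption: difference = front - back on each side.
    """
    left_sum, left_diff = lf + lb, lf - lb
    right_sum, right_diff = rf + rb, rf - rb
    return left_sum, left_diff, right_sum, right_diff

def cd4_matrix_decode(left_sum, left_diff, right_sum, right_diff):
    """Recover the four channels from the sum and difference samples."""
    lf = (left_sum + left_diff) / 2
    lb = (left_sum - left_diff) / 2
    rf = (right_sum + right_diff) / 2
    rb = (right_sum - right_diff) / 2
    return lf, lb, rf, rb

# Round trip on one hypothetical sample per channel.
original = (0.5, 0.25, -0.125, 0.75)
recovered = cd4_matrix_decode(*cd4_matrix_encode(*original))
```

The sketch also shows why the format is "compatible": a stereo player that ignores the subcarriers still receives the front-plus-back sum on each side.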

The Auditory Source Widening Effect in Binaural Synthesis with Spatial Distribution of Frequency Bands

Authors: Su, Hengwei; Marui, Atsushi; Kamekawa, Toru
Affiliation: Tokyo University of the Arts, Tokyo, Japan; Tokyo University of the Arts, Tokyo, Japan; Tokyo University of the Arts, Tokyo, Japan
Page: 691

A binaural technique, which involves direct control of the signals delivered to both ears of a listener, can not only solve the problem of spatial impression in headphone reproduction but also provide realistic auditory experiences, especially in 3D spatial acoustic reproduction. In this study, monophonic source signals were processed by frequency-band decomposition and distribution to achieve spatially widened perceived source widths in binaural synthesis. Stimuli with different widths were synthesized, and the perceived widths were evaluated in a listening experiment to investigate the relationship between the perceived width and the synthesized width. Three different bandwidths of frequency bands and two center positions of synthesized widths were used in the processing, and their effects on the perception of source width were investigated. The results of the listening experiment suggested that under proper processing conditions the perceived width could increase with increasing synthesized width. However, dependencies on source signal characteristics and variations between participants were observed. Degradations of timbre and spatial quality were also evaluated. The results suggested that this method suffered less degradation than a conventional decorrelation method while achieving comparable widening effects for binaural reproduction. For example, for a cello source signal with 1/12-octave bandwidth, the perceived width increased with increasing synthesized width. This suggests that under appropriate conditions this method could control the perceived width of a monophonic source in binaural synthesis.

A Cross-Evaluated Database of Measured and Simulated HRTFs Including 3D Head Meshes, Anthropometric Features, and Headphone Impulse Responses

Authors: Brinkmann, Fabian; Dinakaran, Manoj; Pelzer, Robert; Grosche, Peter; Voss, Daniel; Weinzierl, Stefan
Affiliation: Audio Communication Group, Technical University of Berlin, Germany; Audio Communication Group, Technical University of Berlin, Germany; Audio Communication Group, Technical University of Berlin, Germany; Huawei Technologies, Munich Research Centre, Munich, Germany; Sennheiser electronic GmbH & Co. KG, Wedemark, Germany; Audio Communication Group, Technical University of Berlin, Germany
Page: 705

The individualization of head-related transfer functions (HRTFs) can make an important contribution to improving the quality of binaural technology applications. One approach to individualization is to exploit the relationship between the shape of HRTFs and the anthropometric features of the ears, head, and torso of the corresponding listeners. To identify statistically significant relationships between the two sets of variables, a relatively large database is required. For this purpose, full-spherical HRTFs of 96 subjects were acoustically measured and numerically simulated. A detailed cross-evaluation showed good agreement with previous data, between repeated measurements, and between measured and simulated data. In addition to the 96 HRTFs, the database includes high-resolution head meshes, a list of 25 anthropometric features per subject, and headphone transfer functions for two headphone models.

Letters To The Editor

Time for Slow Listening

Authors: Lund, Thomas; Mäkivirta, Aki; Naghian, Siamäk
Affiliation: Genelec OY, Iisalmi, Finland; Genelec OY, Iisalmi, Finland; Genelec OY, Iisalmi, Finland
Page: 636

Conscious perception is influenced by long-term experience and learning, to an extent that it might be more accurately understood and studied as primarily a reach-out phenomenon, at least in adults. Considering human hearing, time is a deciding factor on several scales, and the sensory information flow rate, otherwise termed the perceptual bandwidth, is modest. We introduce the term “slow listening” and discuss how new findings from other fields of science should be taken into account in pro audio, for instance when conducting subjective tests, and when preserving content for future generations to enjoy.

Standards and Information Documents

AES Standards Committee News

Page: 720

Features

Audio Education Conference, Murfreesboro and Nashville, Call for Contributions

Page: 721

Audio for Virtual and Augmented Reality Conference, Redmond, Call for Contributions

Page: 721

Audio Forensics Conference Report, Porto

Page: 722

JAES Special Issue on Semantic Music Production, Call for Papers

Page: 732

2020 AES Academy, Anaheim, Call for Contributions

Page: 733

With-Height Spatial Audio: Choices and Change

Author: Rumsey, Francis
Page: 734

When working on “with-height” spatial audio, the pressure to abandon specific channel-based production formats will grow, given that the number of possible reproduction systems is increasingly large. The responsibility then lies increasingly on the playback rendering system to do a good job of delivering a convincing impression of the original intention on whatever reproduction system it is presented with. Preserving or delivering authentic or plausible spatial characteristics of both the direct and diffuse elements of a scene becomes the target.

148th Convention, Vienna, Call for Contributions

Page: 747

Departments

Section News

Page: 739

Book Reviews

Page: 743

AES Conventions and Conferences

Page: 748
