AES Journal

Journal of the AES

2014 December - Volume 62 Number 12


President’s Message

Author: Andres Mayo

Page: 819

Download: PDF (34KB)

Editors’ Note

Author: Bozena Kostek

Page: 820

Download: PDF (26KB)

Papers

MPEG-H Audio—The New Standard for Universal Spatial/3D Audio Coding

Authors:
Affiliation:
Page:

In order to facilitate high-quality bitrate-efficient distribution and flexible reproduction of 3D sound, the MPEG standardization group is engaged in the development of MPEG-H Audio Coding. This proposed standard will allow for universal carriage of encoded 3D sound from channel-based, object-based, and HOA-based sound formats. Reproduction is supported for many output setups ranging from 22.2 and beyond down to 5.1, stereo, and binaural reproduction. Depending on the available reproduction configuration, the encoded material is rendered to yield the highest spatial audio quality, overcoming the incompatibility between various 3D reproduction formats. Moreover, MPEG-H Audio is a unified system for carriage of channel-oriented, object-oriented, and Higher Order Ambisonics-based content. This paper describes the current status of the standardization project and provides an overview of the system architecture, technology, capabilities, and current performance.

  Download: PDF (HIGH Res) (505KB)

  Download: PDF (LOW Res) (265KB)

  Be the first to discuss this paper

Spatial Audio Quality Perception (Part 1): Impact of Commonly Encountered Processes

Open Access

Open
Access

Authors:
Affiliation:
Page:

Spatial audio processes (SAPs) commonly encountered in consumer audio reproduction systems are known to produce a range of impairments to spatial quality. By way of two listening tests, this paper investigated the degree of degradation of the spatial quality of six 5-channel audio recordings resulting from 48 such SAPs. Perceived degradation also depends on the particular listeners, the program content, and the listening location. For example, combining off-center listener with another SAP can reduce spatial quality significantly when compared to listening to that SAP from a central location. The choice of the SAP can have a large influence on the degree of degradation. Taken together these findings and the quality-annotated database can guide the development of a regression model of perceived overall spatial audio quality, incorporating previously developed spatially-relevant feature-extraction algorithms. The results can guide the development of an artificial-listener-based evaluation system.

  Download: PDF (HIGH Res) (1.9MB)

  Download: PDF (LOW Res) (457KB)

  Be the first to discuss this paper

Spatial Audio Quality Perception (Part 2): A Linear Regression Model

Open Access

Open
Access

Authors:
Affiliation:
Page:

The QESTRAL (Quality Evaluation of Spatial Transmission and Reproduction using an Artificial Listener) system is intended to be an artificial-listener-based evaluation system capable of predicting the perceived spatial quality degradations resulting from SAPs (Spatial Audio Processes) commonly encountered in consumer audio reproduction. A generalizable model employing just five metrics and two principal components performs well in its prediction of the quality over a range of program types. Commonly-encountered SAPs can have a large deleterious effect on several spatial attributes including source location, envelopment, coverage angle, ensemble width, and spaciousness. They can also impact timbre, and changes to timbre can then influence spatial perception. Previously obtained data was used to build a regression model of perceived spatial audio quality in terms of spatial and timbral metrics. In conjunction with two simple probe signals, the resulting model can form the core of an evaluation system.

  Download: PDF (HIGH Res) (2.4MB)

  Download: PDF (LOW Res) (334KB)

  Be the first to discuss this paper

Adaptive Optimization of Interchannel Coherence with Stereo and Surround Audio Content

Authors:
Affiliation:
Page:

Interchannel coherence is one of the key features of stereo and multichannel audio that contribute to the perception of width. Certain production techniques, most commonly those using linear coincident microphones, are characterized by excessively high interchannel coherence. Addressing this issue, the authors propose a blind spatial sound enhancement technique that adjusts the coherence in frequency bands while minimizing the change of timbre. The adaptive processing uses regularized least squares optimized mixing, decorrelation, and bypassed onsets. Results of listening experiments show significant improvements by the processing to the attributes width, preference, and overall sound quality without observing adverse effects.

  Download: PDF (HIGH Res) (459KB)

  Download: PDF (LOW Res) (291KB)

  Be the first to discuss this paper

Effect of Vertical Microphone Layer Spacing for a 3D Microphone Array

Open Access

Open
Access

Authors:
Affiliation:
Page:

Subjective listening tests were conducted to investigate how the spacing between main (lower) and height (upper) microphone layers in a 3D main microphone array affects perceived spatial impression and overall preference. It was generally found that layer spacing of 0.5 m, 1 m, and 1.5 m did not produce significant differences in either perceived spatial impression or preference. The 0-m layer had slightly higher ratings than the spaced layers in both spatial impression and preference, depending on the type of source. The four configurations were compared with trumpet, acoustic guitar, percussion quartet, and string quartet using a 9-channel loudspeaker setup. It is suggested that the perceived results were mainly associated with vertical interchannel crosstalk in the signals of each height layer and the magnitude and pattern of spectral change at the listener’s ear caused by each layer. Informal comments suggested that the main preference attributes were tonal quality, as well as spatial quality.

  Download: PDF (HIGH Res) (5.1MB)

  Download: PDF (LOW Res) (713KB)

  Be the first to discuss this paper

Standards and Information Documents

AES Standards Committee News

Page: 885

Download: PDF (55KB)

Features

Call for Awards Nominations

Page: 886

Download: PDF (35KB)

137th Convention Report, Los Angeles

Page: 888

Download: PDF (2.3MB)

57th Conference Preview, Hollywood

Page: 904

Download: PDF (498KB)

Loudness Revisited

Author:
Page:

It is a feature of modern life that things are loud, but action can be taken to limit the effects of loudness competition in audio-related services. During the 137th Convention, experts from around the world gathered to debate what is being done to control excessive loudness and deal with problems of dialog intelligibility in consumer systems and the cinema.

  Download: PDF (244KB)

  Be the first to discuss this feature

58th Conference, Aalborg, Call for Papers

Page: 916

Download: PDF (70KB)

59th Conference, Montreal, Call for Papers

Page: 917

Download: PDF (60KB)

139th Convention, New York, Call for Papers

Page: 918

Download: PDF (70KB)

Call for Nominations for Board of Governors

Page: 919

Download: PDF (27KB)

Departments

Index to Volume 62

Page: 924

Download: PDF (131KB)

Section News

Page: 911

Download: PDF (339KB)

Products and Developments

Page: 914

Download: PDF (110KB)

137th Convention Audio mp3 order form

Page: 920

Download: PDF (759KB)

Annual Report

Page: 922

Download: PDF (23KB)

AES Conventions and Conferences

Page: 940

Download: PDF (79KB)

Extras

Table of Contents

Download: PDF (57KB)

Cover & Sustaining Members List

Download: PDF (41KB)

AES Officers, Committees, Offices & Journal Staff

Download: PDF (59KB)

AES - Audio Engineering Society