AES Journal

Journal of the Audio Engineering Society

AES Journal

The Journal of the Audio Engineering Society — the official publication of the AES — is the only peer-reviewed journal devoted exclusively to audio technology. Published 10 times each year, it is available to all AES members and subscribers.

The Journal contains state-of-the-art technical papers and engineering reports; feature articles covering timely topics; pre and post reports of AES conventions and other society activities; news from AES sections around the world; Standards and Education Committee work; membership news, patents, new products, and newsworthy developments in the field of audio.

Calls for Papers for Special Issues of the Journal

Call for Papers for JAES Special Issue on Object-Based Audio

 


Current Issue: 2018 October - Volume 66 Number 10


Letters To The Editor

Comments on “Bandpass Subwoofer Design”

Authors:
Affiliation:
Page:

COMMENTS ON “BANDPASS SUBWOOFER DESIGN” Referring to the above paper, we would like to make some comments on the design parameters of the doubly vented loudspeaker as presented by the author in his Eqs. (30) – (40). We would like to point out on some mistakes in the original paper, although the parameters presented in his Table 3 are correct.

  Download: PDF (58KB)

  Be the first to discuss this letter

Papers

Single-Ended Speech Quality Prediction Based on Automatic Speech Recognition

Authors:
Affiliation:
Page:

Quality evaluation of digitally-transmitted speech is an important prerequisite to ensure the required quality of telecommunication service. Although formal subjective listening tests still represent the gold standard, they are time-consuming and costly. A new single-ended speech quality measure is proposed that uses a deep neural network (DNN)-based automatic speech recognition system. A quality measure is used to quantify the degradation of the DNN output caused by speech distortions. The new method was evaluated using five databases containing nine subsets of data covering several conditions of narrowband and broadband speech that was degraded by speech codecs, telecommunication networks, clipping, chopped speech, echoes, competing speakers, and additional background noises. Other than the training data set, evaluation results with the remaining eight data subsets showed good average correlations with subjective speech quality ratings achieved without any task-specific training or optimizations. These average results are close to those achieved with the American National Standard ANIQUE+ and clearly better than those obtained with the ITU-T standard P.563.

  Download: PDF (HIGH Res) (769KB)

  Download: PDF (LOW Res) (486KB)

  Be the first to discuss this paper

Layered Motion and Gesture Sonification in an Interactive Installation

Authors:
Affiliation:
Page:

SoundThimble is an interactive sound installation based on the relationship between human motion and virtual objects in 3D space. A Vicon infrared motion-capture system and custom software are used to track, interpret, and sonify the movement and gestures of a performer relative to a virtual object. The authors explore the resulting possibilities for layered sonification dynamics and extended perception and expression in internal tests as well as in a public demo. Experimental evaluation reveals an average object search time of around 60 s, as well as thresholding ranges for effective gesture spotting. The underlying software platform is open source and portable to similar hardware systems, leaving room for extension and variation. This paper presents the pilot application of the proposed framework. Audience members entering the tracking area shift among the roles of game player, sonic performer, and composer/arranger, according to an iterative interaction schema. The central vehicle in all three layers is the “sound-thimble” itself, a virtual object with particular spatial, sonic, and interaction attributes.

  Download: PDF (HIGH Res) (1.3MB)

  Download: PDF (LOW Res) (282KB)

  Be the first to discuss this paper

A New Recursive Semi-Supervised Non-Negative Matrix Factorization for Separation of Harmonic and Percussive Elements in Digital Sounds

Authors:
Affiliation:
Page:

With the ever-increasing applications for digital signal processing, there is a strong motivation to discover new processing techniques. Methods based on matrix rank minimization have been increasingly used for signal analysis, particularly for signal separation. This research considers the analysis and application of the Non-Negative Matrix Factorization (NMF), associated with Kullback-Leibler and Itakura-Saito divergences, for the separation of digital sound sources consisting of harmonic and percussive elements. The NMF algorithm and divergence functions were implemented in a MATLAB environment and applied to musical mixes composed of electric guitar, bass, kick, ride, and snare. Then, comparative analyses of the divergence functions performance used SNR-based metrics. Considering the inconsistencies between the objective metrics and the human perception, two alternative objective metrics were proposed for the Signal-Interference Ratio (SIR), called Windowed SIR (W-SIR) and Average Windowed SIR (AW-SIR). Based on the W-SIR metric, the authors present the new Recursive Semi-Supervised NMF (RSS-NMF), for which the training information is extracted from the original signal. In both cases, the results demonstrated better performance of the RSS-NMF technique in relation to the non-supervised NMF technique.

  Download: PDF (HIGH Res) (4.9MB)

  Download: PDF (LOW Res) (370KB)

  Be the first to discuss this paper

Minimally Simple Binaural Room Modeling Using a Single Feedback Delay Network

Authors:
Affiliation:
Page:

The widespread adoption of acoustic modeling in such applications as 3D gaming and virtual reality simulation are generally hindered by the complexity of the implementation. The most efficient binaural acoustic modeling systems use a multi-tap delay to generate early reflections, which are then combined with a feedback delay network to produce generic late reverberation. This report presents a method of binaural acoustic simulation that uses one feedback delay network to simultaneously model both first-order reflections and late reverberation. The advantages are simplicity and efficiency. The proposed method is compared to existing methods for modeling binaural early reflections using a multi-tap delay line. Measurements of ISO standard evaluators including interaural correlation coefficient, decay time, clarity, definition, and center time indicate that the proposed method achieves a comparable accuracy to less efficient methods. The proposed method is implemented as an iOS application, and is able to auralize input signal directly without convolution and update in real time. This significantly reduces the computational time because it does not need to produce an impulse response after every parameter update.

  Download: PDF (HIGH Res) (1.2MB)

  Download: PDF (LOW Res) (473KB)

  Be the first to discuss this paper

Physically Based Unified Modeling for a Series of Miniature Twin Triode Tubes

Authors:
Affiliation:
Page:

Modern technology, such as SPICE circuit analysis and signal processing implementation of equations, can be applied to ancient technology such as triode vacuum tubes to gain a better understanding of their properties. In this paper the authors propose a formula that can be applied uniformly to a series of miniature tubes: 12AX7, 12AU7, 12AY7, and 12AT7. This model considers the effects of the differences in physical shape and dimension on the equivalent diode characteristics in the initial velocity region. Such effects were theoretically analyzed in the beginning of the tube development in the 1920s. However, the results were never incorporated into today’s SPICE modeling. The model consists of space charge modulation formula, which enables the expression of nonlinear behavior in wide operation range of plus and minus grid bias in a circuit. Thus, the model also successfully simulates the modulation of amplification factor obtained by the differential parameters of transconductance and output resistance. Furthermore, the dispersions of the parameter-value for the different tube types are compared in detail based on the electrode shape and the physical dimension associated with various aspects of properties. From this analysis, the manufacturing stability for 12AU7 and 12AY7 is found to be better than that for 12AX7 and 12AT7 because of the difficulty of the accurate processing of high-amplification factor and high-transconductance factor, respectively.

  Download: PDF (HIGH Res) (10.5MB)

  Download: PDF (LOW Res) (990KB)

  Be the first to discuss this paper

Identification of Volterra Models of Tube Audio Devices using Multiple-Variance Method

Authors:
Affiliation:
Page:

The multiple-variance method is a cross-correlation method that exploits input signals with different powers for the identification of a nonlinear system by means of the Volterra series. It overcomes the problem of the locality of the solution of traditional nonlinear identification methods that successfully approximate only for inputs having approximately the same power of the identification signal. The multiple-variance method improves the model performance in case of inputs with high dynamic range. This method is used to identify three different tube amplifiers, and it is applied to a novel reduced Volterra model. This overcomes the problem of the very large number of coefficients required by the Volterra series, the so-called “course of dimensionality.” The paper demonstrates the effectiveness of the multiple-variance methodology in terms of system identification error and computational complexity.

  Download: PDF (HIGH Res) (3.2MB)

  Download: PDF (LOW Res) (604KB)

  Be the first to discuss this paper

Standards and Information Documents

AES Standards Committee News

Page: 839

Download: PDF (64KB)

Features

Music-Induced Hearing Disorders Conference Report, Chicago

Page: 840

Download: PDF (648KB)

Spatial Reproduction Conference Report, Tokyo

Page: 846

Download: PDF (936KB)

Preserving Our Audio Heritage

Author:
Page:

Papers presented at the recent conference on archiving, preservation, and restoration majored on the degradation of cylinders and magnetic tapes, but also provided a fascinating insight into other issues such as correcting timebase errors, preservation metadata, and checking digital recordings for ingest errors.

  Download: PDF (552KB)

  Be the first to discuss this feature

New Officers 2018–2019

Page: 859

Download: PDF (430KB)

Review of Society’s Sustaining Members

Page: 865

Download: PDF (249KB)

Departments

Section News

Page: 858

Download: PDF (44KB)

Obituaries

Page: 863

Download: PDF (42KB)

Products

Page: 864

Download: PDF (89KB)

AES Conventions and Conferences

Page: 880

Download: PDF (107KB)

Extras

Table of Contents

Download: PDF (41KB)

Cover & Sustaining Members List

Download: PDF (46KB)

AES Officers, Committees, Offices & Journal Staff

Download: PDF (75KB)

AES - Audio Engineering Society