AES E-Library

AES E-Library Search Results

Bulk download - click topic to download Zip archive of all papers related to that topic: No Subject Listed Applications in Audio Audio Culture and Education Perception Psychoacoustics and Perception Recording and Production Room Acoustics Semantic Audio Signal Processing Spatial Audio Transducers

Multiband time-domain crosstalk cancellation

Pioneered in the sixties, Crosstalk Cancellation (CTC) allows for immersive sound reproduction from a limited number of loudspeakers. Upcoming virtual reality and augmented reality applications, as well as widespread availability of 3D audio content, have boosted interest in CTC technologies over the recent years. In this paper, we present a novel multiband approach to CTC, evolving and superseding our original work based on modeling of the system’s geometrical acoustics. This new solution, whilst keeping a simple processing model, offers improved CTC effectiveness, reduced residual coloration and wider bandwidth. The enhanced performance of our new approach has been confirmed by laboratory experiments.

Open
Access

Authors: Vancheri, Alberto; Leidi, Tiziano; Heeb, Thierry; Grossi, Loris; Spagoli, Noah
Affiliations: University of Applied Sciences and Arts of Southern Switzerland; University of Applied Sciences and Arts of Southern Switzerland; University of Applied Sciences and Arts of Southern Switzerland; University of Applied Sciences and Arts of Southern Switzerland; University of Applied Sciences and Arts of Southern Switzerland(See document for exact affiliation information.)
AES Convention: 153 (October 2022) Paper Number: 10614 Permalink
Publication Date: October 19, 2022 Import into BibTeX
Subject: Spatial Audio

Download Now (1.1 MB)

This paper is Open Access which means you can download it for free.

Start a discussion about this Spatial Audio!

A Wideband Target Response Curve for Insert Earphones

The frequency response of a headphone is very important for listener satisfaction. Listener preferences have been thoroughly studied for frequencies below 10 kHz, but preferences above that frequency are less well known. Recent improvements in the high frequency performance of ear simulators make it more practical to study this frequency region now. In this study, the high frequency portion of the target response for insert headphone designs is investigated, focusing on the range from 10 kHz to 20 kHz. A new target response is proposed, based on listener preference ratings in a blind listening test. The results show a clear preference for significantly more high frequency energy than was proposed in a previous popular earphone target curve. The preferred response is also affected by the listener’s hearing thresholds, with additional high frequency boost being preferred for listeners with age-related hearing loss.

Authors: Miller, Tom; Downey, Cristina
Affiliations: Knowles Electronics, LLC; Knowles Electronics, LLC(See document for exact affiliation information.)
AES Convention: 153 (October 2022) Paper Number: 10615 Permalink
Publication Date: October 19, 2022 Import into BibTeX
Subject: Applications in Audio

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Start a discussion about this Applications in Audi!

Combining Deep Learning and Web Audio for Automated Real-time Audio Speech Production

With the advent of Web Audio the compliant browser offers a toolbox of audio production components right out of the box. This could prove valuable for any content producer in the audio field, either professional or amateur. In this paper we will add deep learning to this scenario with the ultimate goal of obtaining machine assisted real-time audio production in-browser. As a proof of concept we will implement a basic yet complete one-channel design that uses deep learning to assist an automatic filtering algorithm producing parameters to adjust an audio signal in the Web Audio context of a browser. To achieve this we will evaluate five-class audio prediction models and compare their accuracy at the model building stage with their accuracy when exported to the real-time context. We will present two ways to measure this accuracy in real-time. We will also present a method to reduce jumpiness in real-time predictions when classification scores are ambiguous. We will highlight some important limitations and we will also present a refined model – designed for our domain specific audio set – based on some architectures from previous research and find that that our model outperforms these architectures

Author: Sigvardson, Tor
Affiliation: Blekinge Institute of Technology - BTH, Karlskrona, Sweden and Swedish Radio, Stockholm, Sweden
AES Convention: 153 (October 2022) Paper Number: 10616 Permalink
Publication Date: October 19, 2022 Import into BibTeX
Subject: Signal Processing

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Start a discussion about this Signal Processing!

Simulating low frequency noise pollution using the parabolic equations in sound reinforcement loudspeaker systems

Sound system designers are used to optimizing loudspeaker systems for the audience experience with free-field simulation software. However, noise pollution reduction must also be considered during the design phase and the propagation of sound may be affected by inhomogeneous atmospheric conditions, such as wind, temperature gradients, and ground impedance. This paper proposes a method to simulate the impact of the environment on sound pressure levels at large distances created by loudspeaker systems using parabolic equations, considering a reference left-right main system associated with either flown or ground-stacked subwoofers. Results show a higher variability of the sound pressure level with systems using ground-stacked subwoofers. The influence of the crossover frequency between main and subwoofers is discussed in this paper.

Open
Access

Authors: Mouterde, Thomas; Perrot, Joris; Lihoreau, Bertrand; Corteel, Etienne
Affiliations: L-Acoustics, Marcoussis, France; L-Acoustics, Marcoussis, France; Laboratoire d'Acoustique de l'Université du Mans, France; L-Acoustics, Marcoussis, France(See document for exact affiliation information.)
AES Convention: 153 (October 2022) Paper Number: 10617 Permalink
Publication Date: October 19, 2022 Import into BibTeX
Subject: Applications in Audio

Download Now (396 KB)

This paper is Open Access which means you can download it for free.

Start a discussion about this Applications in Audi!

Piezoelectric Actuators for Flat-Panel Loudspeakers

Piezoelectric actuators offer advantages in terms of weight and form-factor when compared with traditional inertial exciters used to drive flat-panel loudspeakers. Unlike inertial exciters, piezoelectric actuators induce vibrations in the panel by generating bending moments at the actuator edges. Models for piezoelectric excitation are developed, and it is shown that piezoelectric actuators are most effective at driving resonant modes whose bending half-wavelength is the same size as the actuator dimensions, giving a natural boost to the high-frequency response. Flat-panel loudspeaker design techniques such as the modal crossover method, and the corresponding array layout optimization are adapted using the model for the response of piezo-driven panels. The adapted design techniques are shown to eliminate the isolated, low-frequency bending modes responsible for reduced sound quality on a prototype panel speaker.

Open
Access

Authors: Heilemann, Michael C.; DiPassio, Tre
Affiliation: University of Rochester, Rochester, NY, USA, University of Rochester, Rochester, NY, USA
AES Convention: 153 (October 2022) Paper Number: 10618 Permalink
Publication Date: October 19, 2022 Import into BibTeX
Subject: Applications in Audio

Download Now (11.5 MB)

This paper is Open Access which means you can download it for free.

Start a discussion about this Applications in Audi!

Ambience recording - The influence of the direct sound when combining close and ambience microphones

When combining close microphone(s) with ambience microphone(s), the direct signal will be picked up slightly delayed by the ambience microphone(s) compared to the close microphone(s). This study examines the auditory significance of the delayed direct signal in the context of early reflections and reverberation emitted from the boundaries within an acoustic environment. The aim is to establish to which extent any auditory perception influenced by the delayed direct signal may impact the application of the ambience microphone setup itself. The findings in this study suggests that when combined with close microphones, the ambience setup when using omni microphones, should be placed at a distance beyond at least twice the critical distance of the acoustical recording environment relative to the sound source.

Author: Tore Teigland and Paal Erik Jensen
Affiliations: Kristiania University College, Oslo, Norway; Kristiania University College, Oslo, Norway(See document for exact affiliation information.)
AES Convention: 153 (October 2022) Paper Number: 10619 Permalink
Publication Date: October 19, 2022 Import into BibTeX
Subject: Recording and Production

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Start a discussion about this Recording and Produc!

On the use of integral input devices and relative mappings for an immersive audio mixing task. Part II.

Current systems capable of 3D sound reproduction and panning of multi-channel sound sources require the user to employ standard, low-level interaction tools such as knobs and sliders to handle audio sources and reproduction setups with higher complexity. Innovative tools and control methods designed to meet the requirements of immersive audio environments are essential to enable efficient implementation of these technologies and software. This work investigated three types of input devices as controllers for an immersive audio mixing task in a spatial audio system. This second part of the research focuses on the objective data registered by the apparatus of the experiment. Results suggest devices with a high degree of integrality had the fastest learning curve and produced comparable results to the traditional method in response time.

Authors: Quiroz, Diego; Martin, Denis
Affiliations: McGill University, Montreal, Canada; CIRMMT, Montreal, Canada(See document for exact affiliation information.)
AES Convention: 153 (October 2022) Paper Number: 10620 Permalink
Publication Date: October 19, 2022 Import into BibTeX
Subject: Spatial Audio

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Start a discussion about this Spatial Audio!

In-Depth Latency and Reliability Analysis of a Networked Music Performance over Public 5G Infrastructure

Networked Music Performances (NMP) are of increasing importance for musical enthusiasts, amateurs and professionals exploring new solutions and opportunities to rehearse or perform together at geographically distant locations. The use of public cellular connectivity to access wide area networks for such purposes can provide unique flexibility in planning NMP setups. Generally, latency and reliability are two key parameters for the end-to-end transmission of audio information through networks in this context. The stringent requirements of NMPs with respect to those parameters pose a major challenge for the current fourth generation (4G) of cellular technology. It is expected that the new and upcoming fifth generation (5G) of cellular technology will deliver significant Key Performance Indicator (KPI) improvements, and thus could be a promising enabler for musical interaction over distant locations. Currently, first public deployments of cellular 5G are being rolled out. This work presents an in-depth latency and reliability analysis of a distributed performance conducted over public 5G infrastructure in Finland. Measurement results suggest that Quality-of-Service (QoS) is needed in order to enable NMP over cellular 5G for many users and devices in a consistent, plannable and also flexible way.

Open
Access

Authors: Dürre, Jan; Werner, Norbert; Hämäläinen, Seppo; Lindfors, Oscar; Koistinen, Janne; Saarenmaa, Miro; Hupke, Robert
Affiliations: Sennheiser electronic GmbH & Co. KG; Sennheiser electronic GmbH & Co. KG; Nokia OYJ; Nokia OYJ; Telia Company AB; Telia Company AB; Leibniz University Hanover(See document for exact affiliation information.)
AES Convention: 153 (October 2022) Paper Number: 10621 Permalink
Publication Date: October 19, 2022 Import into BibTeX
Subject: Applications in Audio

Download Now (872 KB)

This paper is Open Access which means you can download it for free.

Start a discussion about this Applications in Audi!

Comparisons between VBAP and WFS using Spatial Sound Synthesis

This paper presents analysis methods that make use of synthesis concepts by means of movements of sounds in virtual 3D auditory space at audio rate. The main focuses lies on the modulating effect caused by moving a sound around a listener, commonly referred to as panning. Here, we will demonstrate a comparison between two common panning approaches: Vector-Base Amplitude Panning (VBAP) and Wavefield Synthesis (WFS). Both apply some type of secondary source selection process, reducing the number of active loudspeakers at any moment in time. Under specific conditions it therefore becomes possible to compare VBAP and WFS directly. Using a technique originally developed for 3D sound synthesis over multichannel loudspeaker arrays at audio rate, both VBAP and WFS are adapted to be emulated by this technique. By using this sonification process, the differences between panning with VBAP or WFS can then not only be visualised but also heard. This makes spatial sound synthesis not just a creative tool for sound creation, but also an analytic tool that can help illustrate characteristics of spatialisation approaches.

Open
Access

Authors: Schmele, Timothy; Lopez, Jose J.
Affiliations: Eurecat, Barcelona Spain and University of Music Karlsruhe, Germany; Universitat Politecnica de Valencia, Spain(See document for exact affiliation information.)
AES Convention: 153 (October 2022) Paper Number: 10622 Permalink
Publication Date: October 19, 2022 Import into BibTeX
Subject: Spatial Audio

Download Now (2.0 MB)

This paper is Open Access which means you can download it for free.

Start a discussion about this Spatial Audio!

Spider Design Optimization At The Buckling Limit

Size and design constraints in products such as soundbars and TVs require loudspeaker spiders of small diameter to allow for large voice-coil excursion. Spider designs that undergo exceedingly large displacements can exhibit buckling of the spider rolls, resulting in very audible distortion. Such buckling events are non-trivial to simulate with finite element methods and often lead to solver non-convergence. When wrapping numerical optimization algorithms around the finite-element simulations to achieve optimal spider designs, it is important to ensure that all simulated designs can be solved without errors or convergence issues. The optimal spider design might be right within the buckling limits and an automated numerical optimization algorithm will need to be able to resolve some designs that exhibit buckling. This work shows how an augmented FEM method can be used to circumvent issues when employing numerical optimization for a spider design near its buckling limits.

Open
Access

Author: Bezzola, Andri
Affiliation: Samsung Research America
AES Convention: 153 (October 2022) Paper Number: 10623 Permalink
Publication Date: October 19, 2022 Import into BibTeX
Subject: Transducers

Download Now (791 KB)

This paper is Open Access which means you can download it for free.

Start a discussion about this Transducers!

Search Results (Displaying 1-10 of 21 matches)		New Search
Sort by:
	Records Per Page:

AES E-Library Search Results

Multiband time-domain crosstalk cancellation

A Wideband Target Response Curve for Insert Earphones

Combining Deep Learning and Web Audio for Automated Real-time Audio Speech Production

Simulating low frequency noise pollution using the parabolic equations in sound reinforcement loudspeaker systems

Piezoelectric Actuators for Flat-Panel Loudspeakers

Ambience recording - The influence of the direct sound when combining close and ambience microphones

On the use of integral input devices and relative mappings for an immersive audio mixing task. Part II.

In-Depth Latency and Reliability Analysis of a Networked Music Performance over Public 5G Infrastructure

Comparisons between VBAP and WFS using Spatial Sound Synthesis

Spider Design Optimization At The Buckling Limit

ABOUT AES

Contact Us