Bulk download: Download Zip archive of all papers from this Journal issue
Music producers and casual users often seek to replicate dynamic range compression used in a particular recording or production context for their own track. However, not knowing the parameter settings used to produce the audio using the effect may become an impediment, especially for beginners or untrained users who may lack critical listening skills. We address this issue by presenting an automatic compressor plugin relying on a neural network to extract relevant features from a reference signal and estimate compression parameters. The plugin automatically adjusts its parameters to match the input signal with a reference audio recording as closely as possible. Quantitative and qualitative usability evaluation of the plugin was conducted with amateur, pro-amateur and professional music producers. The results established acceptance of the core idea behind the proposed control method across these user groups.
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.
Start a discussion about this paper!
This paper evaluates the ability of several algorithms to detect impulse distortions (clicks) in audio signals. The systems are evaluated against data from a listening test conducted using real audio signals provided by a vinyl manufacturer. Some of the signals contained clicks due to damage during the manufacturing process. An evaluation of click detection algorithms against listening test results focuses on the ability of the click-detection algorithms to detect perceptible clicks. The results presented in this paper show that an algorithm that employs a hearing model detected audible clicks with a lower false detection rate than the other algorithms in the test and that the wavelet transform–based algorithm with a dynamic threshold outperformed the other algorithms.
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.
Start a discussion about this paper!
The Wave Digital Filter (WDF) formalism is becoming a popular approach for the digital emulation of audio circuits. Nonlinear WDFs, like other kinds of discrete-time nonlinear filters used in Virtual Analog modeling applications, are often affected by aliasing distortion. Recently formalized Antiderivative Antialiasing (ADAA) methods are capable of significant aliasing reduction even with low oversampling factors. This paper discusses different strategies to integrate pth-order ADAA methods into stateful WDFs with a single one-port or multiport nonlinearity while preserving the modularity property typical of traditional WDFs. The effectiveness of the proposed approach is verified by applying the discussed ADAA techniques to three nonlinear audio circuits containing diode-based nonlinearities and a BJT transistor.
Download Now (1.1 MB)
This paper is Open Access which means you can download it for free.
Start a discussion about this paper!
We propose time-varying Schroeder allpass filters and Gerzon allpass reverberators that remain energy preserving irrespective of arbitrary variation of their allpass gains or feedback matrices over time. We propose various ways of realizing the unitary matrix involved in the Schroeder structure, based on classic ladder and lattice filters and their generalizations. We show how to construct more elaborate structures including nestings and cascade, giving various strategies for reducing their implementation cost. Extending these algorithms to the multi-input, multi-output case yields time-varying, energy-preserving generalizations of Gerzon’s reverberator, providing a link between Schroeder allpass filters and Schelcht’s recently proposed “Allpass Feedback Delay Networks.” Stability proofs are given for common uses of Schroeder allpass filters, such as inside of Feedback Delay Network reference structures. Finally we give a substantial review of the properties of time-invariant Schroeder allpass filters.
Download Now (2.0 MB)
This paper is Open Access which means you can download it for free.
Start a discussion about this paper!
Sound field rendering is computation-intensive and memory-intensive. This research investigates an FPGA-based accelerator for sound field rendering with an FDTD scheme, in which wave equations are directly implemented by reconfigurable hardware, and spatial blocking is applied to alleviate the memory bandwidth requirement. Compared to software simulation performed on a desktop machine with 128 GB DDR4 RAMs and an Intel i7-7820X processor running at 3.6 GHz, the proposed FPGA-based accelerator achieves up to 2.98 times more in computing performance in the case of different layer sizes and different numbers of nodes computed in parallel even though the FPGA system runs at about 267 MHz.
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.
Start a discussion about this paper!
Delay Network reverberators are an efficient tool for synthesizing reverberation. We propose a novel architecture, called the Grouped Feedback Delay Network (GFDN) reverberator, with groups of delay lines sharing different target decay rates, and use it to simulate coupled room acoustics. Coupled spaces are common in apartments, concert halls, and churches where two or more volumes with different reverberation characteristics are linked via an aperture. The difference in reverberation times (T60s) of the coupled spaces leads to unique phenomena, such as multi-stage decay. Here the GFDN is used to simulate coupled spaces with groups of delay line filters representing the T60s of the coupled rooms. A parameterized, orthonormal mixing matrix is presented that provides control over the mixing times of the rooms and amount of coupling between the rooms. As an example application we measure a coupled bedroom and bathroom system separated by a door in an apartment and use the GFDN to synthesize the late field for different openings of the door separating the two rooms, thereby varying coupling between the rooms.
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.
Start a discussion about this paper!
Despite the popularity of guitar effects there is very little existing research on classification and parameter estimation of specific plugins or effect units from guitar recordings. In this paper, convolutional neural networks were used for classification and parameter estimation for 13 overdrive, distortion, and fuzz guitar effects. A novel dataset of processed electric guitar samples was assembled, with four sub-datasets consisting of monophonic or polyphonic samples and discrete or continuous settings values, for a total of about 250 hours of processed samples. Results were compared for networks trained and tested on the same or a different subdataset. We found that discrete datasets could lead to equally high performance as continuous ones while being easier to design, analyze, and modify. Classification accuracy was above 80%, with confusion matrices reflecting similarities in the effects timbre and circuits design. With parameter values between 0.0 and 1.0, the mean absolute error is in most cases below 0.05, while the root mean square error is below 0.1 in all cases but one.
Download Now (836 KB)
This paper is Open Access which means you can download it for free.
Start a discussion about this paper!
A modeling system for the impulse responses (IRs) of reverberators is presented. The overarching purpose of this system is to offer similar levels of control over captured IRs to that of algorithmic reverberators while retaining their acoustic plausibility and, where desired, realism. Specifically an approach to estimating the parameters of the model is presented that offers a significant reduction in the computational requirements of the matrix decomposition method ESPRIT, while offering vastly improved quality than is possible by using a single Fourier analysis. These methods are compared, first on large sets of short-duration synthetic signals and then on a wide range of typical IRs, some many seconds in duration. Finally systems that employ the model described and the analysis method it uses are discussed.
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.
Start a discussion about this paper!
This article further explores a previously proposed gray-box neural network approach to modeling LFO (low-frequency oscillator) modulated time-varying audio effects. The network inputs are both the unprocessed audio and LFO signal. This allows the LFO to be freely controlled after model training. This paper introduces an improved process for accurately measuring the frequency response of a time-varying system over time, which is used to annotate the neural network training data with the LFO of the effect being modeled. Accuracy is improved by using a frequency domain synthesized chirp signal and using shorter and more closely spaced chirps. A digital flanger effect is used to test the accuracy of the method and neural network models of two guitar effects pedals, a phaser and flanger, were created. The improvement in the system measurement method is reflected in the accuracy of the resulting models, which significantly outperform previously reported results. When modeling a phaser and flanger pedal, error-to-signal ratios of 0.2% and 0.3% were achieved, respectively. Previous work suggests errors of this size are often inaudible. The model architecture can run in real time on a modern computer while using relatively little processing power.
Download Now (2.2 MB)
This paper is Open Access which means you can download it for free.
Start a discussion about this paper!
This paper is concerned with the modeling of ferromagnetic coils with audio applications in mind. The proposed approach derives a macroscopic, energy-based formulation from statistical physics. This choice allows for thermodynamic variables to be explicitly taken into account. As a consequence, macroscopic features such as saturation and hysteresis arise directly. As the proposed model is expressed through a port-Hamiltonian formulation, power balance and passivity are guaranteed. Moreover the model may be straightforwardly connected to other multi-physical components and included in more complex systems. The proposed model is compared to measurements on a real ferromagnetic coil. Simulations of a passive band-pass filter and a transformer built around the model are presented as an illustration.
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.
Start a discussion about this paper!