


Last Updated: 20060425, mei

P17 - Posters: Signal Processing and High-Resolution Audio
Monday, May 22, 09:00 to 10:30

P17-1 All Amplifiers Are Analog, but Some Amplifiers Are More Analog than Others
Bruno Putzeys, Hypex Electronics B.V. - Groningen, The Netherlands; André Veltman, Paul van der Hulst, Piak Electronic Design b.v. - Culemborg, The Netherlands; René Groenenberg, Mueta b.v. - Wijk en Aalburg, The Netherlands

This paper intends to clarify the terms “digital” and “analog” as applied to class-D audio power amplifiers. Since loudspeaker terminals require an analog voltage, an audio power amplifier must have an analog output. If its input is digital, digital-to-analog conversion is necessarily executed at some point. Once a designer acknowledges the analog output properties of a class-D power stage, amplifier quality can improve. The incorrect assumption that some amplifiers are digital causes many designers to come up with complicated patches for ordinary analog phenomena such as timing distortion or supply rejection. This irrational approach blocks the way to a rich world of well-established analog techniques that avoid and correct many of these problems and realize otherwise unattainable characteristics such as excellent THD+N and extremely low output impedance throughout the audio band. [Poster Presentation Associated with Paper Presentation P8-1] Convention Paper 6690

P17-2 Toward an Ideal Switching (Class-D) Power Amplifier: How to Control the Flow of Power in a Switching Power Circuit
Rolf Esslinger, Dieter Jurzitza, Harman/Becker Automotive Systems - Karlsbad, Germany

The design of a switching (class-D) audio power amplifier suitable for high-end audio applications is still a very challenging task for circuit design and signal processing engineers.
Classical power stage topologies using Pulse-Width Modulation (PWM) in combination with voltage-controlled MOSFET H-bridges are already available on the market, but their performance in terms of signal bandwidth and linearity is still far below that of traditional class-A and A/B power stages. Moreover, EMC is an issue that is very hard to control. In this paper, class-D output stages are considered from a totally different point of view: the flow of power in the output stage, which contains the switching power stage as a power control element, the output filter as an energy store, and the load as both a power sink and, when the load is not a resistor but a real-world loudspeaker, a power source. It is shown where in a typical power stage the power loss, which is dissipated as heat, occurs. To improve the quality and efficiency of high-frequency switched power stages, one has to investigate how to control the flow of power into the storage elements and how to charge them most precisely and most efficiently. Some fundamental approaches for this will be shown in this paper. [Poster Presentation Associated with Paper Presentation P8-2] Convention Paper 6691

P17-3 PWM Amplifier Control Loops with Minimum Aliasing Distortion
Lars Risbo, Texas Instruments Denmark A/S - Lyngby, Denmark; Claus Neesgaard, Texas Instruments Inc. - Dallas, TX, USA

PWM class-D audio power amplifiers typically contain a control loop filter network and a comparator producing the PWM signal. The comparator performs a sampling operation whenever it changes state. A previous paper by the author analyzed this sampling behavior from a small-signal point of view. The present paper attempts to formulate a large-signal model that accounts for the nonlinear effects of the sampling due to aliasing of high-frequency carrier components. Closed-form expressions for the intrinsic THD of the traditional first- and second-order loops are derived.
The model is validated using simulations, and a class of Minimum Aliasing Error (MAE) loop filters is presented that obtains minimum aliasing distortion thanks to the use of quadrature sampling. Finally, measurement data are presented for real applications using the principles described. [Poster Presentation Associated with Paper Presentation P8-4] Convention Paper 6693

P17-4 Simple, Ultralow Distortion Digital Pulse Width Modulator
Bruno Putzeys, Hypex Electronics BV - Groningen, The Netherlands

A core problem with digital pulse width modulators is that effective sampling occurs at signal-dependent intervals, falsifying the z-transform on which the input signal and the noise shaping process are based. In a first step the noise shaper is reformulated to operate at the timer clock rate instead of the pulse repetition frequency. This solves the uniform/natural sampling problem, but gives rise to new nonlinearities akin to ripple feedback in analog modulators. By modifying the feedback signal such that it reflects only the modulated edge of the pulse train, this effect is practically eliminated, yielding vastly reduced distortion without increasing complexity. [Poster Presentation Associated with Paper Presentation P8-5] Convention Paper 6694

P17-5 A High Performance Open Loop All-Digital Class-D Audio Power Amplifier Using Zero Positioning Coding (ZePoC)
Olaf Schnick, Wolfgang Mathis, University of Hannover - Hannover, Germany

Open-loop all-digital class-D amplifiers are uncommon because the lack of a correcting feedback path leads to several problems that result in high distortion compared with analog-controlled class-D amplifiers. This paper shows that SB-ZePoC lowers the switching frequency to 100 kHz. As a result, these problems can be solved, making it possible to design an open-loop all-digital class-D audio amplifier with low total distortion over the whole audio band (20 Hz to 20 kHz) and an efficiency that reaches 90 percent.
Results of a test setup will be presented. The sonic performance will be demonstrated during the session. [Poster Presentation Associated with Paper Presentation P8-6] Convention Paper 6695

P17-6 A Three-Level Trellis Noise Shaping Converter for Class D Amplifiers
Ludovico Ausiello, Riccardo Rovatti, University of Bologna - Bologna, Italy; Gianluca Setti, University of Ferrara - Ferrara, Italy

Class D amplifiers can represent signals with three different output levels, +V_{cc}, 0, -V_{cc}, with no distortion. Exploiting this in order to achieve better performance with no increase in switching frequency, an extension to the classic pulse width modulation two-level A/D conversion is proposed. Coding is achieved by extending output waveforms of a trellis-based sigma delta modulation to three levels. Simulation results have shown that, using the same symbol rate, a three-level pattern achieves from 3.7 to 8.2 dB of SINAD improvement and a power consumption up to 5 times smaller. [Poster Presentation Associated with Paper Presentation P8-7] Convention Paper 6696

P17-7 Using SIP Techniques to Verify the Trade-Off between SNR and Information Capacity of a Sigma Delta Modulator
Charlotte Yuk-Fan Ho, Joshua Reiss, Queen Mary, University of London - London, UK; Bingo Wing-Kuen Ling, King’s College London - London, UK

The Gerzon-Craven noise shaping theorem states that the ideal information capacity of a sigma delta modulator design is achieved if and only if the noise transfer function (NTF) is minimal phase. In this paper it is found that there is a trade-off between the signal-to-noise ratio (SNR) and the information capacity of the noise shaped channel. In order to verify this result, loop filters satisfying and not satisfying the minimal phase condition of the NTF are designed via semi-infinite programming (SIP) techniques and solved using dual parameterization.
Numerical simulation results show that the design with a minimal phase NTF achieves near the ideal information capacity of the noise shaped channel, but the SNR is low. On the other hand, the design with a non-minimal phase NTF achieves a positive value of the information capacity of the noise shaped channel, but the SNR is high. Results are also provided that compare the SIP design technique with Butterworth and Chebyshev structures and ideal theoretical SDMs, and evaluate the performance in terms of SNR and a variety of information theoretic measures which capture noise shaping qualities. [Poster Presentation Associated with Paper Presentation P8-8] Convention Paper 6697

P17-8 Estimation of Initial States of Sigma-Delta Modulators
Charlotte Yuk-Fan Ho, Queen Mary, University of London - London, UK; Bingo Wing-Kuen Ling, King’s College London - London, UK; Joshua Reiss, Queen Mary, University of London - London, UK

In this paper an initial condition of a sigma-delta modulator is estimated based on quantizer output bit streams and an input signal. The set of initial conditions that generate a stable trajectory is characterized. It is found that this set, as well as the set of initial conditions corresponding to the quantizer output bit streams, is convex. Also, it is found that the mapping from the set of initial conditions to the stable admissible set of quantizer output bit streams is invertible if the loop filter is unstable. Hence, the initial condition corresponding to given stable admissible quantizer output streams and an input signal is uniquely defined when the loop filter is unstable, and a projection onto convex sets approach is employed for approximating the initial condition. [Poster Presentation Associated with Paper Presentation P8-9] Convention Paper 6698

P17-9 Clean Clocks, Once and for All?
Christian G. Frandsen, TC Electronic A/S - Risskov, Denmark; Chris Travis, Sonopsis Ltd. - Wotton-under-Edge, Gloucestershire, UK

Network-based digital audio interfaces are becoming increasingly popular, but they pose a significant jitter problem wherever high-quality conversion to/from analog is required. This is true even with networks such as 1394 that provide dedicated support for isochronous flows. Conventional PLL solutions have too little jitter attenuation, too much intrinsic jitter, and/or too narrow a frequency range. More advanced solutions tend to have too high a cost. A new clocking technology that boasts high performance and low cost is presented. It has been implemented in a recent audio-over-1394 chip. We show comparative performance results and explore system-level implications, including for systems that use point-to-point links such as AES3, SPDIF, and ADAT. [Poster Presentation Associated with Paper Presentation P8-11] Convention Paper 6700

P17-10 SigmaStudio. A User-Friendly, Intuitive and Expandable, Graphical Development Environment for Audio/DSP Applications
Miguel Chavez, Camille Huin, Analog Devices, Inc. - Wilmington, MA, USA

Graphical development environments have been used in the audio industry for a number of years. Those with fewer limitations have persisted and found a well-established pool of users who are reluctant to modify their design patterns and adopt different embedded processors and design environments. This paper provides a short history of the evolution of integrated development environments (IDEs). It then describes and explains the software architecture decisions and design challenges involved in developing SigmaStudio. It will also show the advantages that those decisions have brought to the SigmaDSP family of audio-centric embedded processors.
[Poster Presentation Associated with Paper Presentation P12-1] Convention Paper 6714

P17-11 Adaptive Filters in Wavelet Transform Domain
Vladan Bajic, Audio-Technica US - Stow, OH, USA

This paper presents a performance comparison between two methods of implementing adaptive filtering algorithms for noise reduction, namely the Normalized time-domain Least Mean Squares (NLMS) algorithm and the Wavelet transform domain LMS (WLMS). A brief theoretical development of both methods is given, and then both algorithms are implemented on a real-time Digital Signal Processing (DSP) system used for audio signal processing. Results are presented showing the performance of each algorithm in both the time and frequency domains. Noise reduction effects produced by the different algorithms are shown across the spectrum, and distorting effects are analyzed. Trade-offs of convergence speed versus added noise are analyzed. Overall results show a convergence speed improvement when using WLMS algorithms over the NLMS algorithm. [Poster Presentation Associated with Paper Presentation P12-3] Convention Paper 6716

P17-12 Adaptive Time-Frequency Resolution for Analysis and Processing of Audio
Alexey Lukin, Moscow State University - Moscow, Russia; Jeremy Todd, iZotope, Inc. - Cambridge, MA, USA

Filter banks with fixed time-frequency resolution, such as the Short-Time Fourier Transform (STFT), are a common tool for many audio analysis and processing applications, allowing effective implementation via the Fast Fourier Transform (FFT). The fixed time-frequency resolution of the STFT can lead to the undesirable smearing of events in both time and frequency. In this paper we suggest adaptively varying the STFT time-frequency resolution in order to reduce filter bank-specific artifacts while retaining adequate frequency resolution. Several strategies for systematic adaptation of time-frequency resolution are proposed.
The introduced approach is demonstrated as applied to spectrogram displays, noise reduction, and spectral effects processing. [Poster Presentation Associated with Paper Presentation P12-4] Convention Paper 6717

P17-13 Advanced Methods for Shaping Time-Frequency Areas for the Selective Mixing of Sounds
Piotr Kleczkowski, AGH University of Science and Technology - Krakow, Poland; Adam Kleczkowski, University of Cambridge - Cambridge, UK

The “Selective Mixing of Sounds” (AES 119th Convention Paper 6552) contains a large and conceptually challenging part, which had not been developed previously. This is a method of determining the areas of dominance by different tracks in the time-frequency plane. It has a major effect on the overall quality of the sound. In this paper we propose and compare a range of appropriate algorithms. We begin with a simple two-dimensional running mean combined with a rule selecting the track characterized by the maximum energy, followed by a low-pass filter based on the two-dimensional Fourier transform. We also propose two novel methods based on the Monte Carlo approach, in which local probabilistic rules are iterated many times to produce a required level of smoothing. [Poster Presentation Associated with Paper Presentation P12-5] Convention Paper 6718

P17-14 Demixing Commercial Music Productions via Human-Assisted Time-Frequency Masking
Marc Vinyes, Jordi Bonada, Alex Loscos, Pompeu Fabra University - Barcelona, Spain

Blind audio separation in real commercial music recordings is still an open problem. In the last few years some techniques have provided interesting results. This paper presents a human-assisted clustering of the DFT coefficients for the time-frequency masking demixing technique. The DFT coefficients are grouped by adjacent pan, inter-channel phase difference, magnitude, and magnitude-variance with a real-time interactive graphical interface.
Results show that an implementation of this technique can be used to demix tracks from present-day commercial songs. Sample sounds can be found at http://www.iua.upf.es/~mvinyes/abs/demos. [Poster Presentation Associated with Paper Presentation P12-6] Convention Paper 6719

P17-15 Enhanced Control of Sound Field Radiated by Co-Axial Loudspeaker Systems Using Digital Signal Processing Techniques
Hmaied Shaiek, ENST de Bretagne - Brest Cedex, France; Bernard Debail, Cabasse Acoustic Center - Plouzané, France; Jean Marc Boucher, ENST de Bretagne - Brest Cedex, France; Yvon Kerneis, Pierre Yves Diquelou, Cabasse Acoustic Center - Plouzané, France

In multi-way loudspeaker systems, digital signal processing techniques have been used so far mainly to correct frequency response, time alignment, and off-axis lobing. In this paper a dedicated signal processing technique is described that also controls the sound field radiated by co-axial loudspeaker systems in the overlap frequency band of the drivers. Trade-offs and practical constraints (crossover, time shift, gain, etc.) are discussed and an optimization algorithm is proposed to provide the best achievable result. A real-time implementation of this technique is presented and leads to a nearly ideal point source. [Poster Presentation Associated with Paper Presentation P12-10] Convention Paper 6723

P17-16 Network Music Performance (NMP) in Narrow Band Networks
Alexander Carôt, International School of New Media (ISNM) - Lübeck, Germany; Ulrich Krämer, Gerald Schuller, Fraunhofer Institute for Digital Media Technology - Ilmenau, Germany

Playing live music on the Internet is one of the hardest disciplines in terms of low-delay audio capture and transmission, time synchronization, and bandwidth requirements. This has already been successfully evaluated with the Soundjack software, which can be described as a low-latency UDP streaming application.
In combination with the new Fraunhofer ULD Codec this technology could now be used in narrow band DSL networks without a significant increase of latency. This paper first describes the essential basics of network music performances in terms of soundcard and network issues and finally reviews the context under DSL narrow band network restrictions and the usage of the ULD Codec. [Poster Presentation Associated with Paper Presentation P12-11] Convention Paper 6724

P17-17 Intensive Noise Reduction Utilizing Inharmonic Frequency Analysis of GHA
Teruo Muraoka, University of Tokyo - Komaba Meguro-ku, Tokyo, Japan; Ryuji Takamizawa, Matsushita Electric Industrial Co., Ltd. - Kadoma City, Osaka, Japan; Yoshihiro Kanda, Musashi Institute of Technology - Tamadutumi Setagaya, Tokyo, Japan; Takumi Ohta, Kenwood Corporation - Hachiouji City, Tokyo, Japan

Removal of noise in SP record reproduction was attempted utilizing GHA (Generalized Harmonic Analysis) as inharmonic frequency analysis. Spectrum subtraction is the most common of the conventional noise reduction techniques; however, it has the side effect of generating musical noise. This is caused by the inaccurate frequency resolution inherent to conventional harmonic frequency analysis. GHA, a method of inharmonic frequency analysis, has excellent frequency resolution and has recently been put into practical use. The authors applied GHA to noise reduction and obtained better results than with conventional spectrum subtraction. However, musical noise problems still remained, mainly because of a spectral mismatch between the presampled reference noise and the actual residual noise. The authors tried several countermeasures, such as pre-spectral shaping of the object signal and spectral similarity calculation of the residual noise. By combining these countermeasures, the authors achieved satisfactory noise reduction.
[Poster Presentation Associated with Paper Presentation P12-12] Convention Paper 6725

P17-18 Multichannel Noise-Reduction-Systems for Speaker Identification in an Automotive Environment
Volker Mildner, Stefan Goetze, Karl-Dirk Kammeyer, University Bremen - Bremen, Germany

Devices for communication and information utilized by car drivers face two essential requirements: hands-free operation via distant microphones and robustness against different noises depending on car speed, etc. Automatic speaker identification can be utilized within such devices either to supply speech recognition systems with so-called a priori information to achieve higher recognition rates or even to enable applications such as heating systems to adjust to the preferences of the driver. Thus identifying the driver from a predefined group of possible system users may be a task for future applications. The aim in this paper is to investigate to what extent multichannel noise reduction systems are suitable for improving the performance of speaker identification algorithms under different acoustic conditions in an automotive environment. Convention Paper 6756

P17-19 Optimal Quantized Linear Prediction Coefficients for Lossless Audio Compression: Scalar Quantization Revisited
Florin Ghido, Tampere University of Technology - Tampere, Finland

Uniform scalar quantization of linear prediction coefficients is traditionally done by multiplying each coefficient by Q = 2^{B} and rounding it to the nearest integer. We propose an improved, optimal quantization method by replacing the rounding with a more elaborate procedure. The method uses 2 bits less per quantized prediction coefficient for a similar misadjustment and allows an accurate estimate of the misadjustment as a function of Q. We introduce several efficient time-constrained probabilistic search methods for obtaining near-optimal solutions.
No changes are required at the decoder, and the method is applicable to a wider range of cases (mono, stereo, and multichannel prediction) than quantization of reflection coefficients. Moreover, it enables near-optimal compression for 24-bit audio using only 32-bit arithmetic operations. Convention Paper 6757

P17-20 Efficient Out of Head Localization System for Mobile Applications
Tacksung Choi, Yonsei University - Seoul, Korea; Young Cheol Park, Yonsei University - Wonju, Korea; Dae Hee Youn, Yonsei University - Seoul, Korea

Headphone reproduction of stereo sources often gives in-the-head localization. One possible solution to this problem is to add directional filtering and room response to the headphone reproduction system. Conventional out of head localization (OHL) schemes usually consist of a tapped delay line to simulate the direct signal path and early room reflections. Each of the taps must be filtered by a pair of HRTFs, which leads to a very high processing cost. Our study is based on the fact that spatial impression (SI) can increase the effects of OHL. Our aim is to generate the maximum SI at a minimum cost. Through subjective listening tests, the degree of SI was found to be greatest for reflections arriving within 15 to 30 ms of the direct sound, and greatest for those in the opposite direction to the listener’s ears. Based on the test results, we propose an efficient OHL system. In the proposed system, multiple reflections are replaced by a pair of reflections, and the HRTF filtering required to simulate the directivity of the reflections is implemented using a set of first-order IIR shelving filters. According to the subjective tests, the proposed system efficiently creates OHL at a low computational cost, and its performance is comparable to that of the conventional high-complexity scheme.
Convention Paper 6758

P17-21 A Psychoacoustic Noise Reduction Approach for Stereo Hands-Free Systems
Stefan Goetze, Volker Mildner, Karl-Dirk Kammeyer, University of Bremen - Bremen, Germany

One demand for comfortable, high-quality hands-free video conferencing systems is the transmission of a spatial acoustical impression. Therefore a major task is the transmission of stereo speech signals from a noisy environment. The suppression of the noise components must not corrupt the stereo effect. In this context different single channel, multichannel, and hybrid speech enhancement systems will be evaluated in this paper. The problem of musical noise in post-filter algorithms is addressed. Therefore, a psychoacoustic masking threshold for the noise reduction algorithms is considered. Convention Paper 6759

P17-22 Estimation of Talker’s Head Orientation Based on Oriented Global Coherence Field
Alessio Brutti, ITC-irst - Trento, Italy, Università di Trento, Trento, Italy; Maurizio Omologo, Piergiorgio Svaizer, ITC-irst - Trento, Italy

This work describes a new method for estimating the orientation of a non-omnidirectional sound source given a distributed microphone network. The technique requires that a set of microphone pairs be distributed over a room, and it exploits the coherence computed from each sensor pair in order to derive an estimation of the head orientation. A database consisting of an audio sequence reproduced by a loudspeaker with different orientations and different positions was collected in order to evaluate the algorithm behavior. Experiments conducted on that database show that our approach can provide an efficient estimation of the sound source orientation, with an RMS error of about 10 degrees. Satisfactory performance was confirmed by tests with real human speakers.
Convention Paper 6760

P17-23 High-Quality Blind Bandwidth Extension of Audio for Portable Player Applications
Manish Arora, Joonhyun Lee, Sangil Park, Samsung Electronics Co. Ltd. - Suwon, Korea

Bandwidth limitation in lossy audio coding schemes significantly reduces the perceived quality. High-frequency bandwidth extension schemes have been proposed but are difficult to implement in the applications where they are needed most: portable audio devices with severe complexity constraints. This paper describes a high-quality blind bandwidth extension method that proposes efficient initial audio bandwidth detection, band-based nonlinear processing, and simple regenerated spectral envelope shaping enhancements. Objective and subjective measurements of the processed signal have yielded significant quality improvements with very low complexity requirements, allowing easy implementation on a wide variety of portable player platforms. Convention Paper 6761

P17-24 Coherence Enhanced Minimum Statistics Spectral Subtraction in Bi-microphone Systems
Jonathan Fillion-Deneault, Roch Lefebvre, Sherbrooke University - Sherbrooke, Quebec, Canada

A novel system for 2-channel spectral subtraction is presented. The objective is to improve the intelligibility of speech in noisy environments by enhancing the noise reduction of single-microphone techniques as well as to greatly reduce the amount of musical noise that they introduce. The system consists of two blocks: a generalized spectral subtraction block on the primary channel, using minimum statistics for noise estimation, followed by a coherence-based post-filter for additional noise suppression. Subjective and objective testing of both simulated and real-world recordings shows that listeners prefer the proposed system to other state-of-the-art speech enhancement techniques.
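As a rough illustration of the first block named in the abstract above, the following minimal Python sketch applies a common textbook form of generalized spectral subtraction to one frame of magnitude values, with a crude minimum-tracking noise estimate for a single frequency bin. It is not the authors' system: the function names, the parameters `alpha`, `beta`, `power`, `smoothing`, and `window`, and all numeric values are assumptions chosen for illustration, and the bias compensation of a full minimum statistics estimator and the coherence-based post-filter are omitted.

```python
def spectral_subtract(noisy_mag, noise_mag, alpha=2.0, beta=0.01, power=2.0):
    """Generalized spectral subtraction on one frame of magnitude values.

    alpha over-subtracts the noise estimate; beta sets a spectral floor so
    that no bin is driven to zero. Both names are hypothetical.
    """
    cleaned = []
    for x, n in zip(noisy_mag, noise_mag):
        residual = x ** power - alpha * n ** power  # subtract scaled noise
        floor = beta * n ** power                   # lower bound per bin
        cleaned.append(max(residual, floor) ** (1.0 / power))
    return cleaned


def track_noise_floor(powers, smoothing=0.5, window=3):
    """Crude minimum tracker for one frequency bin (assumed simplification).

    The power sequence is recursively smoothed, and the noise estimate at
    each frame is the minimum of the last `window` smoothed values.
    """
    smoothed, estimates = [], []
    state = powers[0]
    for p in powers:
        state = smoothing * state + (1.0 - smoothing) * p
        smoothed.append(state)
        estimates.append(min(smoothed[-window:]))
    return estimates


# Toy data: two frequency bins of one frame, and four frames of one bin.
cleaned = spectral_subtract([3.0, 1.0], [1.0, 1.0])
floor_track = track_noise_floor([4.0, 4.0, 8.0, 4.0])
```

In this toy frame the first bin stays well above the floor while the second is clamped to it. Isolated bins that survive the subtraction from frame to frame are the usual source of the musical noise mentioned in the abstract.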
Convention Paper 6762

P17-25 Sound Field Analysis Based on Generalized Prolate Spheroidal Wave
Mathieu Guillaume, Yves Grenier, Télécom Paris - Paris, France

This paper describes an array processing method that improves the quality of sound field analysis, which aims to extract the spatial properties of a sound field. In this domain, spatial aliasing inevitably occurs due to the finite number of microphones used in the array. It is linked to the Fourier transform of the discrete analysis window, which consists of a mainlobe, fixing the resolution achievable by the spatial analysis, and of sidelobes, which degrade the quality of spatial analysis by introducing artifacts not present in the original sound field. A method to design an optimal analysis window with respect to a particular wave vector is presented, aiming to realize the best localization possible in the wave vector domain. The efficiency of the approach is then demonstrated for several geometrical configurations of the microphone array, over the whole bandwidth of the sound fields. Convention Paper 6763

P17-26 Optimization of Co-centered Rigid and Open Spherical Microphone Arrays
Abhaya Parthy, Craig Jin, André van Schaik, University of Sydney - Sydney, New South Wales, Australia

We present a novel microphone array that consists of an open spherical array with a smaller rigid spherical array at its center. The distribution of microphones that gives the array the largest frequency range for a given beamforming order was obtained by analyzing microphone errors. For a fixed number of microphones, the results for several examples indicate that the maximum frequency range is obtained when the microphones are relatively evenly distributed between the open and rigid spheres.
Convention Paper 6764

P17-27 Review and Discussion on Classical STFT-Based Frequency Estimators
Michaël Betser, Patrice Collen, France Télécom R&D - Cesson-Sévigné, France; Gaël Richard, Bertrand David, Telecom Paris - Paris, France

Sinusoidal modeling is based on the decomposition of audio signals into a sum of sinusoidal components plus a noise residual part. It involves accurate estimation of the sinusoid parameters and, in particular, accurate frequency estimation. A broad category of methods uses the Fast Fourier Transform (FFT) as a starting point to compute frequency. All these methods present very similar forms of estimators, but the relations between them are not yet fully understood. This paper proposes to take a deeper look into these relations. The first goal of this paper is to present a clear review and description of the classical FFT-based frequency estimators. A new estimator similar to the phase vocoder is presented. The second goal of this paper is to identify the common hypotheses and the common steps of the processes for this category of estimators. Last, experimental comparisons are given. Convention Paper 6765

P17-28 Accurate Phase Estimation for Chirp-Like Signals
Michaël Betser, Patrice Collen, Jean-Bernard Rault, France Télécom R&D - Cesson-Sévigné, France

Sinusoidal modeling relies on the decomposition of a given signal (continuous or discrete) into a set of sinusoidal components plus a residual signal. The sinusoidal parameters, namely the amplitude, frequency, and phase, may vary over time. Generally, the tracking of these parameters is performed via Short-Time Fourier Transform (STFT) analysis, ultimately providing, for each sinusoidal component, estimates of the amplitude, frequency, and phase for a considered time slot. The duration of the analysis time slots is chosen in order to guarantee that the signal under analysis is stationary enough to deliver useful data.
If this requirement is not met, in particular if the frequency varies in the analysis slot, the phase estimation is biased. This paper introduces a method to estimate and to correct this bias as a function of the analysis parameters (window type and size) and of the frequency slope. Convention Paper 6766

P17-29 Equalization of Audio Systems Using Kautz Filters with Log-Like Frequency Resolution
Tuomas Paatero, Matti Karjalainen, Helsinki University of Technology - Espoo, Finland

This paper presents a new digital filtering approach to the equalization of audio systems such as loudspeaker and room responses. The equalization scheme utilizes a particular infinite impulse response (IIR) filter configuration called Kautz filters, which can be seen as generalizations of finite impulse response (FIR) filters and their warped counterparts. The desired frequency resolution allocation, in this case the logarithmic one, is attained by a chosen set of fixed pole positions that define the particular Kautz filter. The frequency resolution mapping is characterized by the all-pass part of the Kautz filter, which is interpreted as a formal generalization of the warping concept. The second step in the actual equalizer design consists of assigning the Kautz filter tap-output weights, which is then, in turn, more or less a standard least-squares configuration. The proposed method is demonstrated using measured loudspeaker and room responses. Convention Paper 6767
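The two design steps described in the last abstract (fixing a pole set, then assigning tap-output weights by least squares) can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the authors' implementation: it handles real poles only, whereas a log-like resolution would normally call for complex-conjugate pole pairs, and the pole values, the toy target response, and all function names are hypothetical.

```python
import math


def one_pole(x, pole, gain):
    """Filter x with gain / (1 - pole * z^-1)."""
    y, state = [], 0.0
    for sample in x:
        state = gain * sample + pole * state
        y.append(state)
    return y


def allpass(x, pole):
    """Filter x with the first-order all-pass (z^-1 - pole) / (1 - pole * z^-1)."""
    y, x_prev, y_prev = [], 0.0, 0.0
    for sample in x:
        out = -pole * sample + x_prev + pole * y_prev
        y.append(out)
        x_prev, y_prev = sample, out
    return y


def kautz_tap_outputs(x, poles):
    """Return one output signal per tap of a real-pole Kautz chain."""
    taps, chain = [], list(x)
    for pole in poles:
        # Tap output: normalized one-pole section fed by the all-pass chain.
        taps.append(one_pole(chain, pole, math.sqrt(1.0 - pole * pole)))
        # Extend the all-pass chain for the next tap.
        chain = allpass(chain, pole)
    return taps


def fit_tap_weights(target, poles):
    """Least-squares tap weights for a target impulse response.

    The tap impulse responses are orthonormal, so the least-squares
    solution reduces to inner products (up to truncation error).
    """
    impulse = [1.0] + [0.0] * (len(target) - 1)
    basis = kautz_tap_outputs(impulse, poles)
    return [sum(h * g for h, g in zip(target, b)) for b in basis]


poles = [0.5, -0.3, 0.8]                 # hypothetical pole set, |pole| < 1
target = [0.5 ** n for n in range(400)]  # toy target: a one-pole decay
weights = fit_tap_weights(target, poles)
```

Because the toy target shares its pole with the first section, only the first weight is significantly different from zero. For a measured loudspeaker or room response, the target would instead be the desired equalizer response, and the choice of poles would determine where the frequency resolution is concentrated.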

(C) 2006, Audio Engineering Society, Inc. 
