V. Välimäki, and S. Bilbao, "Giant FFTs for Sample-Rate Conversion," J. Audio Eng. Soc., vol. 71, no. 3, pp. 88-99, (2023 March.). doi: https://doi.org/10.17743/jaes.2022.0061
V. Välimäki, and S. Bilbao, "Giant FFTs for Sample-Rate Conversion," J. Audio Eng. Soc., vol. 71 Issue 3 pp. 88-99, (2023 March.). doi: https://doi.org/10.17743/jaes.2022.0061
Abstract: The audio industry uses several sample rates interchangeably, and high-quality sample-rate conversion is crucial. This paper describes a frequency-domain sample-rate conversion method that employs a single large ("giant") fast Fourier transform (FFT). Large FFTs, corresponding to the duration of a track or full-length album, are now extremely fast, with execution times on the order of a few seconds on standard commercially available hardware. The method first transforms the signal into the frequency domain, possibly using zero-padding. The key part of the technique modifies the length of the spectral buffer to change the ratio of the audio content to the Nyquist limit. For up-sampling, an appropriate number of zeros is inserted between the positive and negative frequencies. In down-sampling, the spectrum is truncated. Finally, the inverse FFT synthesizes a time-domain signal at the new sample rate. The proposed method does not result in surviving folded spectral images, which occur in some instances with timedomain methods. However, it causes ringing at the Nyquist limit, which can be suppressed by tapering the spectrum and by low-pass filtering. The proposed sample-rate conversion method is targeted to offline audio applications in which sound files need to be converted between sample rates at high quality.
@article{välimäki2023giant,
author={välimäki, vesa and bilbao, stefan},
journal={journal of the audio engineering society},
title={giant ffts for sample-rate conversion},
year={2023},
volume={71},
number={3},
pages={88-99},
doi={https://doi.org/10.17743/jaes.2022.0061},
month={march},}
@article{välimäki2023giant,
author={välimäki, vesa and bilbao, stefan},
journal={journal of the audio engineering society},
title={giant ffts for sample-rate conversion},
year={2023},
volume={71},
number={3},
pages={88-99},
doi={https://doi.org/10.17743/jaes.2022.0061},
month={march},
abstract={the audio industry uses several sample rates interchangeably, and high-quality sample-rate conversion is crucial. this paper describes a frequency-domain sample-rate conversion method that employs a single large ("giant") fast fourier transform (fft). large ffts, corresponding to the duration of a track or full-length album, are now extremely fast, with execution times on the order of a few seconds on standard commercially available hardware. the method first transforms the signal into the frequency domain, possibly using zero-padding. the key part of the technique modifies the length of the spectral buffer to change the ratio of the audio content to the nyquist limit. for up-sampling, an appropriate number of zeros is inserted between the positive and negative frequencies. in down-sampling, the spectrum is truncated. finally, the inverse fft synthesizes a time-domain signal at the new sample rate. the proposed method does not result in surviving folded spectral images, which occur in some instances with timedomain methods. however, it causes ringing at the nyquist limit, which can be suppressed by tapering the spectrum and by low-pass filtering. the proposed sample-rate conversion method is targeted to offline audio applications in which sound files need to be converted between sample rates at high quality.},}
TY - paper
TI - Giant FFTs for Sample-Rate Conversion
SP - 88
EP - 99
AU - Välimäki, Vesa
AU - Bilbao, Stefan
PY - 2023
JO - Journal of the Audio Engineering Society
IS - 3
VO - 71
VL - 71
Y1 - March 2023
TY - paper
TI - Giant FFTs for Sample-Rate Conversion
SP - 88
EP - 99
AU - Välimäki, Vesa
AU - Bilbao, Stefan
PY - 2023
JO - Journal of the Audio Engineering Society
IS - 3
VO - 71
VL - 71
Y1 - March 2023
AB - The audio industry uses several sample rates interchangeably, and high-quality sample-rate conversion is crucial. This paper describes a frequency-domain sample-rate conversion method that employs a single large ("giant") fast Fourier transform (FFT). Large FFTs, corresponding to the duration of a track or full-length album, are now extremely fast, with execution times on the order of a few seconds on standard commercially available hardware. The method first transforms the signal into the frequency domain, possibly using zero-padding. The key part of the technique modifies the length of the spectral buffer to change the ratio of the audio content to the Nyquist limit. For up-sampling, an appropriate number of zeros is inserted between the positive and negative frequencies. In down-sampling, the spectrum is truncated. Finally, the inverse FFT synthesizes a time-domain signal at the new sample rate. The proposed method does not result in surviving folded spectral images, which occur in some instances with timedomain methods. However, it causes ringing at the Nyquist limit, which can be suppressed by tapering the spectrum and by low-pass filtering. The proposed sample-rate conversion method is targeted to offline audio applications in which sound files need to be converted between sample rates at high quality.
The audio industry uses several sample rates interchangeably, and high-quality sample-rate conversion is crucial. This paper describes a frequency-domain sample-rate conversion method that employs a single large ("giant") fast Fourier transform (FFT). Large FFTs, corresponding to the duration of a track or full-length album, are now extremely fast, with execution times on the order of a few seconds on standard commercially available hardware. The method first transforms the signal into the frequency domain, possibly using zero-padding. The key part of the technique modifies the length of the spectral buffer to change the ratio of the audio content to the Nyquist limit. For up-sampling, an appropriate number of zeros is inserted between the positive and negative frequencies. In down-sampling, the spectrum is truncated. Finally, the inverse FFT synthesizes a time-domain signal at the new sample rate. The proposed method does not result in surviving folded spectral images, which occur in some instances with timedomain methods. However, it causes ringing at the Nyquist limit, which can be suppressed by tapering the spectrum and by low-pass filtering. The proposed sample-rate conversion method is targeted to offline audio applications in which sound files need to be converted between sample rates at high quality.
Open Access
Authors:
Välimäki, Vesa; Bilbao, Stefan
Affiliations:
Acoustics Laboratory, Department of Information and Communications Engineering, Aalto University, Espoo, Finland; Acoustics and Audio Group, University of Edinburgh, Edinburgh, United Kingdom(See document for exact affiliation information.) JAES Volume 71 Issue 3 pp. 88-99; March 2023
Publication Date:
March 7, 2023Import into BibTeX
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=22033