Bandwidth Extension Method Based on Generative Adversarial Nets for Audio Compression
×
Cite This
Citation & Abstract
Q. Huang, X. Wu, and T. Qu, "Bandwidth Extension Method Based on Generative Adversarial Nets for Audio Compression," Paper 9954, (2018 May.). doi:
Q. Huang, X. Wu, and T. Qu, "Bandwidth Extension Method Based on Generative Adversarial Nets for Audio Compression," Paper 9954, (2018 May.). doi:
Abstract: The compression ratio of core-encoder can be improved significantly by reducing the bandwidth of the audio signal, resulting in the poor listening perception. This paper proposes a bandwidth extension method based on generative adversarial nets (GAN) for extending the bandwidth of an audio signal, to create a more natural sound. The method uses GAN as a generative model to fit the distribution of the MDCT coefficients of the audio signals in the high-frequency components. Through minimax two-player gaming, more natural high-frequency information can be estimated. On this basis, a codec system is built up. To evaluate the proposed bandwidth extension system the MUSHRA experiments were carried on and the results show that there is comparable performance with HE-AAC.
@article{huang2018bandwidth,
author={huang, qingbo and wu, xihong and qu, tianshu},
journal={journal of the audio engineering society},
title={bandwidth extension method based on generative adversarial nets for audio compression},
year={2018},
volume={},
number={},
pages={},
doi={},
month={may},}
@article{huang2018bandwidth,
author={huang, qingbo and wu, xihong and qu, tianshu},
journal={journal of the audio engineering society},
title={bandwidth extension method based on generative adversarial nets for audio compression},
year={2018},
volume={},
number={},
pages={},
doi={},
month={may},
abstract={the compression ratio of core-encoder can be improved significantly by reducing the bandwidth of the audio signal, resulting in the poor listening perception. this paper proposes a bandwidth extension method based on generative adversarial nets (gan) for extending the bandwidth of an audio signal, to create a more natural sound. the method uses gan as a generative model to fit the distribution of the mdct coefficients of the audio signals in the high-frequency components. through minimax two-player gaming, more natural high-frequency information can be estimated. on this basis, a codec system is built up. to evaluate the proposed bandwidth extension system the mushra experiments were carried on and the results show that there is comparable performance with he-aac.},}
TY - paper
TI - Bandwidth Extension Method Based on Generative Adversarial Nets for Audio Compression
SP -
EP -
AU - Huang, Qingbo
AU - Wu, Xihong
AU - Qu, Tianshu
PY - 2018
JO - Journal of the Audio Engineering Society
IS -
VO -
VL -
Y1 - May 2018
TY - paper
TI - Bandwidth Extension Method Based on Generative Adversarial Nets for Audio Compression
SP -
EP -
AU - Huang, Qingbo
AU - Wu, Xihong
AU - Qu, Tianshu
PY - 2018
JO - Journal of the Audio Engineering Society
IS -
VO -
VL -
Y1 - May 2018
AB - The compression ratio of core-encoder can be improved significantly by reducing the bandwidth of the audio signal, resulting in the poor listening perception. This paper proposes a bandwidth extension method based on generative adversarial nets (GAN) for extending the bandwidth of an audio signal, to create a more natural sound. The method uses GAN as a generative model to fit the distribution of the MDCT coefficients of the audio signals in the high-frequency components. Through minimax two-player gaming, more natural high-frequency information can be estimated. On this basis, a codec system is built up. To evaluate the proposed bandwidth extension system the MUSHRA experiments were carried on and the results show that there is comparable performance with HE-AAC.
The compression ratio of core-encoder can be improved significantly by reducing the bandwidth of the audio signal, resulting in the poor listening perception. This paper proposes a bandwidth extension method based on generative adversarial nets (GAN) for extending the bandwidth of an audio signal, to create a more natural sound. The method uses GAN as a generative model to fit the distribution of the MDCT coefficients of the audio signals in the high-frequency components. Through minimax two-player gaming, more natural high-frequency information can be estimated. On this basis, a codec system is built up. To evaluate the proposed bandwidth extension system the MUSHRA experiments were carried on and the results show that there is comparable performance with HE-AAC.
Authors:
Huang, Qingbo; Wu, Xihong; Qu, Tianshu
Affiliation:
Peking University, Beijing, China
AES Convention:
144 (May 2018)
Paper Number:
9954
Publication Date:
May 14, 2018Import into BibTeX
Subject:
Audio Coding, Analysis, and Synthesis
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=19471