Time Domain Extrapolative Packet Loss Concealment for MDCT Based Voice Codec
×
Cite This
Citation & Abstract
S. Huang, and X. Sun, "Time Domain Extrapolative Packet Loss Concealment for MDCT Based Voice Codec," Paper 9263, (2015 May.). doi:
S. Huang, and X. Sun, "Time Domain Extrapolative Packet Loss Concealment for MDCT Based Voice Codec," Paper 9263, (2015 May.). doi:
Abstract: A novel low latency packet loss concealment technique for transform-based codecs is proposed. The algorithm combines signals from Inverse Modulated Discrete Cosine Transform (IMDCT) domain and the previous reconstructed signal from time domain with aligned phase, with which a pitch-synchronized concealment is performed. This minimizes aliasing artifacts that occur in MDCT domain concealment for voiced speech signals. For unvoiced speech, speech-shaped comfort noise is inserted. When there is a burst loss, a position-dependent concealment process is performed for different stages of packet losses. Subjective listening tests using both naïve and expert listeners suggest that the proposed algorithm generates fewer artifacts and offers significantly better performance against legacy packet repetition based approaches.
@article{huang2015time,
author={huang, shen and sun, xuejing},
journal={journal of the audio engineering society},
title={time domain extrapolative packet loss concealment for mdct based voice codec},
year={2015},
volume={},
number={},
pages={},
doi={},
month={may},}
@article{huang2015time,
author={huang, shen and sun, xuejing},
journal={journal of the audio engineering society},
title={time domain extrapolative packet loss concealment for mdct based voice codec},
year={2015},
volume={},
number={},
pages={},
doi={},
month={may},
abstract={a novel low latency packet loss concealment technique for transform-based codecs is proposed. the algorithm combines signals from inverse modulated discrete cosine transform (imdct) domain and the previous reconstructed signal from time domain with aligned phase, with which a pitch-synchronized concealment is performed. this minimizes aliasing artifacts that occur in mdct domain concealment for voiced speech signals. for unvoiced speech, speech-shaped comfort noise is inserted. when there is a burst loss, a position-dependent concealment process is performed for different stages of packet losses. subjective listening tests using both naïve and expert listeners suggest that the proposed algorithm generates fewer artifacts and offers significantly better performance against legacy packet repetition based approaches.},}
TY - paper
TI - Time Domain Extrapolative Packet Loss Concealment for MDCT Based Voice Codec
SP -
EP -
AU - Huang, Shen
AU - Sun, Xuejing
PY - 2015
JO - Journal of the Audio Engineering Society
IS -
VO -
VL -
Y1 - May 2015
TY - paper
TI - Time Domain Extrapolative Packet Loss Concealment for MDCT Based Voice Codec
SP -
EP -
AU - Huang, Shen
AU - Sun, Xuejing
PY - 2015
JO - Journal of the Audio Engineering Society
IS -
VO -
VL -
Y1 - May 2015
AB - A novel low latency packet loss concealment technique for transform-based codecs is proposed. The algorithm combines signals from Inverse Modulated Discrete Cosine Transform (IMDCT) domain and the previous reconstructed signal from time domain with aligned phase, with which a pitch-synchronized concealment is performed. This minimizes aliasing artifacts that occur in MDCT domain concealment for voiced speech signals. For unvoiced speech, speech-shaped comfort noise is inserted. When there is a burst loss, a position-dependent concealment process is performed for different stages of packet losses. Subjective listening tests using both naïve and expert listeners suggest that the proposed algorithm generates fewer artifacts and offers significantly better performance against legacy packet repetition based approaches.
A novel low latency packet loss concealment technique for transform-based codecs is proposed. The algorithm combines signals from Inverse Modulated Discrete Cosine Transform (IMDCT) domain and the previous reconstructed signal from time domain with aligned phase, with which a pitch-synchronized concealment is performed. This minimizes aliasing artifacts that occur in MDCT domain concealment for voiced speech signals. For unvoiced speech, speech-shaped comfort noise is inserted. When there is a burst loss, a position-dependent concealment process is performed for different stages of packet losses. Subjective listening tests using both naïve and expert listeners suggest that the proposed algorithm generates fewer artifacts and offers significantly better performance against legacy packet repetition based approaches.
Authors:
Huang, Shen; Sun, Xuejing
Affiliation:
Dolby Laboratories, Beijing, China
AES Convention:
138 (May 2015)
Paper Number:
9263
Publication Date:
May 6, 2015Import into BibTeX
Subject:
Audio Signal Processing
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=17687