Preferred Levels for Background Ducking to Produce Esthetically Pleasing Audio for TV with Clear Speech
Citation & Abstract
M. Torcoli, A. Freke-Morin, J. Paulus, C. Simon, and B. Shirley, "Preferred Levels for Background Ducking to Produce Esthetically Pleasing Audio for TV with Clear Speech," J. Audio Eng. Soc., vol. 67, no. 12, pp. 1003-1011 (December 2019). doi: https://doi.org/10.17743/jaes.2019.0052
Abstract: In audio production, background ducking facilitates speech intelligibility while allowing the background to fulfill its purpose, e.g., to create ambiance, set the mood, or convey semantic cues. Technical details for recommended ducking practices are not currently documented in the literature. This report first analyzes the common practices found in TV documentaries, and it then describes a listening test that investigated the preferences of 22 normal-hearing participants on the Loudness Difference (LD) between commentary and background during ducking. Highly personal preferences were observed, highlighting the importance of object-based personalization. A statistically significant difference was found between nonexpert and expert listeners. On average, nonexperts preferred LDs that were 4 LU higher than the ones preferred by experts. A statistically significant difference was also found between Commentary over Music (CoM) and Commentary over Ambiance (CoA). Based on the test results, the authors recommend at least 10 LU difference for CoM and at least 15 LU for CoA. Moreover, a computational method based on the Binaural Distortion-Weighted Glimpse Proportion (BiDWGP) was found to match the median preferred LD for each item with good accuracy.
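The recommended minimum LDs can be turned into a simple check on a mix. The following is a minimal sketch, not the authors' method: it assumes that LD is the commentary loudness minus the background loudness, that both are measured in LUFS over the ducked passage, and that attenuating the background by 1 dB lowers its loudness by about 1 LU (gating effects are ignored). The function names and the dictionary keys are hypothetical.

```python
# Minimum Loudness Difference (LD) in LU, as recommended in the abstract:
# at least 10 LU for Commentary over Music, at least 15 LU for
# Commentary over Ambiance. The keys are illustrative names.
MIN_LD_LU = {"music": 10.0, "ambiance": 15.0}


def loudness_difference(commentary_lufs: float, background_lufs: float) -> float:
    """Return the LD in LU, assumed here to be commentary minus background."""
    return commentary_lufs - background_lufs


def ducking_attenuation_db(
    commentary_lufs: float, background_lufs: float, background_type: str
) -> float:
    """Return the extra background attenuation (dB) needed to reach the
    recommended minimum LD, or 0.0 if the mix already meets it.

    Assumes 1 dB of attenuation reduces loudness by 1 LU.
    """
    target_ld = MIN_LD_LU[background_type]
    shortfall = target_ld - loudness_difference(commentary_lufs, background_lufs)
    return max(0.0, shortfall)


if __name__ == "__main__":
    # Commentary at -23 LUFS over music at -28 LUFS: the LD is 5 LU,
    # so the music needs a further 5 dB of attenuation to reach 10 LU.
    print(ducking_attenuation_db(-23.0, -28.0, "music"))
```

These values are lower bounds on average preference only. The abstract reports highly personal preferences, and nonexperts preferred LDs about 4 LU higher than experts, so a personalized system would expose the target LD as a user-adjustable setting rather than a constant.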
@article{torcoli2019preferred,
author={Torcoli, Matteo and Freke-Morin, Alex and Paulus, Jouni and Simon, Christian and Shirley, Ben},
journal={Journal of the Audio Engineering Society},
title={Preferred Levels for Background Ducking to Produce Esthetically Pleasing Audio for {TV} with Clear Speech},
year={2019},
volume={67},
number={12},
pages={1003-1011},
doi={https://doi.org/10.17743/jaes.2019.0052},
month={december},}
TY - paper
TI - Preferred Levels for Background Ducking to Produce Esthetically Pleasing Audio for TV with Clear Speech
SP - 1003
EP - 1011
AU - Torcoli, Matteo
AU - Freke-Morin, Alex
AU - Paulus, Jouni
AU - Simon, Christian
AU - Shirley, Ben
PY - 2019
JO - Journal of the Audio Engineering Society
IS - 12
VO - 67
VL - 67
Y1 - December 2019
Open Access
Authors:
Torcoli, Matteo; Freke-Morin, Alex; Paulus, Jouni; Simon, Christian; Shirley, Ben
Affiliations:
Fraunhofer Institute for Integrated Circuits IIS, Erlangen, Germany; Acoustics Research Centre, University of Salford, UK; International Audio Laboratories Erlangen, Germany, a joint institution of Universität Erlangen-Nürnberg and Fraunhofer IIS (See document for exact affiliation information.)
JAES Volume 67 Issue 12 pp. 1003-1011; December 2019
Publication Date:
December 30, 2019
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=20711