Near-Field Evaluation of Reproducible Speech Sources
×
Cite This
Citation & Abstract
R. Gonzalez, T. Mckenzie, A. Politis, and T. Lokki, "Near-Field Evaluation of Reproducible Speech Sources," J. Audio Eng. Soc., vol. 70, no. 7/8, pp. 621-633, (2022 July.). doi:
R. Gonzalez, T. Mckenzie, A. Politis, and T. Lokki, "Near-Field Evaluation of Reproducible Speech Sources," J. Audio Eng. Soc., vol. 70 Issue 7/8 pp. 621-633, (2022 July.). doi:
Abstract: The spatial speech reproduction capabilities of a KEMAR mouth simulator, a loudspeaker, the piston on the sphere model, and a circular harmonic fitting are evaluated in the near-field. The speech directivity of 24 human subjects, both male and female, is measured using a semicircular microphone array with a radius of 36.5 cm in the horizontal plane. Impulse responses are captured for the two devices, and filters are generated for the two numerical models to emulate their directional effect on speech reproduction. The four repeatable speech sources are evaluated through comparison to the recorded human speech both objectively, through directivity pattern and spectral magnitude differences, and subjectively, through a listening test on perceived coloration. Results show that the repeatable sources perform relatively well under the metric of directivity, but irregularities in their directivity patterns introduce audible coloration for off-axis directions.
@article{gonzalez2022near-field,
author={gonzalez, raimundo and mckenzie, thomas and politis, archontis and lokki, tapio},
journal={journal of the audio engineering society},
title={near-field evaluation of reproducible speech sources},
year={2022},
volume={70},
number={7/8},
pages={621-633},
doi={},
month={july},}
@article{gonzalez2022near-field,
author={gonzalez, raimundo and mckenzie, thomas and politis, archontis and lokki, tapio},
journal={journal of the audio engineering society},
title={near-field evaluation of reproducible speech sources},
year={2022},
volume={70},
number={7/8},
pages={621-633},
doi={},
month={july},
abstract={the spatial speech reproduction capabilities of a kemar mouth simulator, a loudspeaker, the piston on the sphere model, and a circular harmonic fitting are evaluated in the near-field. the speech directivity of 24 human subjects, both male and female, is measured using a semicircular microphone array with a radius of 36.5 cm in the horizontal plane. impulse responses are captured for the two devices, and filters are generated for the two numerical models to emulate their directional effect on speech reproduction. the four repeatable speech sources are evaluated through comparison to the recorded human speech both objectively, through directivity pattern and spectral magnitude differences, and subjectively, through a listening test on perceived coloration. results show that the repeatable sources perform relatively well under the metric of directivity, but irregularities in their directivity patterns introduce audible coloration for off-axis directions.},}
TY - report
TI - Near-Field Evaluation of Reproducible Speech Sources
SP - 621
EP - 633
AU - Gonzalez, Raimundo
AU - Mckenzie, Thomas
AU - Politis, Archontis
AU - Lokki, Tapio
PY - 2022
JO - Journal of the Audio Engineering Society
IS - 7/8
VO - 70
VL - 70
Y1 - July 2022
TY - report
TI - Near-Field Evaluation of Reproducible Speech Sources
SP - 621
EP - 633
AU - Gonzalez, Raimundo
AU - Mckenzie, Thomas
AU - Politis, Archontis
AU - Lokki, Tapio
PY - 2022
JO - Journal of the Audio Engineering Society
IS - 7/8
VO - 70
VL - 70
Y1 - July 2022
AB - The spatial speech reproduction capabilities of a KEMAR mouth simulator, a loudspeaker, the piston on the sphere model, and a circular harmonic fitting are evaluated in the near-field. The speech directivity of 24 human subjects, both male and female, is measured using a semicircular microphone array with a radius of 36.5 cm in the horizontal plane. Impulse responses are captured for the two devices, and filters are generated for the two numerical models to emulate their directional effect on speech reproduction. The four repeatable speech sources are evaluated through comparison to the recorded human speech both objectively, through directivity pattern and spectral magnitude differences, and subjectively, through a listening test on perceived coloration. Results show that the repeatable sources perform relatively well under the metric of directivity, but irregularities in their directivity patterns introduce audible coloration for off-axis directions.
The spatial speech reproduction capabilities of a KEMAR mouth simulator, a loudspeaker, the piston on the sphere model, and a circular harmonic fitting are evaluated in the near-field. The speech directivity of 24 human subjects, both male and female, is measured using a semicircular microphone array with a radius of 36.5 cm in the horizontal plane. Impulse responses are captured for the two devices, and filters are generated for the two numerical models to emulate their directional effect on speech reproduction. The four repeatable speech sources are evaluated through comparison to the recorded human speech both objectively, through directivity pattern and spectral magnitude differences, and subjectively, through a listening test on perceived coloration. Results show that the repeatable sources perform relatively well under the metric of directivity, but irregularities in their directivity patterns introduce audible coloration for off-axis directions.
Authors:
Gonzalez, Raimundo; Mckenzie, Thomas; Politis, Archontis; Lokki, Tapio
Affiliations:
Acoustics Lab, Department of Signal Processing & Acoustics, Aalto University, Espoo, Finland; Acoustics Lab, Department of Signal Processing & Acoustics, Aalto University, Espoo, Finland; Audio & Speech Processing Group, Tampere University of Technology, Tampere, Finland; Acoustics Lab, Department of Signal Processing & Acoustics, Aalto University, Espoo, Finland.(See document for exact affiliation information.) JAES Volume 70 Issue 7/8 pp. 621-633; July 2022
Publication Date:
July 19, 2022Import into BibTeX
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=21828