Perceptual Objective Listening Quality Assessment (POLQA), The Third Generation ITU-T Standard for End-to-End Speech Quality Measurement Part II—Perceptual Model
×
Cite This
Citation & Abstract
JO. G.. Beerends, C. Schmidmer, J. Berger, M. Obermann, R. Ullmann, J. Pomy, and M. Keyhl, "Perceptual Objective Listening Quality Assessment (POLQA), The Third Generation ITU-T Standard for End-to-End Speech Quality Measurement Part II—Perceptual Model," J. Audio Eng. Soc., vol. 61, no. 6, pp. 385-402, (2013 June.). doi:
JO. G.. Beerends, C. Schmidmer, J. Berger, M. Obermann, R. Ullmann, J. Pomy, and M. Keyhl, "Perceptual Objective Listening Quality Assessment (POLQA), The Third Generation ITU-T Standard for End-to-End Speech Quality Measurement Part II—Perceptual Model," J. Audio Eng. Soc., vol. 61 Issue 6 pp. 385-402, (2013 June.). doi:
Abstract: In this and the companion paper Part I, the authors present the Perceptual Objective Listening Quality Assessment (POLQA), the third-generation speech quality measurement algorithm, standardized by the International Telecommunication Union in 2011 as Recommendation P.863. This paper describes the newly developed perceptual model of this standard, allowing to assess speech quality over a wide range of distortions, from “High Definition” super-wideband speech (HD Voice, audio bandwidth up to 14 kHz) to extremely distorted narrowband telephony speech (audio bandwidth down to 2 kHz), using sample rates between 48 and 8 kHz. POLQA is suited for distortions that are outside the scope of PESQ, such as linear frequency response distortions, super-wideband degradations, time stretching/compression as found in Voice-over-IP, certain types of codec distortions, reverberations, and the impact of playback volume. Part II outlines the core elements of the underlying perceptual model and presents the final results.
@article{beerends2013perceptual,
author={beerends, john g. and schmidmer, christian and berger, jens and obermann, matthias and ullmann, raphael and pomy, joachim and keyhl, michael},
journal={journal of the audio engineering society},
title={perceptual objective listening quality assessment (polqa), the third generation itu-t standard for end-to-end speech quality measurement part ii—perceptual model},
year={2013},
volume={61},
number={6},
pages={385-402},
doi={},
month={june},}
@article{beerends2013perceptual,
author={beerends, john g. and schmidmer, christian and berger, jens and obermann, matthias and ullmann, raphael and pomy, joachim and keyhl, michael},
journal={journal of the audio engineering society},
title={perceptual objective listening quality assessment (polqa), the third generation itu-t standard for end-to-end speech quality measurement part ii—perceptual model},
year={2013},
volume={61},
number={6},
pages={385-402},
doi={},
month={june},
abstract={in this and the companion paper part i, the authors present the perceptual objective listening quality assessment (polqa), the third-generation speech quality measurement algorithm, standardized by the international telecommunication union in 2011 as recommendation p.863. this paper describes the newly developed perceptual model of this standard, allowing to assess speech quality over a wide range of distortions, from “high definition” super-wideband speech (hd voice, audio bandwidth up to 14 khz) to extremely distorted narrowband telephony speech (audio bandwidth down to 2 khz), using sample rates between 48 and 8 khz. polqa is suited for distortions that are outside the scope of pesq, such as linear frequency response distortions, super-wideband degradations, time stretching/compression as found in voice-over-ip, certain types of codec distortions, reverberations, and the impact of playback volume. part ii outlines the core elements of the underlying perceptual model and presents the final results.},}
TY - paper
TI - Perceptual Objective Listening Quality Assessment (POLQA), The Third Generation ITU-T Standard for End-to-End Speech Quality Measurement Part II—Perceptual Model
SP - 385
EP - 402
AU - Beerends, John G.
AU - Schmidmer, Christian
AU - Berger, Jens
AU - Obermann, Matthias
AU - Ullmann, Raphael
AU - Pomy, Joachim
AU - Keyhl, Michael
PY - 2013
JO - Journal of the Audio Engineering Society
IS - 6
VO - 61
VL - 61
Y1 - June 2013
TY - paper
TI - Perceptual Objective Listening Quality Assessment (POLQA), The Third Generation ITU-T Standard for End-to-End Speech Quality Measurement Part II—Perceptual Model
SP - 385
EP - 402
AU - Beerends, John G.
AU - Schmidmer, Christian
AU - Berger, Jens
AU - Obermann, Matthias
AU - Ullmann, Raphael
AU - Pomy, Joachim
AU - Keyhl, Michael
PY - 2013
JO - Journal of the Audio Engineering Society
IS - 6
VO - 61
VL - 61
Y1 - June 2013
AB - In this and the companion paper Part I, the authors present the Perceptual Objective Listening Quality Assessment (POLQA), the third-generation speech quality measurement algorithm, standardized by the International Telecommunication Union in 2011 as Recommendation P.863. This paper describes the newly developed perceptual model of this standard, allowing to assess speech quality over a wide range of distortions, from “High Definition” super-wideband speech (HD Voice, audio bandwidth up to 14 kHz) to extremely distorted narrowband telephony speech (audio bandwidth down to 2 kHz), using sample rates between 48 and 8 kHz. POLQA is suited for distortions that are outside the scope of PESQ, such as linear frequency response distortions, super-wideband degradations, time stretching/compression as found in Voice-over-IP, certain types of codec distortions, reverberations, and the impact of playback volume. Part II outlines the core elements of the underlying perceptual model and presents the final results.
In this and the companion paper Part I, the authors present the Perceptual Objective Listening Quality Assessment (POLQA), the third-generation speech quality measurement algorithm, standardized by the International Telecommunication Union in 2011 as Recommendation P.863. This paper describes the newly developed perceptual model of this standard, allowing to assess speech quality over a wide range of distortions, from “High Definition” super-wideband speech (HD Voice, audio bandwidth up to 14 kHz) to extremely distorted narrowband telephony speech (audio bandwidth down to 2 kHz), using sample rates between 48 and 8 kHz. POLQA is suited for distortions that are outside the scope of PESQ, such as linear frequency response distortions, super-wideband degradations, time stretching/compression as found in Voice-over-IP, certain types of codec distortions, reverberations, and the impact of playback volume. Part II outlines the core elements of the underlying perceptual model and presents the final results.
Open Access
Authors:
Beerends, John G.; Schmidmer, Christian; Berger, Jens; Obermann, Matthias; Ullmann, Raphael; Pomy, Joachim; Keyhl, Michael
Affiliations:
TNO, Delft, The Netherlands; OPTICOM GmbH, Erlangen, Germany; SwissQual AG, Zuchwil, Switzerland(See document for exact affiliation information.) JAES Volume 61 Issue 6 pp. 385-402; June 2013
Publication Date:
July 8, 2013Import into BibTeX
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=16830