An Objective Audio Quality Measure Based on Power and Envelope Power Cues
×
Cite This
Citation & Abstract
T. Biberger, J. Fleßner, R. Huber, and ST. D.. Ewert, "An Objective Audio Quality Measure Based on Power and Envelope Power Cues," J. Audio Eng. Soc., vol. 66, no. 7/8, pp. 578-593, (2018 July.). doi: https://doi.org/10.17743/jaes.2018.0031
T. Biberger, J. Fleßner, R. Huber, and ST. D.. Ewert, "An Objective Audio Quality Measure Based on Power and Envelope Power Cues," J. Audio Eng. Soc., vol. 66 Issue 7/8 pp. 578-593, (2018 July.). doi: https://doi.org/10.17743/jaes.2018.0031
Abstract: The generalized power spectrum model (GPSM), which has been shown to account for a large number of psychoacoustic and speech intelligibility (SI) experiments, was extended to assess audio quality. Like the GPSM, the suggested audio quality model, GPSMq, combines features from the power spectrum model (PSM) and envelope power-spectrum model (EPSM). GPSMq utilizes signal-to-noise ratios (SNRs) in the power and envelope power domains to model the addition or removal of energy by the signal processing under test. Four audio quality databases that introduce linear and nonlinear distortions to music and speech signals were assessed to cover a large variety of distortions cases. GPSMq provided better overall prediction performance than other state-of-the-art auditory-model-based objective quality measures. The results demonstrate that the power and envelope power SNR metric is appropriate for predicting audio quality for a variety of signal distortions in addition to psychoacoustics and SI. This supports the notion that the auditory system extracts a universal set of auditory features to be analyzed in a task-dependent decision stage.
@article{biberger2018an,
author={biberger, thomas and fleßner, jan-hendrik and huber, rainer and ewert, stephan d.},
journal={journal of the audio engineering society},
title={an objective audio quality measure based on power and envelope power cues},
year={2018},
volume={66},
number={7/8},
pages={578-593},
doi={https://doi.org/10.17743/jaes.2018.0031},
month={july},}
@article{biberger2018an,
author={biberger, thomas and fleßner, jan-hendrik and huber, rainer and ewert, stephan d.},
journal={journal of the audio engineering society},
title={an objective audio quality measure based on power and envelope power cues},
year={2018},
volume={66},
number={7/8},
pages={578-593},
doi={https://doi.org/10.17743/jaes.2018.0031},
month={july},
abstract={the generalized power spectrum model (gpsm), which has been shown to account for a large number of psychoacoustic and speech intelligibility (si) experiments, was extended to assess audio quality. like the gpsm, the suggested audio quality model, gpsmq, combines features from the power spectrum model (psm) and envelope power-spectrum model (epsm). gpsmq utilizes signal-to-noise ratios (snrs) in the power and envelope power domains to model the addition or removal of energy by the signal processing under test. four audio quality databases that introduce linear and nonlinear distortions to music and speech signals were assessed to cover a large variety of distortions cases. gpsmq provided better overall prediction performance than other state-of-the-art auditory-model-based objective quality measures. the results demonstrate that the power and envelope power snr metric is appropriate for predicting audio quality for a variety of signal distortions in addition to psychoacoustics and si. this supports the notion that the auditory system extracts a universal set of auditory features to be analyzed in a task-dependent decision stage.},}
TY - paper
TI - An Objective Audio Quality Measure Based on Power and Envelope Power Cues
SP - 578
EP - 593
AU - Biberger, Thomas
AU - Fleßner, Jan-Hendrik
AU - Huber, Rainer
AU - Ewert, Stephan D.
PY - 2018
JO - Journal of the Audio Engineering Society
IS - 7/8
VO - 66
VL - 66
Y1 - July 2018
TY - paper
TI - An Objective Audio Quality Measure Based on Power and Envelope Power Cues
SP - 578
EP - 593
AU - Biberger, Thomas
AU - Fleßner, Jan-Hendrik
AU - Huber, Rainer
AU - Ewert, Stephan D.
PY - 2018
JO - Journal of the Audio Engineering Society
IS - 7/8
VO - 66
VL - 66
Y1 - July 2018
AB - The generalized power spectrum model (GPSM), which has been shown to account for a large number of psychoacoustic and speech intelligibility (SI) experiments, was extended to assess audio quality. Like the GPSM, the suggested audio quality model, GPSMq, combines features from the power spectrum model (PSM) and envelope power-spectrum model (EPSM). GPSMq utilizes signal-to-noise ratios (SNRs) in the power and envelope power domains to model the addition or removal of energy by the signal processing under test. Four audio quality databases that introduce linear and nonlinear distortions to music and speech signals were assessed to cover a large variety of distortions cases. GPSMq provided better overall prediction performance than other state-of-the-art auditory-model-based objective quality measures. The results demonstrate that the power and envelope power SNR metric is appropriate for predicting audio quality for a variety of signal distortions in addition to psychoacoustics and SI. This supports the notion that the auditory system extracts a universal set of auditory features to be analyzed in a task-dependent decision stage.
The generalized power spectrum model (GPSM), which has been shown to account for a large number of psychoacoustic and speech intelligibility (SI) experiments, was extended to assess audio quality. Like the GPSM, the suggested audio quality model, GPSMq, combines features from the power spectrum model (PSM) and envelope power-spectrum model (EPSM). GPSMq utilizes signal-to-noise ratios (SNRs) in the power and envelope power domains to model the addition or removal of energy by the signal processing under test. Four audio quality databases that introduce linear and nonlinear distortions to music and speech signals were assessed to cover a large variety of distortions cases. GPSMq provided better overall prediction performance than other state-of-the-art auditory-model-based objective quality measures. The results demonstrate that the power and envelope power SNR metric is appropriate for predicting audio quality for a variety of signal distortions in addition to psychoacoustics and SI. This supports the notion that the auditory system extracts a universal set of auditory features to be analyzed in a task-dependent decision stage.
Authors:
Biberger, Thomas; Fleßner, Jan-Hendrik; Huber, Rainer; Ewert, Stephan D.
Affiliations:
Medizinische Physik and Cluster of Excellence Hearing4all, Universität Oldenburg, Oldenburg, Germany; HörTech gGmbH and Cluster of Excellence Hearing4All, Oldenburg, Germany(See document for exact affiliation information.) JAES Volume 66 Issue 7/8 pp. 578-593; July 2018
Publication Date:
August 20, 2018Import into BibTeX
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=19707