A careful evaluation of listening tests designed to measure audio quality shows that they are vulnerable to systematic errors, which include biases due to affective judgments, response mapping bias, and interface bias. As a result of factors such as personal preferences, the appearance of the equipment, and the listeners' expectations or mood, errors can range up to 40% with respect to the total range of the scale. As a general conclusion, test results should be considered relative, rather than absolute. Scales in previous studies, which have been assumed to be linear, may exhibit departure from linearity. The visual appearance of the user interface may lead to severe quantization of the distribution of scores. Recommendations are offered to improve audio quality tests.
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.