AES Rome 2013
Poster Session P11
P11 - Perception and Education
Sunday, May 5, 15:30 — 17:00 (Foyer)
P11-1 Improve the Listening Ability Using E-Learning Methods—Bartlomiej Kruk, Wroclaw University of Technology - Wroclaw, Poland; Bartosz Zawieja, Wroclaw University of Technology - Wroclaw, Poland
The main aim of this paper is to show the possibility of listening training using the methods of e-learning combined with the classical method of teaching. The described technique uses all available electronic media as well as traditional teaching methods. Theory and practice examples are discussed. However, e-learning allows the students to work independently and provide tests at a convenient place and time. The e-learning exercises have been designed to develop skills including memorization, timbre description, and improving hearing sensitivity for changes in sound.
Convention Paper 8870 (Purchase now)
P11-2 Optimizing Teaching Room Acoustics: Investigating the Exclusive Use of a Distributed Electroacoustic Installation to Improve the Speech Intelligibility—Panagiotis Hatziantoniou, University of Patras - Patras, Greece; Nicolas Tatlas, Technological Educational Institute of Piraeus - Athens, Greece; Stelios M. Potirakis, Technological Educational Institute of Piraeus - Athens, Greece
The possibility to improve speech intelligibility in classrooms of inadequate acoustic design, exclusively by using an electroacoustic installation is investigated in this paper. Measurement results derived from an overall six-loudspeaker arrangement response in different locations inside a test classroom are compared to those derived from a one-loudspeaker electroacoustic response in the speaker’s location. Preliminary results indicate that well-established parameters such as C-50 and D-50 automatically calculated, show no significant improvement. Extended investigation of the responses as well as other criteria such as direct to reverb ratios (DRR) shown that there is noteworthy enhancement in addition to the subjective acoustic perception. Moreover, DRR is shown to further improve from the employment of a room correction technique based on smoothed responses.
Convention Paper 8871 (Purchase now)
P11-3 Software Techniques for Good Practice in Audio and Music Research—Luis Figueira, Queen Mary University of London - London, UK; Chris Cannam, Queen Mary University of London - London, UK; Mark Plumbley, Queen Mary University of London - London, UK
In this paper we discuss how software development can be improved in the audio and music research community by implementing tighter and more effective development feedback loops. We suggest first that researchers in an academic environment can benefit from the straightforward application of peer code review, even for ad-hoc research software; and second, that researchers should adopt automated software unit testing from the start of research projects. We discuss and illustrate how to adopt both code reviews and unit testing in a research environment. Finally, we observe that the use of a software version control system provides support for the foundations of both code reviews and automated unit tests. We therefore also propose that researchers should use version control with all their projects from the earliest stage.
Convention Paper 8872 (Purchase now)
P11-4 A Practical Step-by-Step Guide to the Time-Varying Loudness Model of Moore, Glasberg, and Baer (1997; 2002)—Andrew J. R. Simpson, Queen Mary University of London - London, UK; Michael J. Terrell, Queen Mary University of London - London, UK; Joshua D. Reiss, Queen Mary University of London - London, UK
In this tutorial article we provide a condensed, practical step-by-step guide to the excitation pattern loudness model of Moore, Glasberg, and Baer [J. Audio Eng. Soc., vol. 45, 224–240 (1997 Apr.); J. Audio Eng. Soc., vol. 50, 331–342 (2002 May)]. The various components of this model have been separately described in the well-known publications of Patterson et al. [J. Acoust. Soc. Am., vol. 72, 1788–1803 (1982)], Moore [Hearing, 161-205 (Academic Press 1995)], Moore et al. (1997), and Glasberg and Moore (2002). This paper provides a consolidated and concise introduction to the complete model for those who find the disparate and complex references intimidating and who wish to understand the function of each of the component parts. Furthermore, we provide a consolidated notation and integral forms. This introduction may be useful to the loudness theory beginner and to those who wish to adapt and apply the model for novel, practical purposes.
Convention Paper 8873 (Purchase now)
P11-5 Subjective Evaluation of Sound Quality of Musical Recordings Transmitted via the DAB+ System—Maurycy Kin, Wroclaw University of Technology - Wroclaw, Poland
The results of research on the sound quality of various kinds of music transmitted via Digital Audio Broadcasting using Absolute Category Rating and Comparison Category Rating methods of scaling are presented. The results showed that bit-rate values influence significantly the results. A Spectral Band Replication processing of signals increases the sound quality higher for low bit-rates than for higher values, dependently on a kind of music. The spatial attributes of sound, as a perspective, spaciousness, localization stability and an accuracy of phantom source, also are dependent on the bit-rate, but these relations are different. It was also found that a method of evaluation gives different results, and a CCR method is more accurate for sound assessment for higher bit-rates.
Convention Paper 8874 (Purchase now)
P11-6 Music and Emotions: A Comparison of Measurement Methods—Judith Liebetrau, Fraunhofer IDMT - Ilmenau, Germany; Ilmenau University of Technology - Ilmenau, Germany; Sebastian Schneider, Ilmenau University of Technology - Ilmenau, Germany
Music emotion recognition (MER) as a part of music information retrieval (MIR), examines the question which parts of music evoke what emotions and how can they be automatically classified. Classification systems need to be trained in terms of feature selection and prediction. Due to the subjectivity of emotions, the generation of appropriate ground truth data poses challenges for MER. This paper describes obstacles of defining and measuring emotions evoked by music. Two methods, in principle able to overcome problems in measuring affective states induced by music, are outlined and their results are compared. Although the results of both methods are in line with psychological theories of emotions, the question remains how good the perceived emotions are captured by either method and if these methods are sufficient for ground truth generation.
Convention Paper 8875 (Purchase now)
P11-7 Multidimensional Scaling Analysis Applied to Music Mood Recognition—Magdalena Plewa, Gdansk University of Technology - Gdansk, Poland; Bozena Kostek, Gdansk University of Technology - Gdansk, Poland
The paper presents two experiments aimed at categorizing mood associated with music. Two parts of a listening test were designed and carried out with a group of students, most of whom where users of online social music services. The initial experiment was designed to evaluate the extent to which a given label describes the mood of the particular music excerpt. The second subjective test was conducted to collect the similarity data for the MDS (Multidimensional Scaling) analysis. Results were subject of various MDS and correlation analysis. Obtained MDS representation is relevant and remains coherent with acclaimed 2-dimensional Thayer’s model as well as with evaluation using six mood labels.
Convention Paper 8876 (Purchase now)
P11-8 Artificial Stereo Extension Based on Gaussian Mixture Model—Nam In Park, Gwangju Institute of Science and Technology (GIST) - Gwangju, Korea; Kwang Myung Jeon, Gwangju Institute of Science and Technology (GIST) - Gwangju, Korea; Chan Jun Chun, Gwangju Institute of Science and Technology (GIST) - Gwangju, Korea; Hong Kook Kim, Gwangju Institute of Science and Tech (GIST) - Gwangju, Korea
In this paper an artificial stereo extension method is proposed to provide the stereophonic sound. The proposed method employs a minimum mean squared error (MMSE) estimator based on a Gaussian mixture model (GMM) to produce stereo signals from a mono signal. The performance of the proposed stereo extension method is evaluated using a multiple stimuli with a hidden reference and anchor (MUSHRA) test and compared with that of the parametric stereo method. It is shown from the test that the mean opinion score of the signals extended by the proposed stereo extension method is around 5% higher than that of the conventional stereo extension method based on inter-channel coherence (ICC).
Convention Paper 8877 (Purchase now)