Semantic Speech Tagging: Towards Combined Analysis of Speaker Traits

Schuller, Björn; Wöllmer, Martin; Eyben, Florian; Rigoll, Gerhard; Arsic, Dejan

AES E-Library

Semantic Speech Tagging: Towards Combined Analysis of Speaker Traits

A number of paralinguistic problems are often dealt with in isolation, such as emotion, health state or personality. However, there are also good examples of mutual benefit, mostly incorporating speaker gender knowledge. In this paper we deal with the question how further paralinguistic information, such as speaker age, height, or race can provide beneficial information when their ground truth knowledge is provided within single-task speaker classification. Tests with openSMILE's 1.5 k Paralinguistic Challenge Feature set on the TIMIT corpus of 630 speakers reveal significant boost in accuracy or cross-correlation|depending on the representation form of the problem at hand.

Authors: Schuller, Björn; Wöllmer, Martin; Eyben, Florian; Rigoll, Gerhard; Arsic, Dejan
Affiliations: Müller-BBM Vibroakustiksysteme, Planegg, Germany; Technische Universität München, Munich, Germany(See document for exact affiliation information.)
AES Conference: 42nd International Conference: Semantic Audio (July 2011)
Paper Number: 2-1
Publication Date: July 22, 2011 Import into BibTeX
Subject: Speech Processing and Analysis
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=15947

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Learn more about the AES E-Library

E-Library Location: (CD 42ndPapers) /conf/42/aes42-000026.pdf

Start a discussion about this paper!

AES E-Library

Semantic Speech Tagging: Towards Combined Analysis of Speaker Traits

ABOUT AES

Contact Us