Characterizing Formant Tracks in Viennese Diphthongs for Forensic Speaker Comparison

Enzinger, Ewald

AES E-Library

This study evaluates methods for capturing the time-dynamic properties of diphthongs produced by speakers of Viennese German, with a view to forensic application. Polynomials, the discrete cosine transform (DCT), and B-splines, along with experimental features based on bent-cable regression models, were used to characterise the first three formant tracks of two /aE/ diphthong segments. The resulting coefficients were then used as parameters in a speaker discrimination procedure based on likelihood ratios, which were calculated using the multivariate kernel density (MVKD) formula. Performance, assessed by cross-validation, is compared in terms of equal error rate (EER), the log-likelihood-ratio cost metric, and DET plots.
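As a rough illustration of two ideas named in the abstract, the sketch below reduces a formant track to a few discrete cosine transform coefficients and computes the log-likelihood-ratio cost for a set of likelihood ratios. It is a minimal sketch, not the paper's implementation: the function names `dct_coefficients` and `cllr`, the choice of three coefficients, the orthonormal DCT-II scaling, and the synthetic F2 values are all assumptions made for illustration. The MVKD likelihood-ratio computation itself is not shown.

```python
import math


def dct_coefficients(track, n_coeffs):
    """Return the first n_coeffs orthonormal DCT-II coefficients of a track.

    `track` is a sequence of formant values (e.g. in Hz) sampled at
    equally spaced points across the diphthong (an assumption here).
    """
    n = len(track)
    coeffs = []
    for k in range(n_coeffs):
        # Orthonormal scaling: the k = 0 term is scaled differently.
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        total = sum(
            x * math.cos(math.pi * (i + 0.5) * k / n)
            for i, x in enumerate(track)
        )
        coeffs.append(scale * total)
    return coeffs


def cllr(same_speaker_lrs, different_speaker_lrs):
    """Log-likelihood-ratio cost for two lists of likelihood ratios.

    Large LRs on same-speaker trials and small LRs on different-speaker
    trials give a low cost; a system that always outputs LR = 1 scores 1.
    """
    ss = sum(math.log2(1.0 + 1.0 / lr) for lr in same_speaker_lrs)
    ds = sum(math.log2(1.0 + lr) for lr in different_speaker_lrs)
    return 0.5 * (ss / len(same_speaker_lrs) + ds / len(different_speaker_lrs))


# Synthetic, made-up F2 track (Hz) rising linearly over ten samples.
f2_track = [1200.0 + 60.0 * i for i in range(10)]
features = dct_coefficients(f2_track, 3)

# Made-up likelihood ratios, purely to exercise the function.
example_cost = cllr([8.0, 20.0, 0.5], [0.1, 0.02, 2.0])
```

Here `features[0]` reflects the overall level of the track and `features[1]` its overall slope, which is why a small number of coefficients can summarise a smooth formant trajectory compactly.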

Author: Enzinger, Ewald
Affiliation: Acoustics Research Institute, Austrian Academy of Sciences, Vienna, Austria
AES Conference: 39th International Conference: Audio Forensics: Practices and Challenges (June 2010)
Paper Number: 2-2
Publication Date: June 17, 2010
Subject: Speech and Forensics - Voice Identification
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=15488

E-Library Location: (CD 39thPapers) /39/aes39-000015.pdf
