In This Section
AES Store
- Learn From The Experts:

Frank Laico "Studio Recording"- Oral History Project Gallery
- Other AES Publications
Journal Forum
Virtual Localization by Blind Persons - July 2012
1 comment
Effect of Spatial Location and Presentation Rate on the Reaction to Auditory Displays - July 2012
1 comment
Watermark-Aided Pre-Echo Reduction in Low Bit-Rate Audio Coding - June 2012
1 comment
AES E-Library
Characterizing Formant Tracks in Viennese Diphthongs for Forensic Speaker Comparison
This study evaluates methods that capture time-dynamic properties of diphthongs produced by speakers of Viennese German for application in a forensic setting. Polynomials, discrete cosine transform and B-splines along with experimental features based on bent-cable regression models were used to characterise the first three formant tracks of two /aE/ diphthong segments. The resulting coefficients were in turn used as parameters in a speaker discrimination procedure based on likelihood ratios which were calculated using a multi-variate kernel density formula (MVKD). A comparison of the achieved performance based on cross-validation is presented in terms of equal error rate (EER) and the log-likelihood ratio cost metric as well as DET plots.
Click to purchase paper or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $20 for non-members, $5 for AES members and is free for E-Library subscribers.
Learn more about the AES E-Library
Start a discussion about this paper!






