Forensic Automatic Speaker Recognition with Degraded and Enhanced Speech

Künzel, Hermann J.; Alexander, Paul

AES E-Library

Forensic Automatic Speaker Recognition with Degraded and Enhanced Speech

Various types of noise and other forms of degradation in the acoustic signal are typical of speech recordings used in forensic speaker recognition. The results of this study suggest that certain speech enhancement algorithms can be a useful tool for preprocessing speech samples before attempting automated recognition. This is particularly true for additive noise such as instrumental music and noise inside of a moving car. Comparing equal-error rates of identification experiments for ten male speakers based on the original, degraded, and enhanced voice signals, the performance of the speaker recognition system was most affected by pop music in both single-channel and 2-channel recordings. In contrast, road traffic and restaurant noise do not markedly degrade recognition performance.

Authors: Künzel, Hermann J.; Alexander, Paul
Affiliations: Dept. of Phonetics, University of Marburg, Marburg, Germany; Cedar Audio Ltd., Cambridge, UK(See document for exact affiliation information.)
JAES Volume 62 Issue 4 pp. 244-253; April 2014
Publication Date: April 16, 2014 Import into BibTeX
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=17136

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Learn more about the AES E-Library

E-Library Location: (CD JAES62) /jaes62/4/pg244.pdf

DOI: https://doi.org/10.17743/jaes.2014.0014

Start a discussion about this paper!

AES E-Library

Forensic Automatic Speaker Recognition with Degraded and Enhanced Speech

ABOUT AES

Contact Us