Evaluating the Influence of Source Separation Methods in Robust Automatic Speech Recognition with a Specific Cocktail-Party Training

Marti, Amparo; Cobos, Máximo; Lopez, José J.

AES E-Library

Automatic Speech Recognition (ASR) allows a computer to identify the words that a person speaks into a microphone and convert them to written text. One of the most challenging situations for ASR is the cocktail-party environment. Although source separation methods have already been investigated to deal with this problem, the separation process is not perfect, and when the separation methods are based on time-frequency masks the resulting artifacts pose an additional problem for ASR performance. Recently, the authors proposed a specific training method to deal with simultaneous speech situations in practical ASR systems. In this paper, we study how speech recognition performance is affected by selecting different combinations of separation algorithms at both the training and test stages of the ASR system under different acoustic conditions. The results show that, while different separation methods produce different types of artifacts, overall performance always improves when any cocktail-party training is used.

Authors: Marti, Amparo; Cobos, Máximo; Lopez, José J.
Affiliations: Universitat de València, Valencia, Spain; Universitat Politècnica de València, Valencia, Spain (see document for exact affiliation information)
AES Convention: 132 (April 2012)
Paper Number: 8635
Publication Date: April 26, 2012
Subject: Analysis and Synthesis and Content Management
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=16273

E-Library Location: (CD 132Papers) /conv/132/8635.pdf
