Speech Separation with Microphone Arrays Using the Mean Shift Algorithm
×
Cite This
Citation & Abstract
D. Ayllón, R. Gil-Pita, and M. Rosa-Zurera, "Speech Separation with Microphone Arrays Using the Mean Shift Algorithm," Paper 8799, (2012 October.). doi:
D. Ayllón, R. Gil-Pita, and M. Rosa-Zurera, "Speech Separation with Microphone Arrays Using the Mean Shift Algorithm," Paper 8799, (2012 October.). doi:
Abstract: Microphone arrays provide spatial resolution that is useful for speech source separation due to the fact that sources located in different positions cause different time and level differences in the elements of the array. This feature can be combined with time-frequency masking in order to separate speech mixtures by means of clustering techniques, such as the so-called DUET algorithm, which uses only two microphones. However, there are applications where larger arrays are available, and the separation can be performed using all these microphones. A speech separation algorithm based on mean shift clustering technique has been recently proposed using only two microphones. In this work the aforementioned algorithm is generalized for arrays of any number of microphones, testing its performance with echoic speech mixtures. The results obtained show that the generalized mean shift algorithm notably outperforms the results obtained by the original DUET algorithm.
@article{ayllón2012speech,
author={ayllón, david and gil-pita, roberto and rosa-zurera, manuel},
journal={journal of the audio engineering society},
title={speech separation with microphone arrays using the mean shift algorithm},
year={2012},
volume={},
number={},
pages={},
doi={},
month={october},}
@article{ayllón2012speech,
author={ayllón, david and gil-pita, roberto and rosa-zurera, manuel},
journal={journal of the audio engineering society},
title={speech separation with microphone arrays using the mean shift algorithm},
year={2012},
volume={},
number={},
pages={},
doi={},
month={october},
abstract={microphone arrays provide spatial resolution that is useful for speech source separation due to the fact that sources located in different positions cause different time and level differences in the elements of the array. this feature can be combined with time-frequency masking in order to separate speech mixtures by means of clustering techniques, such as the so-called duet algorithm, which uses only two microphones. however, there are applications where larger arrays are available, and the separation can be performed using all these microphones. a speech separation algorithm based on mean shift clustering technique has been recently proposed using only two microphones. in this work the aforementioned algorithm is generalized for arrays of any number of microphones, testing its performance with echoic speech mixtures. the results obtained show that the generalized mean shift algorithm notably outperforms the results obtained by the original duet algorithm.},}
TY - paper
TI - Speech Separation with Microphone Arrays Using the Mean Shift Algorithm
SP -
EP -
AU - Ayllón, David
AU - Gil-Pita, Roberto
AU - Rosa-Zurera, Manuel
PY - 2012
JO - Journal of the Audio Engineering Society
IS -
VO -
VL -
Y1 - October 2012
TY - paper
TI - Speech Separation with Microphone Arrays Using the Mean Shift Algorithm
SP -
EP -
AU - Ayllón, David
AU - Gil-Pita, Roberto
AU - Rosa-Zurera, Manuel
PY - 2012
JO - Journal of the Audio Engineering Society
IS -
VO -
VL -
Y1 - October 2012
AB - Microphone arrays provide spatial resolution that is useful for speech source separation due to the fact that sources located in different positions cause different time and level differences in the elements of the array. This feature can be combined with time-frequency masking in order to separate speech mixtures by means of clustering techniques, such as the so-called DUET algorithm, which uses only two microphones. However, there are applications where larger arrays are available, and the separation can be performed using all these microphones. A speech separation algorithm based on mean shift clustering technique has been recently proposed using only two microphones. In this work the aforementioned algorithm is generalized for arrays of any number of microphones, testing its performance with echoic speech mixtures. The results obtained show that the generalized mean shift algorithm notably outperforms the results obtained by the original DUET algorithm.
Microphone arrays provide spatial resolution that is useful for speech source separation due to the fact that sources located in different positions cause different time and level differences in the elements of the array. This feature can be combined with time-frequency masking in order to separate speech mixtures by means of clustering techniques, such as the so-called DUET algorithm, which uses only two microphones. However, there are applications where larger arrays are available, and the separation can be performed using all these microphones. A speech separation algorithm based on mean shift clustering technique has been recently proposed using only two microphones. In this work the aforementioned algorithm is generalized for arrays of any number of microphones, testing its performance with echoic speech mixtures. The results obtained show that the generalized mean shift algorithm notably outperforms the results obtained by the original DUET algorithm.
Authors:
Ayllón, David; Gil-Pita, Roberto; Rosa-Zurera, Manuel
Affiliation:
University of Alcala, Alcalá de Henares, Spain
AES Convention:
133 (October 2012)
Paper Number:
8799
Publication Date:
October 25, 2012Import into BibTeX
Subject:
Analysis and Synthesis of Sound
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=16541