Examining Influence of Distance to Microphone on Accuracy of Speech Recognition
×
Cite This
Citation & Abstract
P. Bratoszewski, M. Szykulski, and A. Czyzewski, "Examining Influence of Distance to Microphone on Accuracy of Speech Recognition," Engineering Brief 194, (2015 May.). doi:
P. Bratoszewski, M. Szykulski, and A. Czyzewski, "Examining Influence of Distance to Microphone on Accuracy of Speech Recognition," Engineering Brief 194, (2015 May.). doi:
Abstract: The problem of controlling a machine by the distant-talking speaker without a necessity of handheld or body-worn equipment usage is considered. A laboratory setup is introduced for examination of performance of the developed automatic speech recognition system fed by direct and by distant speech acquired by microphones placed at three different distances from the speaker (0.5 m to 1.5 m). For feature extraction from the voice signal the Mel- Frequency Cepstral Coefficients (MFCC) are used. The experiments are conducted employing the HTK engine (Hidden Markov Toolkit) for the Automatic Speech Recognition (ASR) task. The dictionary of 184 words was employed and WER (Word Error Rate), correctness and accuracy measures were calculated in order to verify and to compare obtained results of speech recognition.
@article{bratoszewski2015examining,
author={bratoszewski, piotr and szykulski, marcin and czyzewski, andrzej},
journal={journal of the audio engineering society},
title={examining influence of distance to microphone on accuracy of speech recognition},
year={2015},
volume={},
number={},
pages={},
doi={},
month={may},}
@article{bratoszewski2015examining,
author={bratoszewski, piotr and szykulski, marcin and czyzewski, andrzej},
journal={journal of the audio engineering society},
title={examining influence of distance to microphone on accuracy of speech recognition},
year={2015},
volume={},
number={},
pages={},
doi={},
month={may},
abstract={the problem of controlling a machine by the distant-talking speaker without a necessity of handheld or body-worn equipment usage is considered. a laboratory setup is introduced for examination of performance of the developed automatic speech recognition system fed by direct and by distant speech acquired by microphones placed at three different distances from the speaker (0.5 m to 1.5 m). for feature extraction from the voice signal the mel- frequency cepstral coefficients (mfcc) are used. the experiments are conducted employing the htk engine (hidden markov toolkit) for the automatic speech recognition (asr) task. the dictionary of 184 words was employed and wer (word error rate), correctness and accuracy measures were calculated in order to verify and to compare obtained results of speech recognition.},}
TY - paper
TI - Examining Influence of Distance to Microphone on Accuracy of Speech Recognition
SP -
EP -
AU - Bratoszewski, Piotr
AU - Szykulski, Marcin
AU - Czyzewski, Andrzej
PY - 2015
JO - Journal of the Audio Engineering Society
IS -
VO -
VL -
Y1 - May 2015
TY - paper
TI - Examining Influence of Distance to Microphone on Accuracy of Speech Recognition
SP -
EP -
AU - Bratoszewski, Piotr
AU - Szykulski, Marcin
AU - Czyzewski, Andrzej
PY - 2015
JO - Journal of the Audio Engineering Society
IS -
VO -
VL -
Y1 - May 2015
AB - The problem of controlling a machine by the distant-talking speaker without a necessity of handheld or body-worn equipment usage is considered. A laboratory setup is introduced for examination of performance of the developed automatic speech recognition system fed by direct and by distant speech acquired by microphones placed at three different distances from the speaker (0.5 m to 1.5 m). For feature extraction from the voice signal the Mel- Frequency Cepstral Coefficients (MFCC) are used. The experiments are conducted employing the HTK engine (Hidden Markov Toolkit) for the Automatic Speech Recognition (ASR) task. The dictionary of 184 words was employed and WER (Word Error Rate), correctness and accuracy measures were calculated in order to verify and to compare obtained results of speech recognition.
The problem of controlling a machine by the distant-talking speaker without a necessity of handheld or body-worn equipment usage is considered. A laboratory setup is introduced for examination of performance of the developed automatic speech recognition system fed by direct and by distant speech acquired by microphones placed at three different distances from the speaker (0.5 m to 1.5 m). For feature extraction from the voice signal the Mel- Frequency Cepstral Coefficients (MFCC) are used. The experiments are conducted employing the HTK engine (Hidden Markov Toolkit) for the Automatic Speech Recognition (ASR) task. The dictionary of 184 words was employed and WER (Word Error Rate), correctness and accuracy measures were calculated in order to verify and to compare obtained results of speech recognition.
Authors:
Bratoszewski, Piotr; Szykulski, Marcin; Czyzewski, Andrzej
Affiliation:
Gdansk University of Technology, Gdansk, Poland
AES Convention:
138 (May 2015)eBrief:194
Publication Date:
May 5, 2015Import into BibTeX
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=17629
The Engineering Briefs at this Convention were
selected on the basis of a submitted synopsis,
ensuring that they are of interest to AES members,
and are not overly commercial. These briefs have
been reproduced from the authors' advance
manuscripts, without editing, corrections, or
consideration by the Review Board. The AES takes no
responsibility for their contents. Paper copies are
not available, but any member can freely access
these briefs. Members are encouraged to provide
comments that enhance their usefulness.