Esophageal Voice Enhancement by Modeling Radiated Pulses in Frequency Domain
×
Cite This
Citation & Abstract
J. Bonada, and A. Loscos, "Esophageal Voice Enhancement by Modeling Radiated Pulses in Frequency Domain," Paper 6952, (2006 October.). doi:
J. Bonada, and A. Loscos, "Esophageal Voice Enhancement by Modeling Radiated Pulses in Frequency Domain," Paper 6952, (2006 October.). doi:
Abstract: Although esophageal speech has demonstrated to be the most popular voice recovering method after laryngectomy surgery, it is difficult to master and shows a poor degree of intelligibility. This article proposes a new method for esophageal voice enhancement using speech digital signal processing techniques based on modeling radiated voice pulses in frequency domain. The analysis-transformation-synthesis technique creates a non-pathological spectrum for those utterances featured as voiced and filters those unvoiced. Healthy spectrum generation implies transforming the original timbre, modeling harmonic phase coupling from the spectral shape envelope, and deriving pitch from frame energy analysis. Resynthesized speech aims to improve intelligibility, minimize artificial artifacts, and acquire resemblance to patient’s pre-surgery original voice.
@article{bonada2006esophageal,
author={bonada, jordi and loscos, alex},
journal={journal of the audio engineering society},
title={esophageal voice enhancement by modeling radiated pulses in frequency domain},
year={2006},
volume={},
number={},
pages={},
doi={},
month={october},}
@article{bonada2006esophageal,
author={bonada, jordi and loscos, alex},
journal={journal of the audio engineering society},
title={esophageal voice enhancement by modeling radiated pulses in frequency domain},
year={2006},
volume={},
number={},
pages={},
doi={},
month={october},
abstract={although esophageal speech has demonstrated to be the most popular voice recovering method after laryngectomy surgery, it is difficult to master and shows a poor degree of intelligibility. this article proposes a new method for esophageal voice enhancement using speech digital signal processing techniques based on modeling radiated voice pulses in frequency domain. the analysis-transformation-synthesis technique creates a non-pathological spectrum for those utterances featured as voiced and filters those unvoiced. healthy spectrum generation implies transforming the original timbre, modeling harmonic phase coupling from the spectral shape envelope, and deriving pitch from frame energy analysis. resynthesized speech aims to improve intelligibility, minimize artificial artifacts, and acquire resemblance to patient’s pre-surgery original voice.},}
TY - paper
TI - Esophageal Voice Enhancement by Modeling Radiated Pulses in Frequency Domain
SP -
EP -
AU - Bonada, Jordi
AU - Loscos, Alex
PY - 2006
JO - Journal of the Audio Engineering Society
IS -
VO -
VL -
Y1 - October 2006
TY - paper
TI - Esophageal Voice Enhancement by Modeling Radiated Pulses in Frequency Domain
SP -
EP -
AU - Bonada, Jordi
AU - Loscos, Alex
PY - 2006
JO - Journal of the Audio Engineering Society
IS -
VO -
VL -
Y1 - October 2006
AB - Although esophageal speech has demonstrated to be the most popular voice recovering method after laryngectomy surgery, it is difficult to master and shows a poor degree of intelligibility. This article proposes a new method for esophageal voice enhancement using speech digital signal processing techniques based on modeling radiated voice pulses in frequency domain. The analysis-transformation-synthesis technique creates a non-pathological spectrum for those utterances featured as voiced and filters those unvoiced. Healthy spectrum generation implies transforming the original timbre, modeling harmonic phase coupling from the spectral shape envelope, and deriving pitch from frame energy analysis. Resynthesized speech aims to improve intelligibility, minimize artificial artifacts, and acquire resemblance to patient’s pre-surgery original voice.
Although esophageal speech has demonstrated to be the most popular voice recovering method after laryngectomy surgery, it is difficult to master and shows a poor degree of intelligibility. This article proposes a new method for esophageal voice enhancement using speech digital signal processing techniques based on modeling radiated voice pulses in frequency domain. The analysis-transformation-synthesis technique creates a non-pathological spectrum for those utterances featured as voiced and filters those unvoiced. Healthy spectrum generation implies transforming the original timbre, modeling harmonic phase coupling from the spectral shape envelope, and deriving pitch from frame energy analysis. Resynthesized speech aims to improve intelligibility, minimize artificial artifacts, and acquire resemblance to patient’s pre-surgery original voice.
Authors:
Bonada, Jordi; Loscos, Alex
Affiliation:
Music Technology Group, Universitat Pompeu Fabra
AES Convention:
121 (October 2006)
Paper Number:
6952
Publication Date:
October 1, 2006Import into BibTeX
Subject:
Signal Processing
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=13786