You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
This paper describes an application of speech rate conversion technology to video editing. In video editing, it is common to search through the material at several times normal speed. The speech rate conversion technology maintains the original pitch and timbre of speech despite playing it back at a faster rate, which is varied adaptively to permit fast listening in real-time. In listening tests, users were able to comprehend speech played at up to 5 times normal speed which was incomprehensible without the adaptive rate conversion.
Author (s): Imai, Atsushi;
Seiyama, Nobumasa;
Mishima, Takeshi;
Takagi, Tohru;
Miyasaka, Eiichi;
Affiliation:
NHK (Japanese Broadcasting Corp.) Science and Technical Research Laboratories, Setagaya-ku, Tokyo, Japan
(See document for exact affiliation information.)
Publication Date:
2001-10-06
Session subject:
Archiving, Restoration, and New Methods of Recording
DOI:
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Imai, Atsushi; Seiyama, Nobumasa; Mishima, Takeshi; Takagi, Tohru; Miyasaka, Eiichi; 2001; Application Of Speech Rate Conversion Technology To Video Editing: Allows Up To 5 Times Normal Speed Playback While Maintaining Speech Intelligibility [PDF]; NHK (Japanese Broadcasting Corp.) Science and Technical Research Laboratories, Setagaya-ku, Tokyo, Japan; Paper 1934; Available from: https://aes.org/publications/elibrary-page/?id=10055
Imai, Atsushi; Seiyama, Nobumasa; Mishima, Takeshi; Takagi, Tohru; Miyasaka, Eiichi; Application Of Speech Rate Conversion Technology To Video Editing: Allows Up To 5 Times Normal Speed Playback While Maintaining Speech Intelligibility [PDF]; NHK (Japanese Broadcasting Corp.) Science and Technical Research Laboratories, Setagaya-ku, Tokyo, Japan; Paper 1934; 2001 Available: https://aes.org/publications/elibrary-page/?id=10055
@inproceedings{Imai2001application,
title={{Application Of Speech Rate Conversion Technology To Video Editing: Allows Up To 5 Times Normal Speed Playback While Maintaining Speech Intelligibility}},
author={Imai, Atsushi and Seiyama, Nobumasa and Mishima, Takeshi and Takagi, Tohru and Miyasaka, Eiichi},
year={2001},
month={oct},
booktitle={Journal of the Audio Engineering Society},
publisher={Paper 1934; AES Conference: 20th International Conference: Archiving, Restoration, and New Methods of Recording; October 2001},
number={1934},
organization={AES},
}
TY – paper
TI – Application Of Speech Rate Conversion Technology To Video Editing: Allows Up To 5 Times Normal Speed Playback While Maintaining Speech Intelligibility
AU – Imai, Atsushi
AU – Seiyama, Nobumasa
AU – Mishima, Takeshi
AU – Takagi, Tohru
AU – Miyasaka, Eiichi
PY – 2001
JO – Journal of the Audio Engineering Society
VL – 1934
Y1 – October 2001
Notifications