AES E-Library

AES E-Library

Application Of Speech Rate Conversion Technology To Video Editing: Allows Up To 5 Times Normal Speed Playback While Maintaining Speech Intelligibility

This paper describes an application of speech rate conversion technology to video editing. In video editing, it is common to search through the material at several times normal speed. The speech rate conversion technology maintains the original pitch and timbre of speech despite playing it back at a faster rate, which is varied adaptively to permit fast listening in real-time. In listening tests, users were able to comprehend speech played at up to 5 times normal speed which was incomprehensible without the adaptive rate conversion.

AES Conference:
Paper Number:
Publication Date:

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Learn more about the AES E-Library

E-Library Location:

Start a discussion about this paper!

AES - Audio Engineering Society