The purpose of this study is to propose a framework of unified speech and audio coding scheme that can compress speech and music equally well, and then to verify the feasibility of a highly efficient low-rate coding scheme. In this paper, a coding scheme is introduced by utilizing flexible time and frequency representation of a filter bank called Frequency Varying Modulated Lapped Transform (FV-MLT). Experimental results show that the proposed technology can improve the performance of existing technologies for speech and audio contents at low bitrates of 16 - 24kits/sec mono and stereo. We hope that this framework for unified speech and audio coding can deal with various kinds of audio contents at low bitrates and eventually provide a new opportunity for converged applications in mobile devices.
https://www.aes.org/e-lib/browse.cfm?elib=14429
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.
Learn more about the AES E-Library
Start a discussion about this paper!