In the spectrographic analysis of voice exemplars, there is a question of the impact of commercially available compression schemes on the visual representation of the voice. Specifically, there is a question whether or not the codec reconstruction of the time-domain waveform might alter features of the wide-band spectrogram, including formant shape and position. This paper contributes a collection of comparative analyses of English speech exemplars that are recorded simultaneously via both linear and compression-based recording methods. G.723-encoded recordings (max 6.3kbps, used in IC-recorders and surveillance recording equipment) and the Sony Memory stick format using MSV LPEC-SP compression are evaluated.
https://www.aes.org/e-lib/browse.cfm?elib=13255
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.
Learn more about the AES E-Library
Start a discussion about this paper!