For professional quality transcription of audio signals, a floating-point representation can give substantial savings over a straight binary representation. This paper shows, based on psychoacoustic data of masking of noise, how floating-point mantissa length, radix value, and exponent length can be arrived at. Listening tests, using real-time computer-generated music, yielded excellent agreement with the calculated values.
https://www.aes.org/e-lib/browse.cfm?elib=3374
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is free for AES members and E-Library subscribers.
Learn more about the AES E-Library
Start a discussion about this paper!