Full-Text Indexing of Very Large Audio Data Bases
We present a system for indexing of and index-based search in PCM-based audio material. Given a short excerpt of a waveform signal as a query, the index returns all pieces in a data base containing that waveform. Additionally, the precise position of the waveform within those pieces is returned. The indexing method is robust against several signal processing operations such as lossy compression or addition of noise. Indexing of a test data base consisting of approx. 10 GB of audio data results in an index of size 16 MB. Response times to queries of lengths of about one or a half of a second are only fractions of a second.
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!
This paper costs $33 for non-members and is temporarily free for AES members.