Phone-based Spoken Document Retrieval in Conformance with the MPEG-7 Standard

Moreau, Nicolas; Kim, Hyoung-Gook

AES E-Library

This paper presents a phone-based approach to spoken document retrieval, developed in the framework of the emerging MPEG-7 standard. The audio part of MPEG-7 includes a SpokenContent tool that provides a standardized description of the content of spoken documents. Within this framework, we propose an indexing and retrieval method that relies solely on phonetic information and uses a vector space IR model. Experiments are conducted on a database of German spoken documents, using 10 city-name queries. Two phone-based retrieval approaches are presented and combined. The first uses phone N-grams of different lengths together as indexing terms. The second expands the document representation by means of phone confusion probabilities.
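
The two ideas named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the use of raw counts as term weights, the single-substitution expansion rule, the toy phone symbols, and the confusion value 0.3 are all assumptions made for illustration, and no MPEG-7 SpokenContent parsing is attempted.

```python
import math
from collections import Counter


def phone_ngrams(phones, n_values=(1, 2, 3)):
    """Count phone N-grams of several lengths in one phone sequence."""
    counts = Counter()
    for n in n_values:
        for i in range(len(phones) - n + 1):
            counts[tuple(phones[i:i + n])] += 1
    return counts


def expand_with_confusions(counts, confusion):
    """Add soft counts for N-grams that differ by one confusable phone.

    `confusion[x][y]` is a hypothetical probability that a recognized
    phone x was really phone y. Each variant term receives the original
    count scaled by that probability.
    """
    expanded = dict(counts)
    for term, c in counts.items():
        for pos, phone in enumerate(term):
            for alt, p in confusion.get(phone, {}).items():
                variant = term[:pos] + (alt,) + term[pos + 1:]
                expanded[variant] = expanded.get(variant, 0.0) + c * p
    return expanded


def cosine(a, b):
    """Cosine similarity between two sparse term-weight vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def score(query_phones, doc_phones, confusion=None):
    """Vector space score of one document for one phone-string query."""
    q = phone_ngrams(query_phones)
    d = phone_ngrams(doc_phones)
    if confusion:
        d = expand_with_confusions(d, confusion)
    return cosine(q, d)


# Toy data: the query phones, and a document in which the recognizer
# (hypothetically) output "p" where "b" was spoken.
query = "b E6 l i: n".split()
noisy_doc = "p E6 l i: n".split()
toy_confusion = {"p": {"b": 0.3}}  # made-up value

plain = score(query, noisy_doc)
expanded = score(query, noisy_doc, toy_confusion)
print(plain < expanded)
```

In this toy case the expansion raises the score of the misrecognized document, because the expanded document vector regains some weight on the N-grams containing "b". A real system would typically replace the raw counts with a proper term weighting and estimate the confusion probabilities from recognizer output.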

Authors: Moreau, Nicolas; Kim, Hyoung-Gook
Affiliation: Communication Systems Group, Technical University of Berlin, Germany
AES Conference: 25th International Conference: Metadata for Audio (June 2004)
Paper Number: 2-4
Publication Date: June 1, 2004
Subject: Metadata for Audio
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=12806

E-Library Location: (CD 25thPapers) /25/aes25-000026.pdf
