You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
A speech classification system is proposed which has applications for accessibility of content for younger children. To allow a young child to access online content (where typical interfaces such as search engines or hierarchical navigation would be inappropriate) we propose a voice classification system trained to recognise a range of sounds and vocabulary typical of younger children. As an example we design a system for classifying animal noises. Acoustic features are extracted from a corpus of animal noises made by a class of young children. A Support Vector Machine is trained to classify the sounds into one of 12 corresponding animals. We investigate the precision and recall of the classifier for various classification parameters. We investigate an appropriate choice of features to extract from the audio and compare the performance when using mean Mel-frequency Cepstral Coefficients (MFCC), a single-Gaussian model fitted to the MFCCs as well as a range of temporal features. To investigate the real-world applicability of the system we pay particular attention to the difference between training a generic classifier from a collected corpus of examples and one trained to a particular voice.
Author (s): Lowis, Christopher;
Pike, Christopher;
Raimond, Yves;
Affiliation:
BBC R&D, London, UK
(See document for exact affiliation information.)
AES Convention: 132
Paper Number:8589
Publication Date:
2012-04-06
Session subject:
Emerging Audio Technologies
DOI:
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Lowis, Christopher; Pike, Christopher; Raimond, Yves; 2012; A Voice Classification System for Younger Children with Applications to Content Navigation [PDF]; BBC R&D, London, UK; Paper 8589; Available from: https://aes.org/publications/elibrary-page/?id=16227
Lowis, Christopher; Pike, Christopher; Raimond, Yves; A Voice Classification System for Younger Children with Applications to Content Navigation [PDF]; BBC R&D, London, UK; Paper 8589; 2012 Available: https://aes.org/publications/elibrary-page/?id=16227
@inproceedings{Lowis2012a,
title={{A Voice Classification System for Younger Children with Applications to Content Navigation}},
author={Lowis, Christopher and Pike, Christopher and Raimond, Yves},
year={2012},
month={apr},
booktitle={Journal of the Audio Engineering Society},
publisher={Paper 8589; AES Convention 132; April 2012},
number={8589},
organization={AES},
}
Notifications