
AES E-Library

Word-Recognition Strategy

From a practical point of view, automatic speech recognition involves the speaker at least as much as, if not more than, it involves the mechanism. I will try to illustrate this point by describing some of the strategies that have evolved during work on a word-recognition computer program. Many of the details I omit here are included in my report for the Research Laboratory of Electronics of M.I.T., "Word-Recognition Computer Program." Before getting into the specifics of the work, I would like to say a few words about its overall aims. In my own experience, there are three areas for which automatic speech recognition, and in particular automatic word recognition, may be pertinent: speech bandwidth compression, voice-actuated mechanisms, and "talking to computers." I believe that the last of these three applications is presently the most practical and pertinent one. Most of my recent efforts have been directed toward supplying a useful verbal command program as an aid to a graphical communications system in which the computer operator draws flow charts on the oscilloscope with either a light pen or a Rand tablet and the master program translates the resulting drawing into a program. Such an aid appears to be very useful for mode changes in the master program. Chronologically, the work proceeded as follows: first, the development of a set of rules to be used as the basis of a word-recognition scheme; second, the development of a strategy that appeared appropriate for on-line control of other programs. The final step, which we have not yet taken, is the actual connection of the word recognizer to the graphical communications program.


AES - Audio Engineering Society