Noise Robustness Automatic Speech Recognition with Convolutional Neural Network and Time Delay Neural Network

Wang, Jie; Wang, Dunze; Chen, Yunda; Lu, Xun; Zheng, Chengshi

AES E-Library

To improve automatic speech recognition in noisy environments, a convolutional neural network (CNN) is combined with a time-delay neural network (TDNN); the resulting model is referred to as CNN-TDNN. The CNN-TDNN model is further optimized by factoring the parameter matrix in the TDNN hidden layers and adding a time-restricted self-attention layer after the CNN-TDNN hidden layers. Experimental results show that the optimized CNN-TDNN model outperforms DNN, CNN, TDNN, and the unoptimized CNN-TDNN. The average word error rate (WER) is reduced by 11.76% compared with the baselines.
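The abstract reports its results as word error rate. As a reminder of how that metric is conventionally defined (word-level edit distance divided by the number of reference words), here is a minimal sketch in Python. The function name `word_error_rate` and the example sentences are illustrative assumptions, not taken from the paper, and the scoring tool the authors used may normalize text differently.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # aligning i reference words with an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference word missing
                curr[j - 1] + 1,     # insertion: extra hypothesis word
                prev[j - 1] + cost,  # substitution or exact match
            ))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: one of six reference words is dropped.
wer = word_error_rate("the cat sat on the mat", "the cat sat on mat")
print(f"WER = {wer:.2%}")
```

Because the denominator is the reference length, WER can exceed 100% when the hypothesis contains many insertions.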

Authors: Wang, Jie; Wang, Dunze; Chen, Yunda; Lu, Xun; Zheng, Chengshi
Affiliations: Guangzhou University, Guangzhou, China; Power Grid Planning Center, Guangdong Power Grid Company, Guangdong, China; Institute of Acoustics, Chinese Academy of Sciences, Beijing, China (see document for exact affiliation information)
AES Convention: 147 (October 2019)
Paper Number: 10272
Publication Date: October 8, 2019
Subject: Posters: Applications in Audio
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=20645

E-Library Location: /conv/147/10272.pdf
