arxiv:1808.03570

Densely Connected Convolutional Networks for Speech Recognition

Published on Aug 10, 2018

Authors:

Abstract

Densely Connected Convolutional Networks significantly outperform other neural-based models in acoustic modeling for automatic speech recognition, even with reduced training data.

AI-generated summary

This paper presents our latest investigation on Densely Connected Convolutional Networks (DenseNets) for acoustic modelling (AM) in automatic speech recognition. DenseN-ets are very deep, compact convolutional neural networks, which have demonstrated incredible improvements over the state-of-the-art results on several data sets in computer vision. Our experimental results show that DenseNet can be used for AM significantly outperforming other neural-based models such as DNNs, CNNs, VGGs. Furthermore, results on Wall Street Journal revealed that with only a half of the training data DenseNet was able to outperform other models trained with the full data set by a large margin.

View arXiv page View PDF Add to collection