DL models for ASR
A number of DL algorithms and models have been used in ASR. The Deep Belief Network (DBN) was one of the early applications of DL to ASR; it has generally been used as a pre-training layer together with a single supervised layer of a Deep Neural Network (DNN). Long Short-Term Memory (LSTM) networks have been used for large-scale acoustic modeling, and Time Delay Neural Network (TDNN) architectures have been used for audio signal processing. The Convolutional Neural Network (CNN), which popularized DL, is also used as a DL architecture for ASR. The use of DL architectures has significantly improved the recognition accuracy of ASR systems. However, not all DL architectures have shown improvements, especially across different types of audio signals and environments, such as noisy and reverberant ...
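To make one of the architectures named above more concrete, here is a minimal sketch of a single TDNN layer in plain Python. It is an illustration under assumptions, not the book's implementation: the function name `tdnn_layer`, the context offsets `(-2, 0, 2)`, the ReLU activation, and the random placeholder features and weights are all hypothetical choices. The idea it shows is that each output frame is computed from a few input frames spliced together at fixed time offsets, with the same weights reused at every time step.

```python
import random


def tdnn_layer(frames, weights, bias, context=(-2, 0, 2)):
    """Apply one time-delay layer to a sequence of feature frames.

    frames  : list of T feature vectors (each a list of floats)
    weights : matrix of shape (len(context) * feat_dim, hidden_dim)
    bias    : list of hidden_dim floats
    context : time offsets spliced together for each output frame (assumed values)
    """
    num_frames = len(frames)
    # Keep only time steps whose whole context window lies inside the utterance.
    start = max(0, -min(context))
    stop = num_frames - max(0, max(context))
    outputs = []
    for t in range(start, stop):
        # Splice the frames at t + offset into one long input vector.
        spliced = [x for offset in context for x in frames[t + offset]]
        out = []
        for j in range(len(bias)):
            z = bias[j] + sum(s * weights[i][j] for i, s in enumerate(spliced))
            out.append(max(0.0, z))  # ReLU activation (an assumption)
        outputs.append(out)
    return outputs


# Placeholder data: 10 frames of 3-dimensional features, untrained random weights.
random.seed(0)
feat_dim, hidden_dim, context = 3, 4, (-2, 0, 2)
frames = [[random.gauss(0, 1) for _ in range(feat_dim)] for _ in range(10)]
weights = [[random.gauss(0, 0.1) for _ in range(hidden_dim)]
           for _ in range(len(context) * feat_dim)]
bias = [0.0] * hidden_dim

hidden = tdnn_layer(frames, weights, bias, context)
print(len(hidden), len(hidden[0]))
```

Because the window spans two frames on either side, the 10 input frames yield 6 output frames. Stacking several such layers with wider offsets lets higher layers cover longer stretches of audio, which is the usual motivation for the architecture; a practical system would use a trained model from a DL framework rather than hand-written loops.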