17 LSTMs and automatic speech recognition
This chapter covers
- Preparing a dataset for automatic speech recognition using the LibriSpeech corpus
- Training a long short-term memory (LSTM) RNN for converting speech to text
- Evaluating the LSTM performance during and after training
Speaking and talking to your electronic devices is commonplace nowadays. Years ago, on an early version of my smartphone, I clicked the microphone button and used its dictation function to try to speak an email into existence. The email that my boss received at work had a whole bunch of typos and phonetic errors, though, and he wondered whether I was mixing a little too much after-work activity with my official duties!
The world has evolved, and so has the accuracy of ...