17 LSTMs and automatic speech recognition

This chapter covers

  • Preparing a dataset for automatic speech recognition using the LibriSpeech corpus
  • Training a long short-term memory (LSTM) RNN for converting speech to text
  • Evaluating the LSTM's performance during and after training

Talking to your electronic devices is commonplace nowadays. Years ago, on an early version of my smartphone, I clicked the microphone button and used its dictation function to try to speak an email into existence. The email that my boss received at work was full of typos and phonetic errors, though, and he wondered whether I was mixing a little too much after-work activity with my official duties!

The world has evolved, and so has the accuracy of ...

Get Machine Learning with TensorFlow, Second Edition now with O’Reilly online learning.
