17 LSTMs and automatic speech recognition

This chapter covers

  • Preparing a dataset for automatic speech recognition using the LibriSpeech corpus
  • Training a long short-term memory (LSTM) RNN for converting speech to text
  • Evaluating the LSTM performance during and after training

Talking to your electronic devices is commonplace nowadays. Years ago, on an early version of my smartphone, I clicked the microphone button and used its dictation function to try to speak an email into existence. The email that my boss received at work had a whole bunch of typos and phonetic errors, though, and he wondered whether I was mixing a little too much after-work activity with my official duties!

The world has evolved, and so has the accuracy of ...
