Chapter 4. Recognizing images and sounds using convnets

This chapter covers

  • How images and other perceptual data, such as audio, are represented as multidimensional tensors
  • What convnets are, how they work, and why they are especially suitable for machine-learning tasks involving images
  • How to write and train a convnet in TensorFlow.js to solve the task of classifying hand-written digits
  • How to train models in Node.js to achieve faster training speeds
  • How to use convnets on audio data for spoken-word recognition

The ongoing deep-learning revolution started with breakthroughs in image-recognition tasks such as the ImageNet competition. There is a wide range of useful and technically interesting problems that involve images, from recognizing ...

Get Deep Learning with JavaScript now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.