One of the most important requirements for training machine learning (ML) models and deep neural networks (DNNs) is a large training dataset drawn from a given sample space, with a distribution that is mostly unknown and is learned during training, so that the model can learn from the training data and generalize well to unseen future data or a held-out test set. A validation dataset, which typically comes from the same distribution as the training set, is also critical for tuning model hyperparameters. In many cases, developers start with whatever data is available, whether a little or a lot, to train ML models, including high-capacity deep neural networks.
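As a minimal sketch of the train/validation/test split described above, the following assumes a small in-memory dataset (the tensors and split sizes here are hypothetical) and uses tf.data to carve it into the three subsets:

```python
import tensorflow as tf

# Hypothetical in-memory dataset: 1,000 (feature, label) pairs.
features = tf.random.normal([1000, 10])
labels = tf.random.uniform([1000], maxval=2, dtype=tf.int32)

dataset = tf.data.Dataset.from_tensor_slices((features, labels))
# Shuffle once with a fixed order so skip/take yield disjoint, stable splits.
dataset = dataset.shuffle(buffer_size=1000, seed=42,
                          reshuffle_each_iteration=False)

# 80/10/10 split into training, validation, and test sets.
train_size, val_size = 800, 100
train_ds = dataset.take(train_size)
val_ds = dataset.skip(train_size).take(val_size)
test_ds = dataset.skip(train_size + val_size)
```

Setting `reshuffle_each_iteration=False` matters here: with the default behavior the order changes every epoch, and `skip`/`take` would no longer produce fixed, non-overlapping splits.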
Designing and constructing the data pipeline
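A typical TensorFlow 2.0 input pipeline is built with the tf.data API: read raw examples, map a preprocessing function over them, then shuffle, batch, and prefetch. The sketch below is illustrative only; it loads the built-in MNIST dataset so it is self-contained, and the `preprocess` function is a hypothetical per-example transformation:

```python
import tensorflow as tf

def preprocess(image, label):
    # Hypothetical transformation: scale pixel values to [0, 1].
    image = tf.cast(image, tf.float32) / 255.0
    return image, label

# Built-in dataset, used here only to keep the sketch runnable.
(x_train, y_train), _ = tf.keras.datasets.mnist.load_data()

train_ds = (
    tf.data.Dataset.from_tensor_slices((x_train, y_train))
    .map(preprocess, num_parallel_calls=tf.data.experimental.AUTOTUNE)
    .shuffle(buffer_size=10_000)   # randomize example order each epoch
    .batch(32)                     # group examples into mini-batches
    .prefetch(tf.data.experimental.AUTOTUNE)  # overlap input prep with training
)
```

Chaining `map`, `shuffle`, `batch`, and `prefetch` in this order lets tf.data parallelize preprocessing and keep the accelerator fed while the previous batch is training.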