Skip to Content
Data Engineering Best Practices
book

Data Engineering Best Practices

by Richard J. Schiller, David Larochelle
October 2024
Intermediate to advanced
550 pages
20h 8m
English
Packt Publishing
Content preview from Data Engineering Best Practices

16

Machine Learning Pipeline Best Practices and Processes

At this point in the book, you are equipped with a fine understanding of how to produce a data factory pipeline and render data into releasable sets in a consumable area. After making them clearly available as a knowledge base with transparent metadata and lineage services, you can provide analysis capabilities in your analytics workbench. What we have not yet covered is the iterative cycles needed to tease out insights from your quality information via machine learning algorithms. This involves minimizing the technical effort, implementing a high degree of objective quality, organizing flows and optimized models, while integrating cutting-edge technologies. All this must take place to ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Fundamentals of Data Engineering

Fundamentals of Data Engineering

Joe Reis, Matt Housley
Fundamentals of Data Engineering

Fundamentals of Data Engineering

Joe Reis, Matt Housley

Publisher Resources

ISBN: 9781803244983Supplemental Content