© Harsh Chawla and Pankaj Khattar 2020
H. Chawla, P. KhattarData Lake Analytics on Microsoft Azurehttps://doi.org/10.1007/978-1-4842-6252-8_6

6. Data Preparation and Training Part I

Harsh Chawla1   and Pankaj Khattar2
(1)
Bengaluru, India
(2)
Delhi, India
 
The data preparation and training phase is the most important phase of the data analytics solution. During this phase, data ingested from various sources is merged and crunched together (Figure 6-1). The transformed data further gets infused with machine learning models or is sent to the model and serve phase. The entire data journey is planned, based on the target use case. This phase has been split into two chapters. In this chapter, the discussion is on the various technologies that are applicable ...

Get Data Lake Analytics on Microsoft Azure: A Practitioner's Guide to Big Data Engineering now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.