© Harsh Chawla and Pankaj Khattar 2020
H. Chawla, P. KhattarData Lake Analytics on Microsoft Azurehttps://doi.org/10.1007/978-1-4842-6252-8_6

6. Data Preparation and Training Part I

Harsh Chawla1   and Pankaj Khattar2
(1)
Bengaluru, India
(2)
Delhi, India
 
The data preparation and training phase is the most important phase of the data analytics solution. During this phase, data ingested from various sources is merged and crunched together (Figure 6-1). The transformed data further gets infused with machine learning models or is sent to the model and serve phase. The entire data journey is planned, based on the target use case. This phase has been split into two chapters. In this chapter, the discussion is on the various technologies that are applicable ...

Get Data Lake Analytics on Microsoft Azure: A Practitioner's Guide to Big Data Engineering now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.