Chapter 3: Creating and Using SQL Server 2019 Big Data Clusters

In this chapter, you will learn how to prepare and deploy a SQL Server 2019 Big Data Cluster. You will learn how Big Data Clusters can be used in data management operations, implemented as SQL Server Integration Services (SSIS) solutions. We will show you how to use SSIS to load data into a Big Data Cluster Hadoop Distributed File System (HDFS) file store and how to perform basic data processing operations on the Spark platform, and we will finish the chapter by demonstrating how to retrieve data from a Big Data Cluster, to be used in a database hosted locally.

After you have created and deployed your new Big Data Cluster instance in Azure, we are also going to show you how to efficiently ...

Get ETL with Azure Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.