SQL Server 2019 Big Data Clusters Crash Course: Installing and Using a Big Data Cluster for Data Analysis

by Buck Woody

Released May 2020

Publisher(s): Apress

ISBN: 978148426021

Start your free trial

Video description

Learn the architecture and implementation of Microsoft’s latest SQL Server capability - Big Data Clusters. Combine SQL Server for data, PolyBase for data virtualization, Data Pools for scale-out data processing, and Spark with HDFS for scalable non-relational data processing and analysis.

You’ll begin this video with a quick overview of the big data space, along with an easy-to-understand introduction to containers, Kubernetes, Spark, and HDFS. You’ll put those pieces together by planning and deploying your own cluster. Then it’s on to operationalizing the cluster through problem-solving with Transact-SQL, PolyBase, data processing at scale, and working with Spark and HDFS.

What You Will Learn

Understand big data technologies such as Spark and HDFS
Deploy a Big Data Cluster on a Kubernetes environment in Azure
Implement PolyBase queries to bring in data from external sources
Run Transact-SQL statements and process relational data at scale
Execute Spark jobs and machine learning from within SQL Server 2019
Implement basic security protocols to protect your data from theft

Who This Video Is For

Database professionals who are curious about SQL Server 2019’s flagship feature. For those who want to get started in querying and analyzing massive amounts of data using Spark and HDFS clusters while taking advantage of their SQL Server investment.

Overview 00:02:10
The Big Data Landscape 00:09:21
Architecture and Installation 00:12:16
Data Ingestion and Transact-SQL 00:07:50
Data Virtualization with PolyBase 00:09:38
Creating a Data Mart with Scaled Relational Data 00:05:28
Running Spark Jobs in Big Data Clusters 00:10:07
Security Considerations 00:05:01
How to Keep Learning 00:01:04
Wrap Out 00:00:27

Product information

Title: SQL Server 2019 Big Data Clusters Crash Course: Installing and Using a Big Data Cluster for Data Analysis
Author(s): Buck Woody
Release date: May 2020
Publisher(s): Apress
ISBN: 978148426021

video