The two prior chapters gave us a taste of the new PolyBase capabilities in SQL Server 2019. In this chapter, we will go one step further and integrate with several data sources using PolyBase’s generic ODBC capabilities. We will start with a basic flow that we will follow for each driver and ODBC data source. From there, we will survey different data platform technologies to test PolyBase’s integration capabilities. Our survey will not be exhaustive, but it will cover integration with three technologies. The first two, Apache Spark and Apache Hive, are critical parts of the greater Hadoop ...