Chapter 13: Integrating External Tools with Spark SQL

Business intelligence (BI) refers to the capabilities that enable organizations to make informed, data-driven decisions. BI is a combination of data processing capabilities, data visualizations, business analytics, and a set of best practices that enable, refine, and streamline organizations' business processes by helping them in both strategic and tactical decision making. Organizations typically rely on specialist software called BI tools for their BI needs. BI tools combine strategy and technology to gather, analyze, and interpret data from various sources and provide business analytics about the past and present state of a business.

BI tools have traditionally relied on data warehouses ...

Get Essential PySpark for Scalable Data Analytics now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.