Book description
Data professionals are confronting the most disruptive change since relational databases appeared in the 1980s. SQL is still a major tool for data analytics, but conventional relational database management systems can’t handle the increasing size and complexity of today’s datasets. This updated edition teaches you best practices for Greenplum Database, the open source massively parallel processing (MPP) database that accommodates large sets of nonrelational and relational data.
Marshall Presser, field CTO at Pivotal, introduces Greenplum’s approach to data analytics and data-driven decisions, beginning with its shared-nothing architecture. IT managers, developers, data analysts, system architects, and data scientists will all gain from exploring data organization and storage, data loading, running queries, and learning to perform analytics in the database. Discover how MPP and Greenplum will help you go beyond the traditional data warehouse.
This ebook covers:
- Greenplum features, use case examples, and techniques for optimizing use
- Four Greenplum deployment options to help you balance security, cost, and time to usability
- Why each networked node in Greenplum’s architecture includes an independent operating system, memory, and storage
- Additional tools for monitoring, managing, securing, and optimizing query responses in the Pivotal Greenplum commercial database
Table of contents
- Foreword to the Second Edition
- Foreword to the First Edition
- Preface
- 1. Introducing the Greenplum Database
- 2. Whatâs New in Greenplum?
- 3. Deploying Greenplum
- 4. Organizing Data in Greenplum
- 5. Loading Data
- 6. Gaining Analytic Insight
- 7. Monitoring and Managing Greenplum
- 8. Accessing External Data
- 9. Optimizing Query Response
Product information
- Title: Data Warehousing with Greenplum, 2nd Edition
- Author(s):
- Release date: July 2019
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781492058120
You might also like
book
Data Warehousing with Greenplum
Relational databases haven’t gone away, but they are evolving to integrate messy, disjointed unstructured data into …
book
Getting Started with Greenplum for Big Data Analytics
A hands-on guide on how to execute an analytics project from conceptualization to operationalization using Greenplum …
book
SQL Server Big Data Clusters: Data Virtualization, Data Lake, and AI Platform
Use this guide to one of SQL Server 2019’s most impactful features—Big Data Clusters. You will …
book
Simplifying Data Engineering and Analytics with Delta
Explore how Delta brings reliability, performance, and governance to your data lake and all the AI …