O'Reilly logo

Getting Started with Greenplum for Big Data Analytics by Sunila Gollapudi

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Summary

In this chapter, we have explored various implementation aspects of Greenplum UAP. We started with understanding data loading strategies for Greenplum and HD. We have looked at loading data into Greenplum using internal utilities and functions such as gpload and gpfdist and also using Informatica PowerExchange connector. For HD, we have explored Hive and Greenplum bulk loader utility.

We moved on to take a dive deep into distribution and partitioning aspects of Greenplum along with strategies for querying Greenplum and HD. We have looked at various functions such as ANALYZE and EXPLAIN to optimize the queries and interpretation of query plans. Finally, we have explored some in-database analytics options with Greenplum (using Windows function, ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required