Getting Started with Greenplum for Big Data Analytics by Sunila Gollapudi

Using MADlib with Greenplum

MAD stands for Magnetic, Agile, and Deep, and lib denotes a library of scalable, parallel, advanced in-database functions. The MADlib version used in the following example is v1.1. The following figure shows the architecture of MADlib:

[Figure: MADlib architecture]

The Greenplum Database extensions for MADlib must be installed on the segment servers of the DCA:

$ pgxn install madlib
$ gppkg -i MADlib

The gppkg utility installs the MADlib extensions on all the Greenplum segment servers in parallel.
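Installing the package only places the library on the hosts. In a typical MADlib setup, the MADlib objects are then deployed into each target database with the madpack utility that ships with MADlib. The following is a minimal sketch of that follow-up step, not a definitive procedure. The user gpadmin, host mdw, port 5432, database testdb, and the madpack location under $GPHOME/madlib/bin are all assumptions and should be adjusted to the actual environment. The script defaults to a dry run that only prints the commands.

```shell
#!/bin/sh
# Sketch: deploy MADlib into one database and verify the deployment.
# Assumed values (not from the text): gpadmin, mdw, 5432, testdb, madpack path.
DB_USER="${DB_USER:-gpadmin}"
DB_HOST="${DB_HOST:-mdw}"
DB_PORT="${DB_PORT:-5432}"
DB_NAME="${DB_NAME:-testdb}"
MADPACK="${MADPACK:-${GPHOME:-/usr/local/greenplum-db}/madlib/bin/madpack}"
DRY_RUN="${DRY_RUN:-1}"   # 1 = print the commands only, 0 = execute them

# Build the user@host:port/database string that madpack expects after -c.
conn_string() {
    printf '%s@%s:%s/%s' "$DB_USER" "$DB_HOST" "$DB_PORT" "$DB_NAME"
}

# Print a command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        printf '+ %s\n' "$*"
    else
        "$@"
    fi
}

deploy_madlib() {
    # Create the MADlib schema and functions in the target database.
    run "$MADPACK" -p greenplum -c "$(conn_string)" install
    # Run the bundled sanity checks against the deployed objects.
    run "$MADPACK" -p greenplum -c "$(conn_string)" install-check
    # Ask the database which MADlib version it reports.
    run psql -h "$DB_HOST" -p "$DB_PORT" -U "$DB_USER" -d "$DB_NAME" \
        -c 'SELECT madlib.version();'
}

deploy_madlib
```

Setting DRY_RUN=0 executes the commands instead of printing them. Keeping the connection details in variables makes it straightforward to repeat the deployment for each database that needs MADlib.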

MADlib-based in-database analytics has been benchmarked against PL/R and found to be superior in scalability and performance, and MADlib is a truly ...