Getting Started with Greenplum for Big Data Analytics by Sunila Gollapudi

In-database analytics using MADlib

MADlib is an open source library for in-database analytics. It is integrated with the Greenplum database and runs its analytic routines inside the database, so the data does not have to be exported to a separate tool. The approach was first described in the paper MAD Skills: New Analysis Practices for Big Data, presented at VLDB 2009 and available at http://db.cs.berkeley.edu/papers/vldb09-madskills.pdf.

The steps to install the latest version of MADlib are:

  1. Visit http://MADlib.net.
  2. Download the latest release.
  3. Click on the MADlib Wiki link and follow the installation guide for PostgreSQL or Greenplum.
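
Once the installation guide has been followed, a quick check from a SQL client such as psql can confirm that the library is reachable. This is a minimal sketch that assumes MADlib was installed into a schema named madlib; if a different schema was chosen during installation, adjust the prefix accordingly.

```sql
-- Assumption: MADlib lives in the schema "madlib"; change the prefix if not.
-- Returns one text row describing the installed MADlib version.
SELECT madlib.version();
```

If the query fails with an error saying the schema or function does not exist, the library has most likely not been installed in the database the session is connected to.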

Listed are the in-database analytic functions available natively in Greenplum and as MADlib functions ...
