Getting Started with Greenplum for Big Data Analytics by Sunila Gollapudi


Modeling methods

In the next few sections, we will cover the following important analytical methods in detail:

  • Decision trees (classification)
  • Association rules (unsupervised learning)
  • Linear and logistic regression
  • Naive Bayesian classifier (classification)
  • K-means clustering (unsupervised learning)
  • Text analysis

Decision trees

Decision trees are an example of a classification technique. Here, we classify data in a tree format using data features or attributes. Because decision trees depict the flows and the possible outcomes of each flow, they are used to identify the best strategy for reaching a goal.

In a decision tree, we start by testing an attribute and splitting the data based on that attribute:

  • We continue with the process.
  • We can build multiple decision ...
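The idea of testing an attribute and splitting the data on it can be sketched in a few lines of Python. This is only an illustrative sketch, not the book's implementation: the toy rows, the attribute names, and the helper functions are all hypothetical, and the use of information gain (entropy reduction) to decide which attribute to test first is an assumption, since the text above does not name a splitting criterion.

```python
from collections import Counter, defaultdict
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def split_by_attribute(rows, attribute):
    """Group rows by the value they hold for the given attribute."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[attribute]].append(row)
    return dict(groups)


def information_gain(rows, attribute, target):
    """Entropy of the target before the split minus the weighted entropy after it."""
    before = entropy([row[target] for row in rows])
    groups = split_by_attribute(rows, attribute)
    after = sum(
        (len(group) / len(rows)) * entropy([row[target] for row in group])
        for group in groups.values()
    )
    return before - after


def best_attribute(rows, attributes, target):
    """Pick the attribute whose split reduces entropy the most (assumed criterion)."""
    return max(attributes, key=lambda a: information_gain(rows, a, target))


# Hypothetical toy data set: decide whether to play based on the weather.
rows = [
    {"outlook": "sunny", "windy": "no", "play": "no"},
    {"outlook": "sunny", "windy": "yes", "play": "no"},
    {"outlook": "rainy", "windy": "no", "play": "yes"},
    {"outlook": "rainy", "windy": "yes", "play": "yes"},
]

root = best_attribute(rows, ["outlook", "windy"], "play")
branches = split_by_attribute(rows, root)
```

In this toy data set, splitting on `outlook` separates the two classes completely while splitting on `windy` does not, so `outlook` would be tested at the root. Repeating the same selection on each resulting branch is one way to continue the process.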
