Modeling methods

In the next few sections, we will cover the following important analytical methods in detail:

  • Decision trees (classification)
  • Association rules (unsupervised learning)
  • Linear and logistic regression
  • Naive Bayesian classifier (classification)
  • K-means clustering (unsupervised learning)
  • Text analysis.

Decision trees

Decision trees are an example of classification technique. Here, we classify data in a tree format using data features or attributes. Since decision trees depict the flows and possible outcome for each flow, they are used in identifying the best strategy to reach the goal.

In decision trees, we start with testing an attribute and split the data based on that attribute:

  • We continue with the process.
  • We can build multiple decision ...

Get Getting Started with Greenplum for Big Data Analytics now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.