Comparison of Data Science Algorithms

Classification: Predicting a categorical target variable

AlgorithmDescriptionModelInputOutputProsConsUse Cases
Decision trees Partitions the data into smaller subsets where each subset contains (mostly) responses of one class (either “yes” or “no”) A set of rules to partition a data set based on the values of the different predictors No restrictions on variable type for predictors The label cannot be numeric. It must be categorical Intuitive to explain to nontechnical business users. Normalizing predictors is not necessary Tends to overfit the data. Small changes in input data can yield substantially different trees. Selecting the right parameters can be challenging Marketing segmentation, fraud ...

Get Data Science, 2nd Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.