Monadic data transformation

The first step is to define a trait and method that describe the transformation of data by the computation units of a workflow. The data transformation is the foundation of any workflow for processing and classifying a dataset, training and validating a model, and displaying results.

There are two symbolic models used for defining a data processing or data transformation:

  • Explicit model: The developer creates a model explicitly from a set of configuration parameters. Most of deterministic algorithms and unsupervised learning techniques use an explicit model.
  • Implicit model: The developer provides a training set that is a set of labeled observations (observations with an expected outcome). A classifier extracts a model ...

Get Scala for Machine Learning now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.