Time for action – calculating football match statistics by grouping data

Let's continue working with the football matches file. Suppose that you want to use that data to obtain some statistics, for example, the maximum number of goals per match on a given day. To do this, follow these instructions:

  1. Create a new transformation, give it a name and description, and save it.
  2. Using a Text file input step, read the matches.txt file, just as you did in the previous section.
  3. Do a preview to confirm that the step is configured correctly.
  4. Add a Sort rows step to the transformation, and sort the rows by the region and match_date fields in ascending order.
  5. Expand the Statistics category of steps, and drag a Group by step to the canvas. Create a hop from ...
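
The sort-then-group logic in the steps above can be sketched outside the tool in a few lines of Python. This is only an illustration of the idea, not how the transformation is implemented: the sample rows are made up, and the column names home_goals and away_goals are hypothetical (only region and match_date come from the steps above). Like the Group by step, itertools.groupby only merges adjacent rows with the same key, which is why the rows are sorted first.

```python
from itertools import groupby
from operator import itemgetter

# Made-up sample rows for illustration only. The goal column names
# (home_goals, away_goals) are assumptions, not the real matches.txt layout.
matches = [
    {"region": "Europe", "match_date": "2011-09-03", "home_goals": 2, "away_goals": 1},
    {"region": "Asia",   "match_date": "2011-09-03", "home_goals": 3, "away_goals": 2},
    {"region": "Europe", "match_date": "2011-09-04", "home_goals": 1, "away_goals": 1},
    {"region": "Europe", "match_date": "2011-09-03", "home_goals": 0, "away_goals": 4},
    {"region": "Asia",   "match_date": "2011-09-03", "home_goals": 0, "away_goals": 0},
]


def max_goals_per_day(rows):
    """Return {(region, match_date): maximum total goals in a single match}."""
    key = itemgetter("region", "match_date")
    # Sort first (the "Sort rows" idea): groupby only groups adjacent rows.
    ordered = sorted(rows, key=key)
    result = {}
    # Group the sorted rows (the "Group by" idea) and aggregate with max.
    for group_key, group in groupby(ordered, key=key):
        result[group_key] = max(r["home_goals"] + r["away_goals"] for r in group)
    return result


stats = max_goals_per_day(matches)
for (region, match_date), goals in stats.items():
    print(region, match_date, goals)
```

If the sort were skipped, rows for the same region and date that are not adjacent would end up in separate groups, so the aggregate would be computed on fragments of the data rather than on each full day.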
