Working with groups of data

In the preceding example, the inner transformation ran once for every row in the list. As mentioned, this is the default behavior. You can also run the inner transformation once per group of rows. For example, you might create a Transformation that processes rows grouped by client number, by product line, or by date. You might also want to process groups of N rows instead of the whole dataset at once.
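To make the two kinds of grouping concrete, here is a minimal sketch in plain Python. It is not PDI code and does not use any PDI API. The function names, the `client` and `amount` fields, and the sample rows are all hypothetical, and the stand-in `inner_transformation` simply sums a field. The sketch also assumes the rows arrive already sorted by the grouping field.

```python
from itertools import groupby, islice
from operator import itemgetter


def batches_of_n(rows, n):
    """Yield consecutive lists of at most n rows."""
    it = iter(rows)
    while True:
        batch = list(islice(it, n))
        if not batch:
            return
        yield batch


def groups_by_field(rows, field):
    """Yield (key, rows) for each run of consecutive rows sharing a field value.

    Assumes the rows are already sorted by that field.
    """
    for key, group in groupby(rows, key=itemgetter(field)):
        yield key, list(group)


def inner_transformation(rows):
    """Hypothetical stand-in for the inner transformation: total the amounts."""
    return sum(row["amount"] for row in rows)


# Hypothetical sample data, sorted by client number.
rows = [
    {"client": 1, "amount": 10},
    {"client": 1, "amount": 5},
    {"client": 2, "amount": 7},
    {"client": 3, "amount": 1},
    {"client": 3, "amount": 2},
]

# One execution per group of N rows (here N = 2).
totals_per_batch = [inner_transformation(b) for b in batches_of_n(rows, 2)]

# One execution per client number.
totals_per_client = {
    key: inner_transformation(group)
    for key, group in groups_by_field(rows, "client")
}

print(totals_per_batch)
print(totals_per_client)
```

In both cases the inner logic is unchanged. Only the way the incoming rows are split into groups differs.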

Once you have the inner transformation ready, you can use a Transformation Executor to execute it with the proper groups of rows. The way you group the rows is configured in the Row grouping tab of the Executor. There are several possibilities.

You can send groups of N rows, ...
