Executing part of a job once for every row in a dataset

Assume that you have a list of things or entities such as students, files, dates, products, and so on. Now, suppose that you want to execute a group of job entries once for every entity in that list.

Suppose that you have a file with a list of names, for example:

name
Paul
Santiago
Lourdes
Anna

For each person, you want to do the following:

  • Generate a file saying hello to that person
  • Wait for 2 seconds
  • Write a message to the log

For a single person, these tasks can be done with a couple of entries. If you have a small known list of entities (persons in this example), you could copy and paste that group of entries, once for each. On the other hand, if the list is long, or you do not know the values ...

Get Pentaho Data Integration Cookbook Second Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.