Sampling rows

For our final example of processing techniques, we will look at how to extract specific rows from a data flow. For this, the Studio offers the tSampleRow component, which filters rows according to their line numbers. We'll use the same data as in the previous example, country-codes.csv, and because the job is similar to the last one, we'll duplicate that job and modify it.
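
Before building the job, it may help to see the idea of filtering by line number outside the Studio. The following is only a minimal sketch in plain Java, not the code Talend generates: the RowSampler class, its method names, and the "1,3..4,6" range notation (single line numbers and start..end ranges separated by commas) are assumptions made for illustration, and the rows are placeholders rather than the contents of country-codes.csv.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical helper, not part of Talend: keeps the rows whose 1-based
// line number appears in a spec such as "1,3..4,6".
final class RowSampler {

    // Expand a spec like "1,3..4,6" into the set of selected line numbers.
    static Set<Integer> parseLineNumbers(String spec) {
        Set<Integer> selected = new TreeSet<>();
        for (String part : spec.split(",")) {
            String token = part.trim();
            if (token.isEmpty()) {
                continue; // tolerate stray commas
            }
            int dots = token.indexOf("..");
            if (dots < 0) {
                // A single line number, e.g. "6"
                selected.add(Integer.parseInt(token));
            } else {
                // An inclusive range, e.g. "3..4"
                int start = Integer.parseInt(token.substring(0, dots).trim());
                int end = Integer.parseInt(token.substring(dots + 2).trim());
                for (int n = start; n <= end; n++) {
                    selected.add(n);
                }
            }
        }
        return selected;
    }

    // Return only the rows whose line number (counting from 1) is selected.
    static List<String> sample(List<String> rows, String spec) {
        Set<Integer> selected = parseLineNumbers(spec);
        List<String> kept = new ArrayList<>();
        for (int i = 0; i < rows.size(); i++) {
            if (selected.contains(i + 1)) {
                kept.add(rows.get(i));
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // Placeholder rows standing in for the lines of an input file.
        List<String> rows = Arrays.asList(
                "row 1", "row 2", "row 3", "row 4", "row 5", "row 6");
        System.out.println(sample(rows, "1,3..4,6"));
        // prints [row 1, row 3, row 4, row 6]
    }
}
```

The sketch counts lines from 1 and preserves the original row order, so the rows come out in the order they went in regardless of how the spec is written. With that picture in mind, the job itself can be built as follows.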

  1. In the repository, right-click on the FindAndReplace job and select Duplicate.
  2. In the pop-up window, change the name to SampleRow and click on OK.
  3. Double-click on the new job to open it and delete the tReplace component.
  4. In the Palette, search for sample and drag a tSampleRow component onto the Job Designer, in between the delimited ...
