Validating and reporting errors to the log

For this tutorial, we will use the sales_data.csv file that we used in Chapter 4, Reading and Writing Files. This is a simplified version of the file with the same name that comes with the PDI bundle.

The following are sample lines of our file:

ORDERDATE,ORDERNUMBER,ORDERLINENUMBER,PRODUCTCODE,PRODUCTLINE,QUANTITYORDERED,PRICEEACH,SALES2/20/2004 0:00 ,10223,10,S24_4278 ,Planes ,23,74.62,1716.2611/21/2004 0:00,10337,3,S18_4027 ,Classic Cars ,36,100 ,5679.366/16/2003 0:00 ,10131,2,S700_4002,Planes ,26,85.13,2213.387/6/2004 0:00 ,10266,5,S18_1984 ,Classic Cars ,49,100 ,6203.410/16/2004 0:00,10310,4,S24_2972 ,Classic Cars ,33,41.91,1383.0312/4/2004 0:00 ,10353,4,S700_2834,Planes ,48,68.8 ,3302.41/20/2005 ...

Get Learning Pentaho Data Integration 8 CE - Third Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.