December 2017
Beginner to intermediate
500 pages
12h 10m
English
A Hadoop Distributed File System (HDFS) is a Java-based, distributed, scalable, and portable file system for the Hadoop framework. With PDI, pulling data back out from the Hadoop File System is really easy. In fact, we can treat it just like any other flat file source. Here are the steps for reading a file from Hadoop:
Read now
Unlock full access