O'Reilly logo

Integrating Hadoop by Jake Dolezal, William McKnight

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

6 Unloading/Distributing Data from Hadoop

Hadoop Extracts

In a robust data architecture, Hadoop can serve as a source or hub for data distribution. Some professionals in the space may refer to this as a “data lake.” The concept of a data lake has received much attention in recent years; basically, a data lake is a massive repository for storing the majority of an organization’s data to be used for analytics.

Architecturally and technically, Hadoop can serve this purpose. Whether or not your Hadoop instance is considered a data lake is immaterial. Regardless of the particular arrangement of Hadoop and how it sits in your overall information management architecture, you will undoubtedly encounter cases where data must be extracted from Hadoop ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required