Hadoop Blueprints by Tanmay Deshpande, Anurag Shrivastava

Data Lake business requirements

A data lake is meant to give users access to structured, semi-structured, and unstructured data. The business requirements determine what kind of data is stored in the lake and who has access to it. In the next section, we will examine the business requirements of a company that wants to build a data lake.

Note

Origins of the term data lake

James Dixon, the founder and CTO of Pentaho, coined the term data lake in his blog. He defined the concept as follows: "If you think of a datamart as a store of bottled water - cleansed and packaged and structured for easy consumption - the data lake is a large body of water in a more natural state. The contents of the data ...
