Chapter 2. Enterprise Data Lake using HDInsight

Current IT architecture uses a Enterprise Data Warehouse (EDW) as the centralized repository that feeds several business data marts to drive business intelligence and data mining systems. With the advent of smart connected devices and social media that generate petabytes of data, these current relational EDWs are not able to scale and meet the business needs. This chapter will discuss how to build a modern data architecture that extends the EDW with the Hadoop ecosystem.

In this chapter, we will cover the following topics:

  • Enterprise Data Warehouse architecture
  • Next generation Hadoop-based Data Lake architecture
  • The journey to your Data Lake dream
  • Tools and technology in the Hadoop ecosystem
  • Use case powered ...

Get HDInsight Essentials - Second Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.