Chapter 2. Enterprise Data Lake using HDInsight

Current IT architectures typically use an Enterprise Data Warehouse (EDW) as the centralized repository that feeds several business data marts to drive business intelligence and data mining systems. With the advent of smart connected devices and social media that generate petabytes of data, these relational EDWs cannot scale to meet business needs. This chapter discusses how to build a modern data architecture that extends the EDW with the Hadoop ecosystem.

In this chapter, we will cover the following topics:

  • Enterprise Data Warehouse architecture
  • Next-generation Hadoop-based Data Lake architecture
  • The journey to your Data Lake dream
  • Tools and technology in the Hadoop ecosystem
  • Use case powered ...
