O'Reilly logo

HDInsight Essentials - Second Edition by Rajesh Nadipalli

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 2. Enterprise Data Lake using HDInsight

Current IT architecture uses a Enterprise Data Warehouse (EDW) as the centralized repository that feeds several business data marts to drive business intelligence and data mining systems. With the advent of smart connected devices and social media that generate petabytes of data, these current relational EDWs are not able to scale and meet the business needs. This chapter will discuss how to build a modern data architecture that extends the EDW with the Hadoop ecosystem.

In this chapter, we will cover the following topics:

  • Enterprise Data Warehouse architecture
  • Next generation Hadoop-based Data Lake architecture
  • The journey to your Data Lake dream
  • Tools and technology in the Hadoop ecosystem
  • Use case powered ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required