O'Reilly logo

Hadoop for Finance Essentials by Rajiv Tiwari

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

The data lake

Data lake is undoubtedly one of the most popular architecture patterns to land all types of data at a single place. The key points are:

  • Combine the power of traditional RDBMS with Hadoop to process data
  • Use traditional data RDBMS to process low-volume high-value data
  • Use Hadoop for high-volume and new types of data sources—semistructured and unstructured data sources, such as legal documents, e-mails, web data, and machine log data

The following screenshot is from the Hortonworks website and shows how a co-existing traditional RDBMS and Hadoop provides a good balance to process all types of data:

The data lake

The analytics and visualization of data ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required