Skip to Content
Data Lake for Enterprises
book

Data Lake for Enterprises

by Vivek Mishra, Tomcy John, Pankaj Misra
May 2017
Beginner to intermediate
596 pages
15h 2m
English
Packt Publishing
Content preview from Data Lake for Enterprises

Advantages of Flume

Some of the core advantages of Apache Flume which made this technology chosen are as detailed here in bullet points:

  • Open source.
  • Very good documentation, with many examples and patterns of how these can be applied, is available.
  • High throughput with low latency.
  • Declarative configuration.
  • Inherently distributed.
  • Highly reliable, available, and scalable (horizontally).
  • Highly extensible and customizable.
  • Less costly installation, operation and maintenance.
  • Contextual routing aspect has a dedication subsection in this chapter. But for you to have a heads-up, this is an aspect of Flume to look at the payload (stream data or event) and construct a routing which is apt.
  • Build-in support for a variety of source and destination ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

The Enterprise Big Data Lake

The Enterprise Big Data Lake

Alex Gorelik
Operationalizing the Data Lake

Operationalizing the Data Lake

Holden Ackerman, Jon King
Data Lakes

Data Lakes

Anne Laurent, Dominique Laurent, Cédrine Madera

Publisher Resources

ISBN: 9781787281349Supplemental Content