Book description
Organizations striving to build applications for streaming data have a new possibility to ponder: the use of ingestion engines at the front end of their Hadoop systems. With this O’Reilly report, you’ll learn how these fast data front ends process data before it reaches the Hadoop Distributed File System (HDFS), and provide intelligence and context in real time. This helps you reduce response times from hours to minutes, or even minutes to seconds.
Author and independent consultant Akmal Chaudhri looks at several popular ingestion engines, including Apache Spark, Apache Storm, and the VoltDB in-memory database. Among them, VoltDB stands out by providing full Atomicity, Consistency, Isolation, and Durability (ACID) support. VoltDB also lets you build a fast data front end using familiar, standard SQL.
Learn the advantages of ingestion engines as well as the theoretical and practical problems that can come up in an implementation. You’ll discover how this option can handle streaming data, provide state, ensure durability, and support transactions and real-time decisions.
Akmal B. Chaudhri is an independent consultant specializing in big data, NoSQL, and NewSQL database technologies. He has previously held roles as a developer, consultant, product strategist, and technical trainer with several blue-chip companies and big data startups. Akmal regularly presents at international conferences and serves on program committees for several major conferences and workshops.
Product information
- Title: Fast Data Front Ends for Hadoop
- Author(s): Akmal B. Chaudhri
- Release date: February 2016
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781491937808