O'Reilly logo

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Building a Near Real-Time Analytical Application with Kudu

Video Description

Building near real-time analytical applications that combine real-time data inserts, updates, and fast analytics is almost impossible with any single Hadoop storage technology. The introduction of Apache Kudu and the "KIKS" stack breaks through this barrier, making it possible to build near real-time analytical applications that are simple, fast, and reliable. In this course, designed for developers, architects, and engineers with some experience working with common Hadoop components (Kafka, Hive, Spark, Impala, etc.), you'll use "KIKS" to create an app that demonstrates the real-time ingestion, persistence, and visualization of time-series events.

Kudu is at the center of this architecture. It combines real-time inserts, random lookups, and fast analytics into a single storage layer without the need for the complexities of the lambda architecture, making time-series and IOT use-cases much easier to conquer than with previous generation big data technologies. The app you'll build uses real-time financial data, but it also applies to use cases in IOT, retail, manufacturing, and other industries with real-time analytical needs.

  • Gain hands-on experience building a powerful near real-time analytical application
  • Discover how Kudu combines random lookups and fast analytics into a single storage layer
  • See how Kudu eliminates the need for the complexities of lambda architecture
  • Understand how the "KIKS" stack works to make apps that are fast, simple, and reliable
Ryan Bosshart is a Principal Systems Engineer at Cloudera, where he leads a specialized team focused on Hadoop ecosystem storage technologies such as HDFS, Hbase, and Kudu. An architect and builder of large-scale distributed systems since 2006, Ryan is co-chair of the Twin Cities Spark and Hadoop User Group. He speaks about Hadoop technologies at conferences throughout North America and holds a degree in computer science from Augsburg College.