What kinds of platforms have Netflix, LinkedIn, CERN, and PayPal built to handle the big data operations specific to their businesses? And how can you apply some of these solutions to your own? The answers lie in this O’Reilly video collection, taken from live sessions at Strata + Hadoop World 2015 in San Jose, California.
This video collection includes:
Big Data at Netflix: Faster and Easier
Kurt Brown (Netflix)
Learn the technologies that drive the Netflix Data Platform (Hadoop 2, Pig on Tez, Presto on AWS), as well as the motivations behind their architecture and approach, and the benefits that they (and hopefully you) can achieve.
Building Interactive Data Applications at Scale
Fangjin Yang (Metamarkets), Vadim Ogievetsky (Independent)
Find out how to build data applications for visualizing, navigating, and interpreting reams of data, using the facet.js data query framework on the front end and the Druid open source data store on the back end.
Open Source Real Time BI using Storm, Hadoop, Titan, Druid & D3
Anil Madan (PayPal)
Get acquainted with PayPal’s behavioral analytics lineup: Storm and Hadoop in the Real Time Analytics pipeline, Druid as a real-time distributed OLAP metrics store, the D3 visualization framework, and Titan and Gremlin for visitor pathing and funnel analytics.
Building Real-time Data Products at LinkedIn with Apache Samza
Martin Kleppmann (Independent)
Sometimes you need to process data continuously and react to it within a few seconds. Learn how LinkedIn uses Apache Samza to solve real-time data problems, and understand how you can structure real-time data pipelines for scale and flexibility.
An Open Source Approach to Gathering and Analyzing Device Sourced Health Data
Ian Eslick (VitalLabs)
Discover how VitalLabs captures and integrates device-based and other health data for research, using the Switchboard application for routing data and the Trusted Analytic Container (TAC) for consolidating data for analytics.
Ticketmaster: Marketing and Selling the World's Tickets
John Carnahan (Ticketmaster)
Learn about the solutions that Ticketmaster uses for ticket sales and marketing, including Storm for stream processing, trend prediction, and anomaly detection; and Kafka, Storm, and HBase for real-time "n-squared" marketing.
Unlocking Big Data at CERN
Matthias Braeger (CERN), Manish Devgan (Software AG Terracotta)
Unlock the architecture of CERN projects, including C2MON (CERN’s Control & Monitoring Platform), that leverage Hadoop and the Terracotta In-Memory Data Platform to gain real-time insights from sensor data.
Unboxing Data Startups
Michael Abbott (Kleiner Perkins Caufield & Byers)
Investor and entrepreneur Michael Abbott unboxes three startups to look at the technology, architecture, and innovations they’ve harnessed to deliver their products and services.
Table of contents
- Introduction - Building big data platforms at Strata+Hadoop World - Ben Lorica
- Big Data at Netflix: Faster and Easier - Kurt Brown
- Building Interactive Data Applications at Scale - Fangjin Yang and Vadim Ogievetsky
- Open Source Real Time BI using Storm, Hadoop, Titan, Druid & D3 - Anil Madan
- Building Real-time Data Products at LinkedIn with Apache Samza - Martin Kleppmann
- An Open Source Approach to Gathering and Analyzing Device Sourced Health Data - Ian Eslick
- Ticketmaster: Marketing and Selling the World's Tickets - John Carnahan
- Unlocking Big Data at CERN - Matthias Braeger and Manish Devgan
- Unboxing Data Startups - Michael Abbott
Product information
- Title: Building Big Data Platforms
- Release date: June 2015
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781491931035