Strata Conference New York + Hadoop World 2014: Video Compilation
November 2014
Beginner to intermediate
142h 15m
English
Closed Captioning available in German, English, Spanish, French, Japanese, Korean, Portuguese (Portugal, Brazil), Chinese (Simplified), Chinese (Traditional)
Course outline
Business & Industry 14h 30m
Building Privacy Protected Data Systems - Ari Gesher, John Grant, and Courtney Bowman - Part 1 49m 28s
Building Privacy Protected Data Systems - Ari Gesher, John Grant, and Courtney Bowman - Part 2 56m 4s
Building Privacy Protected Data Systems - Ari Gesher, John Grant, and Courtney Bowman - Part 3 39m 44s
Building Privacy Protected Data Systems - Ari Gesher, John Grant, and Courtney Bowman - Part 4 46m 44s
Just Enough Math - Paco Nathan and Allen Day - Part 1 41m 36s
Just Enough Math - Paco Nathan and Allen Day - Part 2 44m 25s
Just Enough Math - Paco Nathan and Allen Day - Part 3 51m 45s
Just Enough Math - Paco Nathan and Allen Day - Part 4 52m 30s
Solving the Right Problem - Max Shron and Sasha Laundy 43m 22s
Transforming to a Data Driven Operations Model - Denise Asplund 37m 49s
From Experiments to Insights at Pinterest - Andrea Burbank 38m 5s
Case Study: A Forensic Look at Success and Failure of Predictive Analytics in Healthcare - Eugene Kolker 31m 52s
The Open Data 500: Building Businesses on Free Government Data - Joel Gurin and Laura Manley 34m 51s
Decided by Data: Case Studies from a Data Driven Product Culture - Nellwyn Thomas 44m 3s
Preemptive Shipping: How Gilt Predicts Which Customers Will Buy Products It Has Never Sold Before - Igor Elbert 45m 11s
What are VCs Really Looking For? - Michael Dauber, Renee DiResta, Matt Turck, James Cham, and Jake Flomenberg 42m 27s
PDF Prison Break: Freeing Data, Empowering Experts at Edmunds.com - John Akred and Karim Qazi 44m 37s
Fashioning Fit: Determining Fit Through Data - Liza Kindred, David Whittemore, Gina Mancuso, and Rasmus Thofte 41m 29s
From Runway to Database, the Season's Hottest Fashion: Data - Rachel Kalmar 41m 18s
How Public Data Creates Revenue for a Scandinavian Retailer - Majken Sander 43m 5s
Data Science at the Command Line - Jeroen Janssens - Part 1 44m 54s
Data Science at the Command Line - Jeroen Janssens - Part 2 36m 16s
Data Science at the Command Line - Jeroen Janssens - Part 3 41m 5s
Data Science at the Command Line - Jeroen Janssens - Part 4 46m 5s
Becoming a Scalable Data Scientist - Alice Zheng 1h 6m 30s
All the Data and Still Not Enough! - Claudia Perlich 41m 37s
The Great Debate: If You Can't Code, You Can't Be a Data Scientist - Joseph Adler, Hilary Mason, Scott Nicholson, Lucian Lita, and Roger Magoulas 37m 58s
Data Science Bootcamp - Laurie Skelly 41m 24s
The Day Zach Galifianakis Saved Healthcare - Chris Harland 33m 33s
Computing Professional Identity for the Economic Graph - Vitaly Gordon 42m 52s
Multi-language Data Science with IPython, IJulia, IR, and Friends - Brian Granger and Fernando Pérez 40m 58s
Using Data Science on Internet Search Behavior as a Proxy for Human Behavior - Juan Miguel Lavista 26m 21s
AI in 2014: Progress and Problems - Beau Cronin 40m 32s
Big Data Anti-Patterns - Douglas Moore 39m 24s
Machine Learning system architecture – Microsoft Translator, a Case Study - Vishal Chowdhary 37m 28s
Secure Machine Learning - Bahman Bahmani 40m 10s
Fashioning Data: The Balance Between Creativity and Data-Driven Decisions - Karen Moon, Vijay Subramanian, and Liza Kindred 40m 3s
Distributed Gradient Boosting Machine - Cliff Click 38m 10s
Deploying and Evaluating Data Products - Josh Levy 28m 11s
Design & Interfaces 5h 59m
Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin and Helena Edelson - Part 1 37m 45s
Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin and Helena Edelson - Part 2 43m 2s
Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin and Helena Edelson - Part 3 50m 56s
Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin and Helena Edelson - Part 4 43m 11s
Tackling Data Curation in Three Generations - Michael Stonebraker 40m 20s
Advantages of a Domain-Specific Language Approach to Data Transformation - Joe Hellerstein and Sean Kandel 43m 41s
Stories from the Trenches: The Challenges of Building an Analytics Stack - Fangjin Yang and Xavier Léauté 36m 57s
Tachyon: A Memory Centric Storage System for Big Data Computing - Haoyuan Li 40m 8s
Anomaly Detection with Apache Spark - Sean Owen 32m 56s
Mixing Structured Data and Analytics with Spark SQL - Michael Armbrust 51m 54s
Interactive Visual Data Exploration with Spark - Hossein Falaki 40m 12s
Open Source Real Time BI using Storm, Hadoop, Titan, Druid & D3 - Anil Madan 50m 36s
Highly Scalable Tile-Based Visualization for Exploratory Data Analysis - David Jonker and Rob Harper 37m 35s
Building A Data Platform - Stephen O'Sullivan, John Akred, and Richard Williamson - Part 1 49m 32s
Building A Data Platform - Stephen O'Sullivan, John Akred, and Richard Williamson - Part 2 38m 49s
Building A Data Platform - Stephen O'Sullivan, John Akred, and Richard Williamson - Part 3 46m 22s
Building A Data Platform - Stephen O'Sullivan, John Akred, and Richard Williamson - Part 4 43m 41s
From Raw Data to Analytics with No ETL - Marcel Kornacker and Lenni Kuff 40m 27s
SQL on Everything, in Memory - Julian Hyde 40m 6s
From Oracle to Hadoop - Guy Harrison, David Robson, and Kathleen Ting 37m 59s
Hive on Apache Tez: Benchmarked at Yahoo! Scale - Mithun Radhakrishnan 45m 52s
Scaling Storm: Cluster Sizing and Performance Optimization - P. Taylor Goetz 39m 46s
Building Real-time Data Products at LinkedIn with Apache Samza - Martin Kleppmann 49m 42s
HBase: Where Online Meets Low Latency - Nick Dimiduk and Nicolas Liochon 36m 3s
Apache HBase Application Archetypes - Jonathan Hsieh and Lars George 48m 29s
Hadoop Operations - Best Practices from the Field - Chris Nauroth and Suresh Srinivas 40m 32s
Resource Management with YARN - Anubhav Dhoot 40m 2s
Bulk Loading Your Big Data into Apache HBase, a Full Walkthrough - Jean-Daniel Cryans 35m 57s
An Independent Comparison of Open Source SQL-on-Hadoop - Greg Rahn 41m 42s
Bringing PyData to Impala - Uri Laserson 28m 45s
Architectural Considerations for Hadoop Applications - Mark Grover, Jonathan Seidman, Gwen Shapira, and Ted Malaska - Part 1 36m 6s
Architectural Considerations for Hadoop Applications - Mark Grover, Jonathan Seidman, Gwen Shapira, and Ted Malaska - Part 2 50m 19s
Architectural Considerations for Hadoop Applications - Mark Grover, Jonathan Seidman, Gwen Shapira, and Ted Malaska - Part 3 42m 30s
Architectural Considerations for Hadoop Applications - Mark Grover, Jonathan Seidman, Gwen Shapira, and Ted Malaska - Part 4 42m 38s
Getting Started with HBase Application Development - Sridhar Reddy and Carol McDonald - Part 1 39m 35s
Getting Started with HBase Application Development - Sridhar Reddy and Carol McDonald - Part 2 35m 4s
Getting Started with HBase Application Development - Sridhar Reddy and Carol McDonald - Part 3 51m 47s
Getting Started with HBase Application Development - Sridhar Reddy and Carol McDonald - Part 4 56m 50s
How Goldman Sachs is Using Knowledge to Create an Information Edge - Peter Ferns 16m 54s
Customer Intelligence: Harnessing Elephants at Transamerica - Stephen Lloyd, Vishal Bamba, and David Beaudoin 42m 36s
Transitioning from Original Big Data to the New Big Data: L.L.Bean’s Journey - Chris Wilson and Doug Bryan 42m 36s
Unlocking Big Data at CERN - Matthias Braeger and Manish Devgan 41m 13s
Big Data Modeling: How FICO is Turning DBAs and into Data Engineers - Lelanie Moll, Deb Brooks, and Silaphet Mounkhaty 39m 34s
How LinkedIn Democratizes Big Data Visualization - Praveen Neppalli Naga, Chi-Yi Kuan, and Jonathan Wu 40m 21s
Better Care with Big Data: A Panel Discussion - Ryan Goldman, Ryan Brush, Sabrina Dahlgren, Aashima Gupta, and Michael Thompson 38m 28s
Renaissance in Medicine: Next-Generation Big Data Workloads - Allen Day 40m 4s
Image Processing on Hadoop - Ailey Crow 39m 18s
The Next Generation of Big Data in the Cloud - Daniel Weeks 41m 17s
Building an Enterprise Data Hub to Bridge the Gap Between Business and IT - Sabrina Dahlgren and Rajiv Synghal 37m 42s
Law, Ethics & Open Data 3h 16m
Enterprise Adoption 1h 48m
Hardcore Data Science 5h 6m
Data-Driven Business Day 5h 27m
Industrial Internet 5h 46m
Got the T-shirt: Real Experiences from a Hadoop Veteran - Jim Scott 43m 44s
See the Fastest Spark-Powered Disparate Data Blending & Analysis Solution - Vaibhav Nivargi 35m 12s
Disrupting the Traditional Analyst Workflow with Platfora and Spark - Peter Schlampp and Ed Smith 40m 15s
Big Data Architectural Patterns - Todd Papaioannou 39m 38s
An End-to-End Approach to Offloading the Data Warehouse with Hadoop - Jorge A Lopez 35m 10s
Global Hadoop: Storage and Compute Challenges in Multi-Data Center Deployments - Jagane Sundar and Brett Rudenstein 39m 59s
Using Graph to Discover Unseen Relationships in Big Data - Mike Hoskins 42m 43s
Hadoop Effortlessly: A Data Inventory is Key to Data Self-service - Moderated by: Alex Gorelik - Panelists: Suresh Srinivas, Mike Sutten, John Mount, Clark Farrey, and Sunil Soares 46m 35s
Building Real-Time Platforms with MemSQL and Apache Spark - Eric Frenkiel 31m 58s
Unlocking Hadoop’s Potential with YARN - Sanjay Radia 41m 21s
Real-time streaming and analytics with Amazon Elastic MapReduce and Amazon Kinesis - Steve McPherson 33m 0s
NoSQL Solutions for Big Data Problems - Don Pinto 38m 24s
Big Data SQL and Query Franchising: An Architecture for SQL Beyond Hadoop - Dan McClary 38m 39s
Drive Data Quality at Your Company: Create a Data Lake - George Corugedo 37m 14s
Important Advances in Hadoop: A Panel Discussion - Joey Jablonski, Armando Costa, Jim Burmingham, and Rob Johnson 46m 17s
Cloud Machine Learning - Joseph Sirosh 38m 35s
Embracing Diversity - Sid Sipes 33m 16s
The Art of Prediction: Seamless Visualization and Modeling With Hadoop - Adam Pilz 31m 37s
Extending "Variety" of Data to "Variety" of Users - Tina Groves 36m 38s
How to Architect Big Data Apps with the Lambda Architecture - with Real Work Examples on Merging Batch and Real-Time Processing - Altan Khendup and Ron Bodkin 42m 30s
What do Al Capone & Hadoop Have in Common? Visualizing Data at Scale – Making Sense Out of Big Data - James Dixon 41m 19s
Distributed R - A Scalable and High-performance Platform for R - Sunil Venkayala and Indrajit Roy 39m 6s
Getting Big Data to Work: Agile Data Transformation in Hadoop - Stephanie McReynolds, Xavier Quintuna, Shirshanka Das, Charlie Crocker, and Anna Dorofiyenko 40m 25s
Now Playing at Netflix: Advanced Decision-Making with Hadoop, Starring MicroStrategy - Michael Hiskey 25m 30s
Analytics the Way Nature Intended - Donald Farmer 40m 23s
Western Union: Implementing a Hadoop-based Enterprise Data Hub with Informatica - Pravin Darbare and Sumeet Agrawal 41m 48s
For Red Hat, it's 1994 All Over Again - Sarangan Rangachari 37m 10s
Hadoop Responsibly with Big Data Governance - Moderated by: Barry Devlin - Panelists: Sunil Soares, Joseph Dossantos, and Jay Zaidi 43m 8s
Big Content: Finding the Why Behind the What - Sid Probstein 34m 8s
Solutions Showcase Theater 8h 52m