Video description
Go right to the heart of big data
Find out what happens when cutting-edge data science and new business fundamentals intersect. With this complete video compilation, you’ll be on hand for every presentation (keynotes, tutorials, and workshops alike) held at the Strata + Hadoop World conference in San Jose, California, in February 2015.
In ten tracks, this year’s conference captured the most challenging problems and compelling opportunities in data today, including:
- Business & Industry: How organizations of all sizes use data to make better decisions
- Connected World: Navigating in an always-connected, always-on world
- Data Science: Everything from the latest algorithms and advances in machine learning to cultural change and team-building
- Design & Interfaces: Capturing user experience, design, new interfaces, and visualization
- Hadoop & Beyond: How tools like Cassandra, Storm, Accumulo, Kafka and Spark fit in the data science toolkit
- The Hadoop Platform: A deep dive into the dominant big data stack, with practical lessons and integration tricks
- Hadoop in Action: Real-world case studies of the Hadoop ecosystem in action
- Law, Ethics & Open Data: Issues on governance, ethics, and compliance in the era of open data
- Machine Data: Extracting meaningful insights from data collected and generated by things
- Security: Fighting fraud, detecting threats, increasing trust—and securing data
You also have complete access to other conference events, such as Data-Driven Business Day, Hardcore Data Science Day, and Spark Camp.
Download these videos or stream them through our HD player, and gain a clear perspective on data, including all the analytics, architectures, techniques, tools, and technologies you need to use it successfully.
Table of contents
-
Business & Industry
- Hiding the Elephant - How Big Data Apps Make Magic While Hiding Hadoop - Ross Fubini, Ari Gesher, Wei Zheng, Omer Trajman, and Sylvain Le Borgne
- Pumping Up Retail Profits with Predictive Analytics - Adam Jorgensen
- If You Don't Have Anything Nice to Say, Please Say Something: Increasing Honesty in Airbnb Reviews - Dave Holtz
- Making Big Data Usable in Market Regulation - Scott Donaldson
- WANTED: Women in Data, Tech, and STEM - Moderated by: Cornelia Lévy-Bencheton, Panelists: Michele Chambers, Alice Zheng and Neha Narkhede
- Helping the Republican Party Use Data and Engineering to Win the US Senate - Azarias Reda
- Using Big Data to Identify the World's Top Experts - Nima Sarshar
- The New Data Organization: What do Successful Data-Driven Companies Look Like? - John Haddad
- Architecting for the Cloud - Chris Neumann
- Solving Customer Problems with Big Data across Thomson Reuters - Brian Ulicny
-
Connected World
- Improving Business Operations with Predictive Maintenance and Service - Oliver Mainka
- Forget the Valley: Middle America Is Where Data Is Having Its Biggest Impact - Matt Asay
- Robot Reporters: How The Associated Press Embraced Data Automation - Adam Smith
- Which is More Interesting - Millions of Thermostats, or Millions of Minds in the Internet of Things? - Doug Stein
- Economic Insights from LinkedIn's Professional Network - June Andrews
- Using Data to Help Farmers Feed Growing Populations in a Changing Climate - Stewart Collis
-
Data Science
- Bots Don't Drink Soda: Using Big Data to Find Real People - Michael Brown
- How to Detect Anomalies in High Cardinality Dimensions and Make Them Actionable - Shankar Vedaraman and Christopher Colburn
- Big Data and Design Working Together – When the Magic Happens - George Roumeliotis
- HOWTO Make Your Future Data Scientists Love You - Sasha Laundy
- From Academia to Data Science: Lessons Learned Founding the Insight Data Science Fellows Program - Jake Klamka and Kathy Copic
- The Two Cultures of People Science - Michelangelo D'Agostino
- Pro Bono Data Science in Action - Helping Teens in Crisis - Noelle Sio
- Data Applications: Speed vs Accuracy - Danielle Ben-Gera
- Behavior-driven Machine Translation - Irina Borisova and Asim Mathur
- Playing Nice in the Product Playground: Data Scientists, Engineers, and Product Managers Working Together to Create Innovative Data Products - Anu Tewary, Lucian Lita and Jonathan Goldman
- Machine Learning Building Blocks and the Workload Optimization Framework - Shai Fine
- Robust Event Detection Using Diverse Data Types - Harrison Mebane
- Purposeful Education with Job Market Data for Students, Educators, and Institutions - Jike Chong
- Real-Time Relevance for Mobile at LinkedIn - Michael Conover
-
Design & Interfaces
- Building Interactive Data Visualizations - Jonathan Dinu - Part 1
- Building Interactive Data Visualizations - Jonathan Dinu - Part 2
- Building Interactive Data Visualizations - Jonathan Dinu - Part 3
- Building Interactive Data Visualizations - Jonathan Dinu - Part 4
- The Human-Data Interface: How to Design for “Irrational” Data Consumers - Cathy Tanimura
- Designing Delightful Data Products - Alonzo Canada
- Designing for Data - Etan Lightstone
- Humanizing Data - Building Systems and Interfaces for Domain Experts - Ari Gesher and James Thompson
- Architecting Interfaces that Learn - Tye Rattenbury and Jeffrey Heer
- What Designers and Data Scientists Can Learn from Each Other - Danyel Fisher and Miriah Meyer
- Data (Art ) Science - Eric Colson
- Designing with Data: A Human-centered Approach to Data-driven Design - Arianna McClain and Coe Leta Stafford
-
Hadoop & Beyond
- Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 1
- Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 2
- Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 3
- Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Reza Zadeh - Part 4
- Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Reza Zadeh - Part 5
- Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan and Krishna Sankar - Part 6
- Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan and Christopher Fregly - Part 7
- Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 8
- Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 1
- Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 2
- Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 3
- Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 4
- Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 1
- Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 2
- Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 3
- Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 4
- Going Real-time: Data Collection and Stream Processing with Apache Kafka - Jay Kreps
- Stream Processing Everywhere - What to Use? - Jim Scott
- Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Engine - Richard Williamson
- From MapReduce to Programming Frameworks: Making Sense of Cloud Dataflow, Spark and New Tools for Big Data - Eric Schmidt
- Drill into Drill: How Providing Flexibility and Performance is Possible - Jacques Nadeau
- Three Approaches to Scalable Data Curation - Michael Stonebraker
- One Billion Objects in 2GB: Big Data Analytics on Small Clusters with Doradus OLAP - Randy Guck
- Big Data at Netflix: Faster and Easier - Kurt Brown
- Search Evolved: Unraveling Your Data - Costin Leau
- The Year in Review - Key Changes in the Hadoop Platform in the Past 12 Months - Jairam Ranganathan
- Building Interactive Data Applications at Scale - Fangjin Yang and Vadim Ogievetsky
- YARN vs. MESOS: Can’t We All Just Get Along? - Ted Dunning
-
Hadoop Platform
- Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 1
- Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 2
- Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 3
- Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 4
- Building an Apache Hadoop Data Application - Tom White, Joey Echeverria, and Ryan Blue - Part 1
- Building an Apache Hadoop Data Application - Tom White, Joey Echeverria, and Ryan Blue - Part 2
- Building an Apache Hadoop Data Application - Tom White, Joey Echeverria, and Ryan Blue - Part 3
- Building A Data Platform - Manu Mukerji, Stephen O'Sullivan, and John Akred - Part 1
- Building A Data Platform - Manu Mukerji, Stephen O'Sullivan, and John Akred - Part 2
- Building A Data Platform - Manu Mukerji, Stephen O'Sullivan, and John Akred - Part 3
- Building A Data Platform - Manu Mukerji, Stephen O'Sullivan, and John Akred - Part 4
- Hadoop Puzzlers Reloaded - Aaron Myers and Daniel Templeton
- The Future of Apache Hadoop Security - Joey Echeverria
- Making HBase Accessible to Scientists - Spencer Herath and Aaron Benz
- Data Discovery on Hadoop - Sumeet Singh and Thiruvel Thirumoolan
- Yarns about YARN: Migrating to MapReduce v2 - Kathleen Ting and Miklos Christine
- Maintaining Low Latency while Maximizing Throughput on a Single Cluster - Yuliya Feldman
- Running Production Hadoop Clusters in Docker Containers - Nasser Manesh
- How to use Parquet as a Basis for ETL and Analytics - Julien Le Dem
- Adding Insert, Update, and Delete to Hive - Alan Gates
- Top Ten Pitfalls to Avoid in a SQL-on-Hadoop Implementation - Monte Zweben
-
Hadoop in Action
- The Evolution of Hadoop at Spotify - Through Failures and Pain - Josh Baer and Rafal Wojdyla
- From Source to Solution: Building A System for Machine and Event-Oriented Data - Eric Sammer
- Design Patterns for Real Time Streaming Data Analytics - Sheetal Dolas
- Stock Market Order Flow Reconstruction in HBase on AWS - Tigran Khrimian
- Ticketmaster: Marketing and Selling the World's Tickets - John Carnahan
- Designing Data Architectures for Robust Decision Making - Gwen Shapira
- Friction-Free ETL: Automating Data Transformation with Impala - Marcel Kornacker
- The Truth About MapReduce Performance on SSDs - Yanpei Chen and Karthik Kambatla
- Hadoop as a Platform for Genomics - Allen Day and Sungwook Yoon
-
Law, Ethics & Open Data
-
Machine Data / IoT
- Transformational Case Studies in Machine Data Telemetry - Chad Meley and John Kreisa
- TSAR (the TimeSeries AggregatoR) - How to Count Tens of Billions of Daily Events in Real Time Using Open Source Technologies - Anirudh Todi
- An Open Source Approach to Gathering and Analyzing Device Sourced Health Data - Ian Eslick
- Building Adaptive Apps with APIs and Data - Anant Jhingran
- Dynamic Events in Massive Data Streams, from Astrophysics to Marketing Automation - Kirk Borne
- Forecasting Space-time Events - Jeremy Heffner
- The IoT P2P Backbone - Bruno Fernandez-Ruiz
- The Sushi Principle: Raw Data Is Better - Joseph Adler and Robert Johnson
- Practical Methods for Identifying Anomalies That Matter in Large Datasets - Robert Grossman
- Streaming Analytics: It’s Not The Same Game - Subutai Ahmad
- Machine Learning For Oil Exploration - Ben Hamner
-
Security
- Data Science vs. The Bad Guys: Using Data to Defend LinkedIn Against Fraud and Abuse - David Freeman
- How to Ensure Your Hadoop Installation is Not the Next Big Data Breach - Terence Spies
- Securing the New Wearable World - Gary Davis
- The Physics of Apache Hadoop: Choosing the Right Hardware and OS Configuration Mix for Your Workloads - Woody Christy, Steve Anderson, Patrick Schots and Floris Grandvarlet
-
Enterprise Adoption
- Database History from Codd to Brewer and Beyond - Douglas Turnbull
- Ideal Platform for Managing Log Data: Search or SQL? - Vinayak Borkar
- Getting Started with Data Governance: Paths Converge from Multiple Starting Points - Paula Wiles Sigmon
- Don’t Let Today’s Demands Kill Tomorrow’s Workforce! - Martin Waterhouse
-
Spark in Action
- Lessons from Running Large Scale Spark Workloads - Reynold Xin and Matei Zaharia
- Introducing Hive's New Execution Engine - Spark - Xuefu Zhang and Chengxiang Li
- Machine Learning with H2O and Spark - Cliff Click and Michal Malohlava
- Spark Streaming - The State of the Union, and Beyond - Tathagata Das
- Why Spark Is the Next Top (Compute) Model - Dean Wampler
- Tuning and Debugging in Apache Spark - Patrick Wendell
- Everyday I’m Shuffling - Tips for Writing Better Spark Programs - Vida Ha and Holden Karau
-
Hardcore Data Science
- Beyond DNNs towards New Architectures for Deep Learning, with Applications to Large Vocabulary Continuous Speech Recognition - Tara Sainath
- On the Computational and Statistical Interface and "Big Data" - Michael Jordan
- Interpretable Machine Learning in Practice - Maya Gupta
- Visual Understanding Beyond Naming - Alyosha Efros
- Finding Repeated Structure in Time Series Data: Commercial and Scientific Opportunities - Eamonn Keogh
- Tensor Methods for Large-scale Unsupervised Learning: Applications to Topic and Community Modeling - Anima Anandkumar
- A Quest for Visual Intelligence in Computers - Fei-Fei Li
- Graph Mining for Log Data - David Andrzejewski
- Why Julia's Important for Data Science - John Myles White
- Drugs, DNA, and Dinosaurs: Building High Quality Knowledge Bases with DeepDive - Chris Re
-
Data-Driven Business Day
- Don't Let Data Get in the Way of a Good Story - Mark Madsen
- Big Data Stories: Decisions That Drive Successful Projects - Ellen Friedman
- Making Business Model Innovation More of a (Data) Science - Jerry Overton
- Data "Driven" is Really Data "Accessible" - Ann Johnson
- When Ones and Zeros Can Put Billions at Risk... - Anne Johnson
- Find the Business in Your Data - Arnab Chakraborty, Dr. Alexander Prinz, Reena Tiwari and Anne Johnson
- Tech Magic: 10 Disruptors Shaping the Sensed World - Leah Hunter
- Leveraging Big Data and Data Science in Upstream Oil and Gas Industry - Satyam Priyadarshy
- Using Data from Many Streams to Drive Social Impact - India Swearingen
- Smartphone Data: Tell the Story of People's Lives - Joerg Blumtritt
- Big Data Impacts Marketing Productivity at Cisco - Reena Tiwari
- National Drug Index: Revealing Prescription Inflation in the US - AJ Loiacono
- Digital Business Era: Stretch Your Boundaries - Prith Banerjee
- Data Products and the Wearables Revolution - Emi Nomura
- Unlocking the Data in Paper: A Case Study of New York Life - Kuang Chen
-
R Day
- An Easy System for Data Wrangling With tidyr and dplyr - Garrett Grolemund - Part 1
- An Easy System for Data Wrangling With tidyr and dplyr - Garrett Grolemund - Part 2
- A Reactive Grammar of Graphics with ggvis - Winston Chang
- Reproducible R Reports with R Markdown - Garrett Grolemund - Part 1
- Reproducible R Reports with R Markdown - Garrett Grolemund - Part 2
- Analytic Web Applications with Shiny - Winston Chang - Part 1
- Analytic Web Applications with Shiny - Winston Chang - Part 2
-
PyData
- Machine Learning with scikit-learn - Andreas Mueller - Part 1
- Machine Learning with scikit-learn - Andreas Mueller - Part 2
- Slicing Through Data with NumPy - Jennifer Klay - Part 1
- Slicing Through Data with NumPy - Jennifer Klay - Part 2
- Intro to Numba and Performance Python - Travis Oliphant - Part 1
- Intro to Numba and Performance Python - Travis Oliphant - Part 2
- Python Data Applications with Blaze and Bokeh - Andy Terrel and Matthew Rocklin - Part 1
- Python Data Applications with Blaze and Bokeh - Andy Terrel and Matthew Rocklin - Part 2
- Analytics Beyond the Basics with pandas and SQL - Wes McKinney - Part 1
- Analytics Beyond the Basics with pandas and SQL - Wes McKinney - Part 2
-
Large-scale Machine Learning Day
- Large-scale Machine Learning Day - Yucheng Low - Part 2
- Large-scale Machine Learning Day - Yucheng Low - Part 3
- Large-scale Machine Learning Day - Alice Zheng - Part 4
- Large-scale Machine Learning Day - Chris DuBois - Part 5
- Large-scale Machine Learning Day - Alice Zheng - Part 6
- Large-scale Machine Learning Day - Shawn Scully - Part 7
-
Sponsored
- Bringing OLAP Fully Online: Analyze Changing Datasets in MemSQL and Spark with Pinterest Demo - Eric Frenkiel
- From Domain-specific Solutions to an Open Platform Architecture for Big Data Analytics Based on Hadoop and Spark - Vin Sharma and Jason (Jinquan) Dai
- SAS Analytic Solutions Running on a Hadoop Cluster using YARN - James Kochuba
- Global Hadoop: Storage and Compute Challenges in Multi-Data Center Deployments - Jagane Sundar
- SQL in Hadoop: To Boldly Go where No Data Warehouse has Gone Before - Emma McGrattan
- A Simple, Fast Approach to Analytics for Big Data/IoT with kdb+ - Fintan Quill and Doug Talbott
- Scalable Realtime Analytics with declarative SQL like Complex Event Processing Scripts - Srinath Perera
- The Data Unification Imperative - Andy Palmer
- From Monitoring To Monetization With The Data Lake - Bill Schmarzo
- Breaking Through the Top 5 Enterprise Data Quality Roadblocks Inside Hadoop - George Corugedo
- Data Dexterity: Immediate Visibility Into All Information - Greg Goldsmith
- Extreme Sports and Beyond: Exploring a New Frontier in Data - Josh Byrd and Darren Chinen
- Cloud Machine Learning - Joseph Sirosh
- Credit Suisse Puts Vendors in the Hot Seat on Data Quality and Governance - Nitesh Ambastha, David Brewster and Nenshad Bardoliwalla
- Hive on Spark is Blazing Fast... Or Is It? - Carter Shanklin and Mostafa Mokhtar
- Tackling the World’s Biggest Data: Human Data - Richard Caudle
- Case Study: Data Warehousing in the Cloud with Snowflake at Kixeye - Jon Bock
- PostgreSQL Rising: The Other Elephant in the Room - Ozgun Erdogan
- Your First Big Data Application on AWS - Rahul Pathak
- Smart Enterprise Big Data Bus for the Modern Responsive Enterprise - Anand Venugopal
- Driving Better Business Results at Allstate with Machine Learning on Hadoop - Ryan Michaluk and Alexander Gray
- Big Data Architectural Pattern - Clint Sharp
- Perform Fast Analytics on Hadoop Data Scalable Predictive Analytics with Open Innovations from HP Vertica - Steve Sarsfield and Sunil Venkayala
- Running Hadoop-as-a-Service in the Cloud - Lance Olson
- Real World Use Cases: Hadoop and NoSQL in Production - Ted Dunning and Ellen Friedman
-
Keynotes
- Hadoop's Impact on the Future of Data Management - Amr Awadallah
- Close Encounters with the Third Kind of Database - Eric Frenkiel
- Impacting Business as it Happens - Anil Gadre
- A Bigger Lens Through which to View the World - the IBM Twitter Alliance - Adam Kocoloski
- Data Science: Where are We Going? - DJ Patil
- The Emerging Age of Data-Driven Policy Design: Examples from Trying to Manage the Global Climate - Solomon Hsiang
- Data: Open for Good and Secure by Default - Eddie Garcia
- Year Zero: How We’ll Run Our Lives in Ten Years’ Time - Alistair Croll
- Intel and the Role of Open Source in Delivering on the Promise of Big Data - Michael Greene
- Big Data Lessons from Our Cybernetic Past - Eden Medina
- New Directions for Spark in 2015 - Matei Zaharia
- A New Approach to Big Data - Roman Shaposhnik
- Charting a Path Forward: The Future of Data Visualization - Jeffrey Heer
- Connected Cows? - Joseph Sirosh
- Startup Showcase Winner Announcement
-
Solutions Showcase Theater
- The Briefcase Cluster - Enabling Big Data Everywhere - Jim Scott
- Why Event Analytics Matter - Rohit Shrivastava
- Cracking the Data Conundrum - Steffin Harris
- Smart Data for Smarter Utilities - Irshad Raihan
- The Value of Churn Analytics at Cisco - Ivan Chen and Phil Hodsdon
- Big Data Governance - Felix Van de Maele
- Early Warnings for Customer Churn at a Leading Cloud Technology Firm! - Umair Rauf
- Harnessing Big Social Data to Deliver Human Data Intelligence - Jason Rose
- Operationalizing Hadoop – Are You Ready? - Valerie Fowler
- Multimedia Giant Turns Big Data into Real-Time Customer Insights - Brian Garrett
- Data Wrangling in the Wild - Sean Ma
- StreamAnalytix - Developing Enterprise Class, Real-time Streaming Applications on Apache Storm - Anand Venugopal
- Gaining Value From Data Where It's Born - Ryan Peterson
- Build a Foundation for Self-Service Data Prep, Analytics, and Governance - Oliver Claude
- Connecting the Big-Data Driven Enterprise in Online Retail - Ashley Stirrup
- Leading Telecommunications Company Uses BlueData to Spin Up Local, On Demand Hadoop and Spark Clusters to Enable Agile Deployment of Big Data Tools and Technologies - Nanda Vijaydev
- Taming Data Variety: Intelligent Solutions Using Machine Learning and Expert Crowdsourcing - Alan Wagner
- Everything You Need To Know About HBase in 10 Minutes or Less - Alex Newman
- The Emergence of the Data Refinery - Chuck Yarbrough
- Big Data Cluster Planning and Optimization Using Wolf Island Simulation Technology - Laurent Isenegger
- Prosthetic Implant Surgery - Where Big Data Means Big Savings - Rola Shaar
- Close the Skills Gap and Deliver Rapid Business Value with Big Data Apps - Manan Goel
- Distributed R - Scaling the R Language for Even Bigger Data - Sunil Venkayala
- Transforming Big Data Landscape with Apache Spark - Rishi Yadav
- Data Warehousing in the Cloud - Jon Bock
- Proactive Product Intelligence for Electronics - Rami Lokas
- Massive-Scale Security Incident Response Leveraging a Hadoop Architecture - Michael A. Davis
- Don’t be a Hadoop Breach Headline - Discovery and Sensitive Data in Hadoop - Jeremy Stieglitz
- Big Data vs. Climate Change - Srivatsan Ramanujam and John Cardente
- ZEAS – Enabling anyone to create Hadoop Enterprise applications fast using a GUI - Aditya Agrawal
- Power Tools for Big Data Analytics - Dan Steinberg
- Big Data on OpenStack - Kirk Lewis and Frank Rego
- Fighting ATM Fraud in Real Time with Hadoop Analytics - Christy Maver
- Scale Big Data cost down, while scaling performance out. An NTT mobile personalization retrospective, re-thinking the Big Data solution stack. - Robert Greene
- Dato Enables Large-Scale Deduplication at Zillow using GraphLab Create - Rajat Arya
- To Catch a Thief with Big Data - Kevin Petrie
- Jump into the Data Lake with Hadoop-Scale Data Integration - Greg Benson
- Predicting The Future To Improve Customer Satisfaction - Joe Rossi
- The Practical, Profitable Magic of Prescriptive Analytics - Andy Flint
- Changing the Culture Around Data: Empowering More People with Analytics - Gary Cottrell
- How Havas Media Found New Revenue Streams with UNIFi Software - Sean Keenan
- What Enterprises Can Learn From Real-Time Bidding - Peter Corless
- Big Data and the Data Quality Imperative - Ed Wrazen
- Tapjoy Scales and Saves Costs with Riak - Tom Sigler
- Smart Execution: How to Optimize Performance by Intelligently Leveraging Multiple Hadoop Analytics Engines - Matt Schumpert
- Jagex Game Studio Case Study - Gregory McPhee
- Supercharge Sqoop with magical JDBC drivers - Sumit Sarkar
- Big Data Analytics: Diverse Use Cases, Diverse Architectures - Ben Conners
- Accelerate your data with SequoiaDB - Tao Wang
- Building reliable Hadoop clusters with two copies - Iyer Venkatesan
Product information
- Title: Strata + Hadoop World San Jose 2015: Video Compilation
- Author(s):
- Release date: March 2015
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781491924143