Strata + Hadoop World San Jose 2015: Complete Video Compilation

Video Description

Go right to the heart of big data

Find out what happens when cutting-edge data science and new business fundamentals intersect. With this complete video compilation, you’ll be on hand for every presentation, whether keynote, tutorial, or workshop, held at the Strata Conference + Hadoop World Conference in San Jose, California, in February 2015.

In ten tracks, this year’s conference captured the most challenging problems and compelling opportunities in data today, including:

  • Business & Industry: How organizations of all sizes use data to make better decisions
  • Connected World: Navigating in an always-connected, always-on world
  • Data Science: Everything from the latest algorithms and advances in machine learning to cultural change and team-building
  • Design & Interfaces: Capturing user experience, design, new interfaces, and visualization
  • Hadoop & Beyond: How tools like Cassandra, Storm, Accumulo, Kafka and Spark fit in the data science toolkit
  • The Hadoop Platform: A deep dive into the dominant big data stack, with practical lessons and integration tricks
  • Hadoop in Action: Real-world case studies of the Hadoop ecosystem in action
  • Law, Ethics & Open Data: Issues on governance, ethics, and compliance in the era of open data
  • Machine Data: Extracting meaningful insights from data collected and generated by things
  • Security: Fighting fraud, detecting threats, increasing trust—and securing data

You also have complete access to other conference events, such as Data-Driven Business Day, Hardcore Data Science Day, and Spark Camp.

Download these videos or stream them through our HD player, and gain a clear perspective on data, including all the analytics, architectures, techniques, tools, and technologies you need to use it successfully.

Table of Contents

  1. Business & Industry
    1. Hiding the Elephant - How Big Data Apps Make Magic While Hiding Hadoop - Ross Fubini, Ari Gesher, Wei Zheng, Omer Trajman, and Sylvain Le Borgne 00:39:26
    2. Pumping Up Retail Profits with Predictive Analytics - Adam Jorgensen 00:18:30
    3. If You Don't Have Anything Nice to Say, Please Say Something: Increasing Honesty in Airbnb Reviews - Dave Holtz 00:21:38
    4. Making Big Data Usable in Market Regulation - Scott Donaldson 00:38:57
    5. WANTED: Women in Data, Tech, and STEM - Moderated by: Cornelia Lévy-Bencheton, Panelists: Michele Chambers, Alice Zheng and Neha Narkhede 00:47:29
    6. Helping the Republican Party Use Data and Engineering to Win the US Senate - Azarias Reda 00:35:10
    7. Using Big Data to Identify the World's Top Experts - Nima Sarshar 00:19:32
    8. The New Data Organization: What do Successful Data-Driven Companies Look Like? - John Haddad 00:25:38
    9. Architecting for the Cloud - Chris Neumann 00:26:32
    10. Solving Customer Problems with Big Data across Thomson Reuters - Brian Ulicny 00:37:22
  2. Connected World
    1. Improving Business Operations with Predictive Maintenance and Service - Oliver Mainka 00:39:50
    2. Forget the Valley: Middle America Is Where Data Is Having Its Biggest Impact - Matt Asay 00:18:37
    3. Robot Reporters: How The Associated Press Embraced Data Automation - Adam Smith 00:21:10
    4. Which is More Interesting - Millions of Thermostats, or Millions of Minds in the Internet of Things? - Doug Stein 00:20:25
    5. Economic Insights from LinkedIn's Professional Network - June Andrews 00:19:42
    6. Using Data to Help Farmers Feed Growing Populations in a Changing Climate - Stewart Collis 00:35:08
  3. Data Science
    1. Bots Don't Drink Soda: Using Big Data to Find Real People - Michael Brown 00:18:56
    2. How to Detect Anomalies in High Cardinality Dimensions and Make Them Actionable - Shankar Vedaraman and Christopher Colburn 00:39:26
    3. Big Data and Design Working Together – When the Magic Happens - George Roumeliotis 00:32:32
    4. HOWTO Make Your Future Data Scientists Love You - Sasha Laundy 00:16:10
    5. From Academia to Data Science: Lessons Learned Founding the Insight Data Science Fellows Program - Jake Klamka and Kathy Copic 00:21:13
    6. The Two Cultures of People Science - Michelangelo D'Agostino 00:19:31
    7. Pro Bono Data Science in Action - Helping Teens in Crisis - Noelle Sio 00:21:29
    8. Data Applications: Speed vs Accuracy - Danielle Ben-Gera 00:35:02
    9. Behavior-driven Machine Translation - Irina Borisova and Asim Mathur 00:42:11
    10. Playing Nice in the Product Playground: Data Scientists, Engineers, and Product Managers Working Together to Create Innovative Data Products - Anu Tewary, Lucian Lita and Jonathan Goldman 00:47:16
    11. Machine Learning Building Blocks and the Workload Optimization Framework - Shai Fine 00:30:50
    12. Robust Event Detection Using Diverse Data Types - Harrison Mebane 00:16:38
    13. Purposeful Education with Job Market Data for Students, Educators, and Institutions - Jike Chong 00:26:26
    14. Real-Time Relevance for Mobile at LinkedIn - Michael Conover 00:37:53
  4. Design & Interfaces
    1. Building Interactive Data Visualizations - Jonathan Dinu - Part 1 00:31:35
    2. Building Interactive Data Visualizations - Jonathan Dinu - Part 2 00:28:55
    3. Building Interactive Data Visualizations - Jonathan Dinu - Part 3 00:49:04
    4. Building Interactive Data Visualizations - Jonathan Dinu - Part 4 00:32:04
    5. The Human-Data Interface: How to Design for “Irrational” Data Consumers - Cathy Tanimura 00:40:37
    6. Designing Delightful Data Products - Alonzo Canada 00:30:33
    7. Designing for Data - Etan Lightstone 00:31:03
    8. Humanizing Data - Building Systems and Interfaces for Domain Experts - Ari Gesher and James Thompson 00:41:09
    9. Architecting Interfaces that Learn - Tye Rattenbury and Jeffrey Heer 00:36:44
    10. What Designers and Data Scientists Can Learn from Each Other - Danyel Fisher and Miriah Meyer 00:30:24
    11. Data (Art &) Science - Eric Colson 00:40:36
    12. Designing with Data: A Human-centered Approach to Data-driven Design - Arianna McClain and Coe Leta Stafford 00:36:12
  5. Hadoop & Beyond
    1. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 1 00:57:21
    2. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 2 00:56:00
    3. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 3 00:39:16
    4. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Reza Zadeh - Part 4 00:30:41
    5. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Reza Zadeh - Part 5 00:31:02
    6. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan and Krishna Sankar - Part 6 00:41:55
    7. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan and Christopher Fregly - Part 7 00:36:06
    8. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 8 00:40:58
    9. Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 1 00:56:12
    10. Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 2 00:27:42
    11. Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 3 00:47:55
    12. Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 4 00:36:35
    13. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 1 00:43:38
    14. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 2 00:48:22
    15. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 3 00:50:39
    16. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 4 00:25:52
    17. Going Real-time: Data Collection and Stream Processing with Apache Kafka - Jay Kreps 00:39:29
    18. Stream Processing Everywhere - What to Use? - Jim Scott 00:39:06
    19. Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Engine - Richard Williamson 00:29:17
    20. From MapReduce to Programming Frameworks: Making Sense of Cloud Dataflow, Spark and New Tools for Big Data - Eric Schmidt 00:40:56
    21. Drill into Drill: How Providing Flexibility and Performance is Possible - Jacques Nadeau 00:43:17
    22. Three Approaches to Scalable Data Curation - Michael Stonebraker 00:38:20
    23. One Billion Objects in 2GB: Big Data Analytics on Small Clusters with Doradus OLAP - Randy Guck 00:48:21
    24. Big Data at Netflix: Faster and Easier - Kurt Brown 00:40:27
    25. Search Evolved: Unraveling Your Data - Costin Leau 00:40:39
    26. The Year in Review - Key Changes in the Hadoop Platform in the Past 12 Months - Jairam Ranganathan 00:42:01
    27. Building Interactive Data Applications at Scale - Fangjin Yang and Vadim Ogievetsky 00:42:56
    28. YARN vs. MESOS: Can’t We All Just Get Along? - Ted Dunning 00:40:03
  6. Hadoop Platform
    1. Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 1 00:50:37
    2. Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 2 00:38:39
    3. Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 3 00:57:12
    4. Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 4 00:44:54
    5. Building an Apache Hadoop Data Application - Tom White, Joey Echeverria, and Ryan Blue - Part 1 00:45:38
    6. Building an Apache Hadoop Data Application - Tom White, Joey Echeverria, and Ryan Blue - Part 2 00:28:39
    7. Building an Apache Hadoop Data Application - Tom White, Joey Echeverria, and Ryan Blue - Part 3 00:45:59
    8. Building A Data Platform - Manu Mukerji, Stephen O'Sullivan, and John Akred - Part 1 00:46:36
    9. Building A Data Platform - Manu Mukerji, Stephen O'Sullivan, and John Akred - Part 2 00:39:00
    10. Building A Data Platform - Manu Mukerji, Stephen O'Sullivan, and John Akred - Part 3 00:42:54
    11. Building A Data Platform - Manu Mukerji, Stephen O'Sullivan, and John Akred - Part 4 00:40:41
    12. Hadoop Puzzlers Reloaded - Aaron Myers and Daniel Templeton 00:44:02
    13. The Future of Apache Hadoop Security - Joey Echeverria 00:35:30
    14. Making HBase Accessible to Scientists - Spencer Herath and Aaron Benz 00:40:24
    15. Data Discovery on Hadoop - Sumeet Singh and Thiruvel Thirumoolan 00:39:40
    16. Yarns about YARN: Migrating to MapReduce v2 - Kathleen Ting and Miklos Christine 00:32:48
    17. Maintaining Low Latency while Maximizing Throughput on a Single Cluster - Yuliya Feldman 00:39:47
    18. Running Production Hadoop Clusters in Docker Containers - Nasser Manesh 00:45:44
    19. How to use Parquet as a Basis for ETL and Analytics - Julien Le Dem 00:40:08
    20. Adding Insert, Update, and Delete to Hive - Alan Gates 00:37:52
    21. Top Ten Pitfalls to Avoid in a SQL-on-Hadoop Implementation - Monte Zweben 00:35:05
  7. Hadoop in Action
    1. The Evolution of Hadoop at Spotify - Through Failures and Pain - Josh Baer and Rafal Wojdyla 00:40:03
    2. From Source to Solution: Building A System for Machine and Event-Oriented Data - Eric Sammer 00:41:59
    3. Design Patterns for Real Time Streaming Data Analytics - Sheetal Dolas 00:40:31
    4. Stock Market Order Flow Reconstruction in HBase on AWS - Tigran Khrimian 00:39:09
    5. Ticketmaster: Marketing and Selling the World's Tickets - John Carnahan 00:39:35
    6. Designing Data Architectures for Robust Decision Making - Gwen Shapira 00:38:35
    7. Friction-Free ETL: Automating Data Transformation with Impala - Marcel Kornacker 00:28:47
    8. The Truth About MapReduce Performance on SSDs - Yanpei Chen and Karthik Kambatla 00:37:13
    9. Hadoop as a Platform for Genomics - Allen Day and Sungwook Yoon 00:40:26
  8. Law, Ethics & Open Data
    1. Data Scientists and Lawyers - a Marriage made in Silicon Valley - Laura Fennell and Bill Loconzolo 00:39:07
    2. Big Data Ethics and a Future for Privacy - Jonathan King 00:38:05
    3. How Minority Becomes Majority - A Study of Gerrymandering - Tatsiana Maskalevich 00:39:48
  9. Machine Data / IoT
    1. Transformational Case Studies in Machine Data & Telemetry - Chad Meley and John Kreisa 00:42:18
    2. TSAR (the TimeSeries AggregatoR) - How to Count Tens of Billions of Daily Events in Real Time Using Open Source Technologies - Anirudh Todi 00:41:28
    3. An Open Source Approach to Gathering and Analyzing Device Sourced Health Data - Ian Eslick 00:41:41
    4. Building Adaptive Apps with APIs and Data - Anant Jhingran 00:38:36
    5. Dynamic Events in Massive Data Streams, from Astrophysics to Marketing Automation - Kirk Borne 00:40:06
    6. Forecasting Space-time Events - Jeremy Heffner 00:42:02
    7. The IoT P2P Backbone - Bruno Fernandez-Ruiz 00:27:05
    8. The Sushi Principle: Raw Data Is Better - Joseph Adler and Robert Johnson 00:38:14
    9. Practical Methods for Identifying Anomalies That Matter in Large Datasets - Robert Grossman 00:36:43
    10. Streaming Analytics: It’s Not The Same Game - Subutai Ahmad 00:38:46
    11. Machine Learning For Oil Exploration - Ben Hamner 00:35:30
  10. Security
    1. Data Science vs. The Bad Guys: Using Data to Defend LinkedIn Against Fraud and Abuse - David Freeman 00:29:49
    2. How to Ensure Your Hadoop Installation is Not the Next Big Data Breach - Terence Spies 00:34:17
    3. Securing the New Wearable World - Gary Davis 00:46:04
    4. The Physics of Apache Hadoop: Choosing the Right Hardware and OS Configuration Mix for Your Workloads - Woody Christy, Steve Anderson, Patrick Schots and Floris Grandvarlet 00:49:44
  11. Enterprise Adoption
    1. Database History from Codd to Brewer and Beyond - Douglas Turnbull 00:42:39
    2. Ideal Platform for Managing Log Data: Search or SQL? - Vinayak Borkar 00:43:29
    3. Getting Started with Data Governance: Paths Converge from Multiple Starting Points - Paula Wiles Sigmon 00:40:32
    4. Don’t Let Today’s Demands Kill Tomorrow’s Workforce! - Martin Waterhouse 00:29:29
  12. Spark in Action
    1. Lessons from Running Large Scale Spark Workloads - Reynold Xin and Matei Zaharia 00:38:58
    2. Introducing Hive's New Execution Engine - Spark - Xuefu Zhang and Chengxiang Li 00:40:33
    3. Machine Learning with H2O and Spark - Cliff Click and Michal Malohlava 00:38:56
    4. Spark Streaming - The State of the Union, and Beyond - Tathagata Das 00:36:46
    5. Why Spark Is the Next Top (Compute) Model - Dean Wampler 00:40:38
    6. Tuning and Debugging in Apache Spark - Patrick Wendell 00:45:15
    7. Everyday I’m Shuffling - Tips for Writing Better Spark Programs - Vida Ha and Holden Karau 00:36:24
  13. Hardcore Data Science
    1. Beyond DNNs towards New Architectures for Deep Learning, with Applications to Large Vocabulary Continuous Speech Recognition - Tara Sainath 00:34:05
    2. On the Computational and Statistical Interface and "Big Data" - Michael Jordan 00:48:48
    3. Interpretable Machine Learning in Practice - Maya Gupta 00:26:24
    4. Visual Understanding Beyond Naming - Alyosha Efros 00:36:31
    5. Finding Repeated Structure in Time Series Data: Commercial and Scientific Opportunities - Eamonn Keogh 00:21:25
    6. Tensor Methods for Large-scale Unsupervised Learning: Applications to Topic and Community Modeling - Anima Anandkumar 00:31:09
    7. A Quest for Visual Intelligence in Computers - Fei-Fei Li 00:29:39
    8. Graph Mining for Log Data - David Andrzejewski 00:27:59
    9. Why Julia's Important for Data Science - John Myles White 00:24:37
    10. Drugs, DNA, and Dinosaurs: Building High Quality Knowledge Bases with DeepDive - Chris Re 00:30:20
  14. Data-Driven Business Day
    1. Don't Let Data Get in the Way of a Good Story - Mark Madsen 00:26:42
    2. Big Data Stories: Decisions That Drive Successful Projects - Ellen Friedman 00:18:42
    3. Making Business Model Innovation More of a (Data) Science - Jerry Overton 00:17:58
    4. Data "Driven" is Really Data "Accessible" - Ann Johnson 00:14:28
    5. When Ones and Zeros Can Put Billions at Risk... - Anne Johnson 00:16:48
    6. Find the Business in Your Data - Arnab Chakraborty, Dr. Alexander Prinz, Reena Tiwari and Anne Johnson 00:28:00
    7. Tech Magic: 10 Disruptors Shaping the Sensed World - Leah Hunter 00:17:19
    8. Leveraging Big Data and Data Science in Upstream Oil and Gas Industry - Satyam Priyadarshy 00:23:43
    9. Using Data from Many Streams to Drive Social Impact - India Swearingen 00:20:08
    10. Smartphone Data: Tell the Story of People's Lives - Joerg Blumtritt 00:18:47
    11. Big Data Impacts Marketing Productivity at Cisco - Reena Tiwari 00:07:14
    12. National Drug Index: Revealing Prescription Inflation in the US - AJ Loiacono 00:20:48
    13. Shazam - Cait O'Riordan 00:13:40
    14. Digital Business Era: Stretch Your Boundaries - Prith Banerjee 00:13:26
    15. Data Products and the Wearables Revolution - Emi Nomura 00:12:50
    16. Unlocking the Data in Paper: A Case Study of New York Life - Kuang Chen 00:20:46
  15. R Day
    1. An Easy System for Data Wrangling With tidyr and dplyr - Garrett Grolemund - Part 1 00:46:48
    2. An Easy System for Data Wrangling With tidyr and dplyr - Garrett Grolemund - Part 2 00:24:43
    3. A Reactive Grammar of Graphics with ggvis - Winston Chang 00:52:54
    4. Reproducible R Reports with R Markdown - Garrett Grolemund - Part 1 00:33:58
    5. Reproducible R Reports with R Markdown - Garrett Grolemund - Part 2 00:30:40
    6. Analytic Web Applications with Shiny - Winston Chang - Part 1 00:31:13
    7. Analytic Web Applications with Shiny - Winston Chang - Part 2 00:30:05
  16. PyData
    1. Machine Learning with scikit-learn - Andreas Mueller - Part 1 00:44:30
    2. Machine Learning with scikit-learn - Andreas Mueller - Part 2 00:40:17
    3. Slicing Through Data with NumPy - Jennifer Klay - Part 1 00:46:14
    4. Slicing Through Data with NumPy - Jennifer Klay - Part 2 00:31:41
    5. Intro to Numba and Performance Python - Travis Oliphant - Part 1 00:44:32
    6. Intro to Numba and Performance Python - Travis Oliphant - Part 2 00:41:55
    7. Python Data Applications with Blaze and Bokeh - Andy Terrel and Matthew Rocklin - Part 1 00:47:41
    8. Python Data Applications with Blaze and Bokeh - Andy Terrel and Matthew Rocklin - Part 2 00:38:21
    9. Analytics Beyond the Basics with pandas and SQL - Wes McKinney - Part 1 00:42:15
    10. Analytics Beyond the Basics with pandas and SQL - Wes McKinney - Part 2 00:39:19
  17. Large-scale Machine Learning Day
    1. Large-scale Machine Learning Day - Yucheng Low - Part 2 00:34:00
    2. Large-scale Machine Learning Day - Yucheng Low - Part 3 00:26:23
    3. Large-scale Machine Learning Day - Alice Zheng - Part 4 00:30:17
    4. Large-scale Machine Learning Day - Chris DuBois - Part 5 00:38:33
    5. Large-scale Machine Learning Day - Alice Zheng - Part 6 00:42:19
    6. Large-scale Machine Learning Day - Shawn Scully - Part 7 00:54:18
  18. Sponsored
    1. Bringing OLAP Fully Online: Analyze Changing Datasets in MemSQL and Spark with Pinterest Demo - Eric Frenkiel 00:41:13
    2. From Domain-specific Solutions to an Open Platform Architecture for Big Data Analytics Based on Hadoop and Spark - Vin Sharma and Jason (Jinquan) Dai 00:36:56
    3. SAS Analytic Solutions Running on a Hadoop Cluster using YARN - James Kochuba 00:35:37
    4. Global Hadoop: Storage and Compute Challenges in Multi-Data Center Deployments - Jagane Sundar 00:45:13
    5. SQL in Hadoop: To Boldly Go where No Data Warehouse has Gone Before - Emma McGrattan 00:38:38
    6. A Simple, Fast Approach to Analytics for Big Data/IoT with kdb+ - Fintan Quill and Doug Talbott 00:34:58
    7. Scalable Realtime Analytics with declarative SQL like Complex Event Processing Scripts - Srinath Perera 00:43:37
    8. The Data Unification Imperative - Andy Palmer 00:41:12
    9. From Monitoring To Monetization With The Data Lake - Bill Schmarzo 00:42:41
    10. Breaking Through the Top 5 Enterprise Data Quality Roadblocks Inside Hadoop - George Corugedo 00:36:16
    11. Data Dexterity: Immediate Visibility Into All Information - Greg Goldsmith 00:30:32
    12. Extreme Sports and Beyond: Exploring a New Frontier in Data - Josh Byrd and Darren Chinen 00:37:57
    13. Cloud Machine Learning - Joseph Sirosh 00:40:30
    14. Credit Suisse Puts Vendors in the Hot Seat on Data Quality and Governance - Nitesh Ambastha, David Brewster and Nenshad Bardoliwalla 00:44:07
    15. Hive on Spark is Blazing Fast... Or Is It? - Carter Shanklin and Mostafa Mokhtar 00:41:34
    16. Tackling the World’s Biggest Data: Human Data - Richard Caudle 00:31:38
    17. Case Study: Data Warehousing in the Cloud with Snowflake at Kixeye - Jon Bock 00:34:30
    18. PostgreSQL Rising: The Other Elephant in the Room - Ozgun Erdogan 00:39:42
    19. Your First Big Data Application on AWS - Rahul Pathak 00:33:20
    20. Smart Enterprise Big Data Bus for the Modern Responsive Enterprise - Anand Venugopal 00:31:53
    21. Driving Better Business Results at Allstate with Machine Learning on Hadoop - Ryan Michaluk and Alexander Gray 00:40:13
    22. Big Data Architectural Pattern - Clint Sharp 00:27:09
    23. Perform Fast Analytics on Hadoop Data & Scalable Predictive Analytics with Open Innovations from HP Vertica - Steve Sarsfield and Sunil Venkayala 00:37:35
    24. Running Hadoop-as-a-Service in the Cloud - Lance Olson 00:42:14
    25. Real World Use Cases: Hadoop and NoSQL in Production - Ted Dunning and Ellen Friedman 00:39:58
  19. Keynotes
    1. Hadoop's Impact on the Future of Data Management - Amr Awadallah 00:15:05
    2. Close Encounters with the Third Kind of Database - Eric Frenkiel 00:05:22
    3. Impacting Business as it Happens - Anil Gadre 00:10:23
    4. A Bigger Lens Through which to View the World - the IBM Twitter Alliance - Adam Kocoloski 00:05:23
    5. Data Science: Where are We Going? - DJ Patil 00:12:59
    6. The Emerging Age of Data-Driven Policy Design: Examples from Trying to Manage the Global Climate - Solomon Hsiang 00:08:35
    7. Data: Open for Good and Secure by Default - Eddie Garcia 00:09:07
    8. Year Zero: How We’ll Run Our Lives in Ten Years’ Time - Alistair Croll 00:05:25
    9. Intel and the Role of Open Source in Delivering on the Promise of Big Data - Michael Greene 00:05:13
    10. Big Data Lessons from Our Cybernetic Past - Eden Medina 00:15:03
    11. New Directions for Spark in 2015 - Matei Zaharia 00:09:44
    12. A New Approach to Big Data - Roman Shaposhnik 00:05:12
    13. Charting a Path Forward: The Future of Data Visualization - Jeffrey Heer 00:10:10
    14. Connected Cows? - Joseph Sirosh 00:08:37
    15. Startup Showcase Winner Announcement 00:01:05
  20. Solutions Showcase Theater
    1. The Briefcase Cluster - Enabling Big Data Everywhere - Jim Scott 00:08:37
    2. Why Event Analytics Matter - Rohit Shrivastava 00:10:41
    3. Cracking the Data Conundrum - Steffin Harris 00:11:42
    4. Smart Data for Smarter Utilities - Irshad Raihan 00:08:33
    5. The Value of Churn Analytics at Cisco - Ivan Chen and Phil Hodsdon 00:12:57
    6. Big Data Governance - Felix Van de Maele 00:10:02
    7. Early Warnings for Customer Churn at a Leading Cloud Technology Firm! - Umair Rauf 00:11:27
    8. Harnessing Big Social Data to Deliver Human Data Intelligence - Jason Rose 00:09:32
    9. Operationalizing Hadoop – Are You Ready? - Valerie Fowler 00:10:20
    10. Multimedia Giant Turns Big Data into Real-Time Customer Insights - Brian Garrett 00:10:27
    11. Data Wrangling in the Wild - Sean Ma 00:10:02
    12. StreamAnalytix-Developing Enterprise Class, Real-time Streaming Applications on Apache Storm - Anand Venugopal 00:12:44
    13. Gaining Value From Data Where It's Born - Ryan Peterson 00:07:21
    14. Build a Foundation for Self-Service Data Prep, Analytics, and Governance - Oliver Claude 00:09:05
    15. Connecting the Big-Data Driven Enterprise in Online Retail - Ashley Stirrup 00:07:24
    16. Leading Telecommunications Company Uses BlueData to Spin Up Local, On Demand Hadoop and Spark Clusters to Enable Agile Deployment of Big Data Tools and Technologies - Nanda Vijaydev 00:10:07
    17. Taming Data Variety: Intelligent Solutions Using Machine Learning and Expert Crowdsourcing - Alan Wagner 00:08:18
    18. Everything You Need To Know About HBase in 10 Minutes or Less - Alex Newman 00:10:09
    19. The Emergence of the Data Refinery - Chuck Yarbrough 00:12:28
    20. Big Data Cluster Planning and Optimization Using Wolf Island Simulation Technology - Laurent Isenegger 00:10:37
    21. Prosthetic Implant Surgery - Where Big Data Means Big Savings - Rola Shaar 00:08:07
    22. Close the Skills Gap and Deliver Rapid Business Value with Big Data Apps - Manan Goel 00:10:46
    23. Distributed R - Scaling the R Language for Even Bigger Data - Sunil Venkayala 00:10:51
    24. Transforming Big Data Landscape with Apache Spark - Rishi Yadav 00:10:11
    25. Data Warehousing in the Cloud - Jon Bock 00:10:12
    26. Proactive Product Intelligence for Electronics - Rami Lokas 00:12:29
    27. Massive-Scale Security Incident Response Leveraging a Hadoop Architecture - Michael A. Davis 00:13:05
    28. Don’t be a Hadoop Breach Headline - Discovery and Sensitive Data in Hadoop - Jeremy Stieglitz 00:11:08
    29. Big Data vs. Climate Change - Srivatsan Ramanujam and John Cardente 00:11:13
    30. ZEAS – Enabling anyone to create Hadoop Enterprise applications fast using a GUI - Aditya Agrawal 00:10:48
    31. Power Tools for Big Data Analytics - Dan Steinberg 00:10:53
    32. Big Data on OpenStack - Kirk Lewis and Frank Rego 00:13:00
    33. Fighting ATM Fraud in Real Time with Hadoop Analytics - Christy Maver 00:08:30
    34. Scale Big Data cost down, while scaling performance out. An NTT mobile personalization retrospective, re-thinking the Big Data solution stack. - Robert Greene 00:11:18
    35. Dato Enables Large-Scale Deduplication at Zillow using GraphLab Create - Rajat Arya 00:08:04
    36. To Catch a Thief with Big Data - Kevin Petrie 00:12:24
    37. Jump into the Data Lake with Hadoop-Scale Data Integration - Greg Benson 00:10:15
    38. Predicting The Future To Improve Customer Satisfaction - Joe Rossi 00:08:38
    39. The Practical, Profitable Magic of Prescriptive Analytics - Andy Flint 00:09:23
    40. Changing the Culture Around Data: Empowering More People with Analytics - Gary Cottrell 00:09:53
    41. How Havas Media Found New Revenue Streams with UNIFi Software - Sean Keenan 00:06:41
    42. What Enterprises Can Learn From Real-Time Bidding - Peter Corless 00:10:49
    43. Big Data and the Data Quality Imperative - Ed Wrazen 00:11:53
    44. Tapjoy Scales and Saves Costs with Riak - Tom Sigler 00:09:34
    45. Smart Execution: How to Optimize Performance by Intelligently Leveraging Multiple Hadoop Analytics Engines - Matt Schumpert 00:10:45
    46. Jagex Game Studio Case Study - Gregory McPhee 00:09:04
    47. Supercharge Sqoop with magical JDBC drivers - Sumit Sarkar 00:09:59
    48. Big Data Analytics: Diverse Use Cases, Diverse Architectures - Ben Conners 00:08:45
    49. Accelerate your data with SequoiaDB - Tao Wang 00:07:36
    50. Building reliable Hadoop clusters with two copies - Iyer Venkatesan 00:10:32

Product Information

  • Title: Strata + Hadoop World San Jose 2015: Complete Video Compilation
  • Author(s):
  • Release date: March 2015
  • Publisher(s): O'Reilly Media, Inc.
  • ISBN: 9781491924143