Skip to Content conference Strata + Hadoop World 2016 - London, United Kingdom: Video Compilation June 2016
103h 45m
English
Closed Captioning available in German, English, Spanish, French, Japanese, Korean, Portuguese (Portugal, Brazil), Chinese (Simplified), Chinese (Traditional) Course outline Data science & advanced analytics 10h 49m
R and reproducible reporting for big data - Aimee Gott (Mango Solutions), Mark Sellors (Mango Solutions), and Richard Pugh (Mango Solutions) - Part 11h 19m 29s
R and reproducible reporting for big data - Aimee Gott (Mango Solutions), Mark Sellors (Mango Solutions), and Richard Pugh (Mango Solutions) - Part 21h 17m 56s
Deep learning and natural language processing with Spark - Andy Petrella (Data Fellas) and Melanie Warrick (Skymind)40m 29s
Semantic natural language understanding with Spark Streaming, UIMA, and machine-learned ontologies - David Talby (Atigeo) and Claudiu Branzan (Atigeo)45m 30s
Sightseeing, venues, and friends: Predictive analytics with Spark ML and Cassandra - Natalino Busa (Teradata)38m 44s
Introduction to generalized low-rank models and missing values - Jo-fai Chow (H2O.ai)29m 12s
Petascale genomics - Tom White (Cloudera)40m 13s
Panel: The future of intelligence - Marc Warner (ASI), Stuart Russell (UC Berkeley), and Jaan Tallinn (CSER)39m 20s
The polyglot data scientist - Jeroen Janssens (Tilburg University)25m 11s
Beyond guide dogs: How advances in deep learning can empower the blind community - Anirudh Koul (Microsoft) and Saqib Shaikh (Microsoft)37m 52s
Predicting out-of-sample performance of a large cohort of trading algorithms with machine learning - Thomas Wiecki (Quantopian)38m 30s
Scala: The unpredicted lingua franca for data science - Andy Petrella (Data Fellas) and Dean Wampler (Lightbend)42m 56s
Land mine or Coke can: Machine learning from GPR data - Dirk Gorissen (Skycap | World Bank)33m 39s
Data modeling for data science: Simplify your workload with complex types - Marcel Kornacker (Cloudera)40m 28s
Applications of natural language understanding: Tools and technologies - Alyona Medelyan (Entopix)39m 31s
Data-driven business 15h 1m
Enterprise adoption 10h 3m
Apache Hadoop operations for production systems - Jayesh Seshadri (Cloudera), Justin Hancock (Cloudera), Mark Samson (Cloudera), and Wellington Chevreuil (Cloudera) - Part 11h 25m 46s
Apache Hadoop operations for production systems - Jayesh Seshadri (Cloudera), Justin Hancock (Cloudera), Mark Samson (Cloudera), and Wellington Chevreuil (Cloudera) - Part 21h 34m 4s
Apache Hadoop operations for production systems - Jayesh Seshadri (Cloudera), Justin Hancock (Cloudera), Mark Samson (Cloudera), and Wellington Chevreuil (Cloudera) - Part 31h 23m 20s
Apache Hadoop operations for production systems - Jayesh Seshadri (Cloudera), Justin Hancock (Cloudera), Mark Samson (Cloudera), and Wellington Chevreuil (Cloudera) - Part 41h 19m 21s
Architecting a data platform - John Akred (Silicon Valley Data Science) and Stephen O'Sullivan (Silicon Valley Data Science) - Part 11h 24m 41s
Architecting a data platform - John Akred (Silicon Valley Data Science) and Stephen O'Sullivan (Silicon Valley Data Science) - Part 21h 36m 3s
Big SQL: The future of in-cluster analytics and enterprise adoption - Moderated by: Surya Mukherjee (Ovum) - Panelists: Lloyd Tabb (Looker Data Science), Nick Amabile (FullStack Analytics), Rex Gibson (Knewton), dp Suresh (Yahoo!)39m 16s
BI on Hadoop: What are your options? - Tomer Shiran (Dremio)40m 44s
Hadoop internals & development 6h 34m
Hadoop application architectures: Fraud detection - Jonathan Seidman (Cloudera), Mark Grover (Cloudera), Gwen Shapira (Confluent), and Ted Malaska (Cloudera) - Part 11h 29m 48s
Hadoop application architectures: Fraud detection - Jonathan Seidman (Cloudera), Mark Grover (Cloudera), Gwen Shapira (Confluent), and Ted Malaska (Cloudera) - Part 21h 23m 17s
The next 10 years of Apache Hadoop - Doug Cutting (Cloudera), Tom White (Cloudera), and Ben Lorica (O'Reilly Media)39m 56s
Hadoop's storage gap: Resolving transactional access/analytic performance trade-offs with Apache Kudu (incubating) - Todd Lipcon (Cloudera, Inc.)41m 54s
Building real-time BI systems with HDFS and Kudu - Ruhollah Farchtchi (Zoomdata)35m 37s
Why is my Hadoop job slow? - Bikas Saha (Hortonworks Inc)39m 4s
Scaling out to 10 clusters, 1,000 users, and 10,000 flows: The Dali experience at LinkedIn - Carl Steinbach (LinkedIn)35m 43s
Floating elephants: Developing data wrangling systems on Docker - Chad Metcalf (Docker) and Seshadri Mahalingam (Trifacta)29m 7s
Hardcore data science 5h 34m
Visualization & user experience 5h 14m
Law, ethics, governance 2h 38m
Show More A hands-on introduction to Apache Kafka - Ian Wrigley (Confluent) - Part 1
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month, and much more. Start your free trial