Data Resources – Making Data Work

Welcome to the essential training and information source for data science and big data—with books, in-person and online events, reports, industry news, and much more.

http://akamaicovers.oreilly.com/images/0636920034919/thumb.gif Python Data Science Handbook Python Data Science Handbook
by Jake VanderPlas
http://akamaicovers.oreilly.com/images/9781786461810/thumb.gif SQL Server 2016 Reporting Services Cookbook SQL Server 2016 Reporting Services Cookbook
http://akamaicovers.oreilly.com/images/0636920055334/thumb.gif Data Pipelines with Python Data Pipelines with Python
http://akamaicovers.oreilly.com/images/0636920039761/rc_thumb.gif Database Reliability Engineering Database Reliability Engineering
http://akamaicovers.oreilly.com/images/0636920057482/thumb.gif Mastering Spark for Structured Streaming Mastering Spark for Structured Streaming
by Michael Li
http://akamaicovers.oreilly.com/images/0636920044383/thumb.gif Programming Pig Programming Pig
Second Edition
http://akamaicovers.oreilly.com/images/0636920055341/thumb.gif Understanding SQL and R Understanding SQL and R
http://akamaicovers.oreilly.com/images/0636920051459/rc_thumb.gif Moving Hadoop to the Cloud Moving Hadoop to the Cloud
http://akamaicovers.oreilly.com/images/0636920064220/thumb.gif Learning React.js Data Visualization Learning React.js Data Visualization
http://akamaicovers.oreilly.com/images/0636920054108/thumb.gif Practical Artificial Intelligence in the Cloud Practical Artificial Intelligence in the Cloud
http://akamaicovers.oreilly.com/images/0636920056737/thumb.gif Database Fundamentals for Java Programmers Database Fundamentals for Java Programmers
http://akamaicovers.oreilly.com/images/0636920052715/rc_thumb.gif PostgreSQL: Up and Running PostgreSQL: Up and Running
Third Edition

Conferences

Change the World with Data
Join us at an upcoming Strata + Hadoop World Conference

Strata + Hadoop World in London
London, UK | 5-7 May, 2015

Strata + Hadoop World
New York, NY | September 29-October 1, 2015

Data News

Using Apache Spark to predict attack vectors among billions of users and trillions of events

By Ben Lorica
February 25, 2016

Subscribe to the O’Reilly Data Show Podcast to explore the opportunities and techniques driving big data and data science: Stitcher, TuneIn, iTunes, SoundCloud, RSS. In this episode of the O’Reilly Data Show, I spoke with Fang Yu, co-founder and CTO …

Metadata services can lead to performance and organizational improvements

By Ben Lorica
February 11, 2016

Subscribe to the O’Reilly Data Show Podcast to explore the opportunities and techniques driving big data and data science: Stitcher, TuneIn, iTunes, SoundCloud, RSS. In this episode of the O’Reilly Data Show, I spoke with one of the most popular …

Building a business that combines human experts and data science

By Ben Lorica
January 28, 2016

Subscribe to the O’Reilly Data Show Podcast to explore the opportunities and techniques driving big data and data science. In this episode of the O’Reilly Data Show, I spoke with Eric Colson, chief algorithms officer at Stitch Fix, and former …

Is 2016 the year you let robots manage your money?

By Ben Lorica
January 14, 2016

Subscribe to the O’Reilly Data Show Podcast to explore the opportunities and techniques driving big data and data science. In this episode of the O’Reilly Data Show, I sat down with Vasant Dhar, a professor at the Stern School of …

More News >

Data Experts

Sujee Maniyam Sujee Maniyam Sujee Maniyam is a seasoned Big Data practitioner and instructor in Big Data technologies (Hadoop, Spark, NoSQL and Cloud). He is an open source contributor and author of 'Hadoop illuminated' (an open-source book on Hadoop) and 'HBase Design Patterns'. Sujee is a frequent speaker at various conferences and meetups. He…

Jonathan Whitmore Jonathan Whitmore Jonathan Whitmore, PhD, is a Senior Data Scientist at Silicon Valley Data Science. He is the author of an O'Reilly media screencast titled Jupyter Notebook for Data Science Teams. Before moving into the tech industry, Dr. Whitmore worked as an astrophysicist in Melbourne, Australia, researching whether the fundamental physical constants…

Colin Gillespie Colin Gillespie Colin Gillespie is Senior lecturer (Associate professor) at Newcastle University, UK. His research interests are high performance statistical computing and Bayesian statistics. He is regularly employed as a consultant by Jumping Rivers and has been teaching R since 2005 at a variety of levels, ranging from beginners to advanced programming.

Konrad Malawski Konrad Malawski Konrad Malawski is a core developer at Lightbend working on Akka, a distributed systems toolkit for the JVM. He is currently responsible for the Akka HTTP module, has contributed large parts of Akka Persistence and remains active in the Core modules of Akka as well. He is a leading contributor…

More Data Experts >

Video Compilation - Available Now

Strata Conference video compilation

Get Your Front-Row Access to Strata Conference

Gain a clear perspective on the future of big data--and all the analytics, architectures, techniques, tools, and technologies you need to use data successfully. With this complete video compilation, you'll get a front-row seat to the keynotes, workshops, and sessions at O'Reilly's Strata Conference Santa Clara 2014.

More about this video >

Data Science Starter Kit

Data Science Books

The tools you need to get started with data—from basic statistics to complex modeling and large-scale analytics.

"'Data Scientist' is now the hottest job title in Silicon Valley."

– Tim O'Reilly

Learn More

Data Webcasts
Learn directly from data experts. Join us for these free, live webcasts.

Data Preparation State of the Union
December 6, 2016 - 10AM PT,


Data Product Architectures
December 7, 2016 - 10AM PT,


What’s coming for big data in 2017?
December 13, 2016 - 10AM PT,


How to build a successful enterprise data lake
January 12, 2017 - 10AM PT,


Spark and Java - Yes they work together!
January 24, 2017 - 10AM PT,


More Webcasts >