Data Resources – Making Data Work

Welcome to the essential training and information source for data science and big data—with books, in-person and online events, reports, industry news, and much more.

Stream Processing with Apache Flink
HTML to MadCap Flare
Text Mining with R
The Business of Deep Learning, by Matt Coatney
PostgreSQL High Availability Cookbook, Second Edition
Artificial Intelligence Now
Big Data Now: 2016 Edition
Type Inheritance and Relational Theory
NHibernate 4.x Cookbook, Second Edition
Julia Solutions
HBase High Performance Cookbook
Data Science on the Google Cloud Platform

Conferences

Change the World with Data
Join us at an upcoming Strata + Hadoop World Conference

Strata + Hadoop World in London
London, UK | 5-7 May, 2015

Strata + Hadoop World
New York, NY | September 29-October 1, 2015

Data News

Using Apache Spark to predict attack vectors among billions of users and trillions of events

By Ben Lorica
February 25, 2016

Subscribe to the O’Reilly Data Show Podcast to explore the opportunities and techniques driving big data and data science: Stitcher, TuneIn, iTunes, SoundCloud, RSS. In this episode of the O’Reilly Data Show, I spoke with Fang Yu, co-founder and CTO …

Metadata services can lead to performance and organizational improvements

By Ben Lorica
February 11, 2016

In this episode of the O’Reilly Data Show, I spoke with one of the most popular …

Building a business that combines human experts and data science

By Ben Lorica
January 28, 2016

In this episode of the O’Reilly Data Show, I spoke with Eric Colson, chief algorithms officer at Stitch Fix, and former …

Is 2016 the year you let robots manage your money?

By Ben Lorica
January 14, 2016

In this episode of the O’Reilly Data Show, I sat down with Vasant Dhar, a professor at the Stern School of …

More News >

Data Experts

Tom Augspurger is a data scientist and software developer. He contributes to and maintains several open source packages, including pandas.

Frank Kane spent 9 years at Amazon and IMDb, developing and managing the technology that automatically delivers product and movie recommendations to hundreds of millions of customers, all the time. Frank holds 17 issued patents in the fields of distributed computing, data mining, and machine learning. In 2012, Frank left…

Tyler Akidau is a staff software engineer at Google. The current tech lead for internal streaming data processing systems (e.g. "MillWheel"), he’s spent seven years working on massive-scale streaming data processing systems. He passionately believes in streaming data processing as the more general model of large-scale computation. His preferred mode…

Colin Gillespie is a senior lecturer (associate professor) at Newcastle University, UK. His research interests are high-performance statistical computing and Bayesian statistics. He is regularly employed as a consultant by Jumping Rivers and has been teaching R since 2005 at a variety of levels, from beginner to advanced programming.

More Data Experts >

Video Compilation - Available Now

Strata Conference video compilation

Get Your Front-Row Access to Strata Conference

Gain a clear perspective on the future of big data, along with the analytics, architectures, techniques, tools, and technologies you need to use data successfully. With this complete video compilation, you'll get a front-row seat to the keynotes, workshops, and sessions at O'Reilly's Strata Conference Santa Clara 2014.

More about this video >

Data Science Starter Kit

Data Science Books

The tools you need to get started with data—from basic statistics to complex modeling and large-scale analytics.

"'Data Scientist' is now the hottest job title in Silicon Valley."

– Tim O'Reilly

Learn More

Data Webcasts
Learn directly from data experts. Join us for these free, live webcasts.

How to get started with DevOpSec
February 23, 2017 - 10AM PT

An Intro to Predictive Modeling for Customer Lifetime Value
February 28, 2017 - 10AM PT

How to Scale Different Data Models
March 2, 2017 - 10AM PT

Practical strategies for data unification, with Dr. Michael Stonebraker
March 21, 2017 - 10AM PT

More Webcasts >