Data Resources – Making Data Work

Welcome to the essential training and information source for data science and big data—with books, in-person and online events, reports, industry news, and much more. HBase Essentials HBase Essentials Advanced Analytics with Spark Advanced Analytics with Spark Introduction to Data Science with R Introduction to Data Science with R Learning Big Data with Amazon Elastic MapReduce Learning Big Data with Amazon Elastic MapReduce Mastering Machine Learning with scikit-learn Mastering Machine Learning with scikit-learn Making Big Data Work for Your Business Making Big Data Work for Your Business PostgreSQL Administration Essentials PostgreSQL Administration Essentials Field Guide to Hadoop Field Guide to Hadoop Oracle Solaris 11 Advanced Administration Cookbook Oracle Solaris 11 Advanced Administration Cookbook I Heart Logs I Heart Logs
by Jay Kreps Thoughtful Machine Learning Thoughtful Machine Learning
by Matthew Kirk Data Science at the Command Line Data Science at the Command Line
by Jeroen Janssens


Change the World with Data
Join us at an upcoming O'Reilly Strata Conference

Strata Conference & Hadoop World
New York, NY | October 15-17, 2014

Strata Conference in Barcelona
Barcelona, Spain | November 19-21, 2014

Strata Santa Clara

Data News

Signals from Strata + Hadoop World in Barcelona 2014

By Mac Slocum
November 20, 2014

Experts from across the big data world are coming together this week for Strata + Hadoop World in Barcelona 2014. We’ve gathered insights from the event below. #IoTH: The Internet of Things and Humans “If we could start over with …

The science of moving dots: the O’Reilly Data Show Podcast

By Ben Lorica
November 20, 2014

Editor’s note: you can subscribe to the O’Reilly Data Show Podcast through SoundCloud. Many data scientists are comfortable working with structured operational data and unstructured text. Newer techniques like deep learning have opened up data types like images, video, and …

The big data sweet spot: policy that balances benefits and risks

By Andy Oram
November 13, 2014

A big reason why discussions of “big data” get complicated — and policy-makers resort to vague hand-waving, as in the well-known White House executive office report — is that its ripple effects travel fast and far. Your purchase, when recorded …

The problem of managing schemas

By Gwen Shapira
November 4, 2014

When a team first starts to consider using Hadoop for data storage and processing, one of the first questions that comes up is: which file format should we use? This is a reasonable question. HDFS, Hadoop’s data storage, is different …

More News >

Data Experts

Jay Kreibich Jay Kreibich is a professional software engineer who has always been interested in how people process and understand information.

Shahed Latif Shahed Latif is a partner in KPMG's Advisory practice having extensive IT and business skills. He has over 23 years of experience working with the global fortune 1000 companies focusing on providing business and technology solutions across a variety of areas.

Luciano Ramalho Luciano Ramalho Luciano Ramalho was a Web developer before the Netscape IPO in 1995, and switched from Perl to Java to Python in 1998. Since then he worked on some of the largest news portals in Brazil using Python, and taught Python web development in the Brazilian media, banking and government sectors.…

Noah Iliinsky Noah Iliinsky Illinsky has spent the last several years thinking about effective approaches to creating diagrams and other types of information visualization. He also works in interface and interaction design, all from a functional and user-centered perspective.

More Data Experts >

Video Compilation - Available Now

Strata Conference video compilation

Get Your Front-Row Access to Strata Conference

Gain a clear perspective on the future of big data--and all the analytics, architectures, techniques, tools, and technologies you need to use data successfully. With this complete video compilation, you'll get a front-row seat to the keynotes, workshops, and sessions at O'Reilly's Strata Conference Santa Clara 2014.

More about this video >

Data Science Starter Kit

Data Science Books

This kit includes everything you need to get started with data analysis, visualization, and management.

"'Data Scientist' is now the hottest job title in Silicon Valley."

– Tim O'Reilly

Learn More

Data Webcasts
Learn directly from data experts. Join us for these free, live webcasts.

Getting Started with Impala - Interactive SQL for Apache Hadoop
December 4, 2014 - 10AM PT,

All-vs-All: Correlation Using Spark/Hadoop
December 9, 2014 - 10AM PT,

Next Gen Leaders Set Pace For New Wave of Solutions
December 9, 2014 - 10AM PT,

More Webcasts >