Data Resources – Making Data Work

Welcome to the essential training and information source for data science and big data, with books, in-person and online events, reports, industry news, and much more.

Data Algorithms
by Mahmoud Parsian

Hadoop Application Architectures
by Mark Grover, Ted Malaska, Jonathan Seidman, Gwen Shapira

Accumulo
by Michael Wall, Aaron Cordova, Billie Rinaldi

Essential SQLAlchemy, Second Edition
by Rick Copeland, Jason Myers

Hadoop Security

QGIS By Example

Design Principles for Process-driven Architectures Using Oracle BPM and SOA Suite 12c

Data Science with Microsoft Azure and R
by Stephen Elston

xQuery, Second Edition

Graph Databases, Second Edition

Apache Mesos Essentials

Learning Redis


Change the World with Data
Join us at an upcoming Strata + Hadoop World Conference

Strata + Hadoop World in London
London, UK | 5-7 May, 2015

Strata + Hadoop World
New York, NY | September 29-October 1, 2015

Data News

How real-time analytics integrates with our connected world

By Courtney Webster
July 29, 2015

In this special-edition O’Reilly Podcast, O’Reilly’s Ben Lorica and VoltDB’s co-founder Scott Jarr discuss how VoltDB’s hybrid transactional and analytic system allows for real-time analytics and personalization of data across various industries. Scaling transaction processing without losing the relational database MIT’s …

How trains are becoming data driven

By Gerhard Kress
July 27, 2015

Trains and public transport are, for many of us, a vital part of our daily lives. Large cities are particularly dependent on an efficient public transport system, and if disruption occurs, it usually affects many passengers while spreading across the …

Big data, real-time access: How Apache Drill makes it easy

By Ellen Friedman
July 24, 2015

Register for the free webcast “Easy, real-time access to data with Apache Drill,” which will be held Thursday, July 30, 2015, at 10 a.m. PT. This panel discussion will explore the major role SQL-on-Hadoop technologies play in organizations. Big data …

Big data, small cluster

By Marie Beaugureau
July 20, 2015

Register for the free webcast, “Extending Cassandra with Doradus OLAP for High Performance Analytics,” which will be held July 29 at 9 a.m. PT. Engineers at Dell were developing customer apps when they found that the query response times their …

Data Experts

David Yahalom
CTO, Oracle / BigData / NoSQL DBA group founder, lead architect and hands-on database consultant. Over 12 years of experience in database & information systems architecture design, leading database teams and as a database consultant / DBA. Hands-on experience with Oracle, Hadoop/Cloudera, Amazon EMR, MSSQL, MySQL, PostgreSQL and as an…

Mahmoud Parsian
Mahmoud Parsian, Ph.D. in Computer Science, is a practicing software professional with 30 years of experience as a developer, designer, architect, and author. For the past 15 years, he has been involved in Java server-side, databases, MapReduce, and distributed computing. Dr. Parsian is currently with Illumina and leads the…

Garrett Grolemund
Garrett maintains, the development center for the Shiny R package, and is the author of Hands-On Programming with R as well as Data Science with R, a forthcoming book by O'Reilly Media. Garrett is a Data Scientist and Chief Instructor at RStudio, Inc. In his own words: I specialize…

Luciano Ramalho
Luciano Ramalho was a Web developer before the Netscape IPO in 1995, and switched from Perl to Java to Python in 1998. Since then he has worked on some of the largest news portals in Brazil using Python, and taught Python web development in the Brazilian media, banking and government sectors.…

Video Compilation - Available Now

Strata Conference video compilation

Get Your Front-Row Access to Strata Conference

Gain a clear perspective on the future of big data--and all the analytics, architectures, techniques, tools, and technologies you need to use data successfully. With this complete video compilation, you'll get a front-row seat to the keynotes, workshops, and sessions at O'Reilly's Strata Conference Santa Clara 2014.

Data Science Starter Kit

Data Science Books

The tools you need to get started with data—from basic statistics to complex modeling and large-scale analytics.

"'Data Scientist' is now the hottest job title in Silicon Valley."

– Tim O'Reilly

Data Webcasts
Learn directly from data experts. Join us for these free, live webcasts.

Easy, real-time access to data with Apache Drill
July 30, 2015 - 10AM PT

Tame the firehose with Elasticsearch and Spark
August 12, 2015 - 9AM PT

Apache Kylin from eBay: Extreme OLAP engine for Hadoop
August 13, 2015 - 10AM PT

The connected car: An example of streaming real-time analytics
August 20, 2015 - 10AM PT

Easy, reproducible reports with R
August 26, 2015 - 10AM PT

Deep dive into Project Tungsten: Bring Spark closer to bare metal
September 3, 2015 - 10AM PT
