Data Resources – Making Data Work

Welcome to the essential training and information source for data science and big data—with books, in-person and online events, reports, industry news, and much more. Data Algorithms Data Algorithms
by Mahmoud Parsian Hadoop Application Architectures Hadoop Application Architectures
by Mark Grover, Ted Malaska, Jonathan Seidman, Gwen Shapira Accumulo Accumulo
by Michael Wall, Aaron Cordova, Billie Rinaldi Essential SQLAlchemy Essential SQLAlchemy
by Rick Copeland, Jason Myers
Second Edition Hadoop Security Hadoop Security QGIS By Example QGIS By Example Data Science with Microsoft Azure and R Data Science with Microsoft Azure and R
by Stephen Elston Design Principles for Process-driven Architectures Using Oracle BPM and SOA Suite 12c Design Principles for Process-driven Architectures Using Oracle BPM and SOA Suite 12c Apache Mesos Essentials Apache Mesos Essentials xQuery xQuery
Second Edition Graph Databases Graph Databases
Second Edition Learning Redis Learning Redis


Change the World with Data
Join us at an upcoming Strata + Hadoop World Conference

Strata + Hadoop World in London
London, UK | 5-7 May, 2015

Strata + Hadoop World
New York, NY | September 29-October 1, 2015

Data News

Big data, real-time access: How Apache Drill makes it easy

By Ellen Friedman
July 24, 2015

Register for the free webcast “Easy, real-time access to data with Apache Drill,” which will be held Thursday, July 30, 2015, at 10 a.m. PT. This panel discussion will explore the major role SQL-on-Hadoop technologies play in organizations. Big data …

Big data, small cluster

By Marie Beaugureau
July 20, 2015

Register for the free webcast, “Extending Cassandra with Doradus OLAP for High Performance Analytics,” which will be held July 29 at 9 a.m. PT. Engineers at Dell were developing customer apps when they found that the query response times their …

Data has a shape

By David Beyer
July 20, 2015

Get notified when our free report, “Future of Machine Intelligence: Perspectives from Leading Practitioners,” is available for download. The following interview is one of many that will be included in the report. As part of our ongoing series of interviews …

6 reasons why I like KeystoneML

By Ben Lorica
July 16, 2015

As we put the finishing touches on what promises to be another outstanding Hardcore Data Science Day at Strata + Hadoop World in New York, I sat down with my co-organizer Ben Recht for the the latest episode of the …

More News >

Data Experts

David Yahalom David Yahalom CTO, Oracle / BigData / NoSQL DBA group founder, lead architect and hands-on database consultant. Over 12 years of experience in database & information systems architecture design, leading database teams and as a database consultant / DBA.Hands on experience with Oracle, Hadoop/Cloudera, Amazon EMR, MSSQL, MySQL, PostgreSQL and as an…

Mahmoud Parsian Mahmoud Parsian Mahmoud Parsian, Ph.D. in Computer Science, is a practicing software professional with 30 years of experience as a developer, designer, architect, and author. For the past 15 years, he has been involved in Java server-side, databases, MapReduce, and distributed computing. Dr. Parsian is currently with Illumina and leads the…

Garrett Grolemund Garrett Grolemund Garrett maintains, the development center for the Shiny R package, and is the author of Hands-On Programming with R as well as Data Science with R, a forthcoming book by O'Reilly Media. Garrett is a Data Scientist and Chief Instructor at RStudio, Inc. In his own words: I specialize…

Luciano Ramalho Luciano Ramalho Luciano Ramalho was a Web developer before the Netscape IPO in 1995, and switched from Perl to Java to Python in 1998. Since then he worked on some of the largest news portals in Brazil using Python, and taught Python web development in the Brazilian media, banking and government sectors.…

More Data Experts >

Video Compilation - Available Now

Strata Conference video compilation

Get Your Front-Row Access to Strata Conference

Gain a clear perspective on the future of big data--and all the analytics, architectures, techniques, tools, and technologies you need to use data successfully. With this complete video compilation, you'll get a front-row seat to the keynotes, workshops, and sessions at O'Reilly's Strata Conference Santa Clara 2014.

More about this video >

Data Science Starter Kit

Data Science Books

The tools you need to get started with data—from basic statistics to complex modeling and large-scale analytics.

"'Data Scientist' is now the hottest job title in Silicon Valley."

– Tim O'Reilly

Learn More

Data Webcasts
Learn directly from data experts. Join us for these free, live webcasts.

Integrating Customer Data at Scale
July 28, 2015 - 10AM PT,

Extending Cassandra with Doradus OLAP for High Performance Analytics
July 29, 2015 - 09AM PT,

Easy, real-time access to data with Apache Drill
July 30, 2015 - 10AM PT,

Tame the firehose with Elasticsearch and Spark
August 12, 2015 - 09AM PT,

Apache Kylin from eBay: Extreme OLAP engine for Hadoop
August 13, 2015 - 10AM PT,

The connected car: An example of streaming real-time analytics
August 20, 2015 - 10AM PT,

More Webcasts >