Data Resources – Making Data Work

Welcome to the essential training and information source for data science and big data, with books, in-person and online events, reports, industry news, and much more.

Elasticsearch Blueprints
Data Algorithms, by Mahmoud Parsian
Mastering Julia
Hadoop Application Architectures, by Mark Grover, Ted Malaska, Jonathan Seidman, Gwen Shapira
Accumulo, by Michael Wall, Aaron Cordova, Billie Rinaldi
Essential SQLAlchemy, Second Edition, by Rick Copeland, Jason Myers
Hadoop Security
Data Science with Microsoft Azure and R, by Stephen Elston
Design Principles for Process-driven Architectures Using Oracle BPM and SOA Suite 12c
Apache Mesos Essentials
xQuery, Second Edition
Graph Databases, Second Edition


Change the World with Data
Join us at an upcoming Strata + Hadoop World Conference

Strata + Hadoop World in London
London, UK | 5-7 May, 2015

Strata + Hadoop World
New York, NY | September 29-October 1, 2015

Data News

How an enterprise begins its big data journey

By Rachel Wolfson
July 31, 2015

As the amount of data continues to double in size every two years, organizations are struggling more than ever before to manage, ingest, store, process, transform, and analyze massive data sets. It has become clear that getting started on the …

Understanding neural function and virtual reality

By Ben Lorica
July 30, 2015

Like many data scientists, I’m excited about advances in large-scale machine learning, particularly recent success stories in computer vision and speech recognition. But I’m also cognizant of the fact that press coverage tends to inflate what current systems can do, …

How real-time analytics integrates with our connected world

By Courtney Webster
July 29, 2015

In this special-edition O’Reilly Podcast, O’Reilly’s Ben Lorica and VoltDB’s co-founder Scott Jarr discuss how VoltDB’s hybrid transaction, analytic system allows for real-time analytics and personalization of data across various industries. Scaling transaction processing without losing the relational database MIT’s …

How trains are becoming data driven

By Gerhard Kress
July 27, 2015

Trains and public transport are, for many of us, a vital part of our daily lives. Large cities are particularly dependent on an efficient public transport system, and if disruption occurs, it usually affects many passengers while spreading across the …


Data Experts

David Yahalom
CTO, Oracle / BigData / NoSQL DBA group founder, lead architect and hands-on database consultant. Over 12 years of experience in database & information systems architecture design, leading database teams and as a database consultant / DBA. Hands-on experience with Oracle, Hadoop/Cloudera, Amazon EMR, MSSQL, MySQL, PostgreSQL and as an…

Mahmoud Parsian
Mahmoud Parsian, Ph.D. in Computer Science, is a practicing software professional with 30 years of experience as a developer, designer, architect, and author. For the past 15 years, he has been involved in Java server-side, databases, MapReduce, and distributed computing. Dr. Parsian is currently with Illumina and leads the…

Garrett Grolemund
Garrett maintains, the development center for the Shiny R package, and is the author of Hands-On Programming with R as well as Data Science with R, a forthcoming book by O'Reilly Media. Garrett is a Data Scientist and Chief Instructor at RStudio, Inc. In his own words: I specialize…

Luciano Ramalho
Luciano Ramalho was a Web developer before the Netscape IPO in 1995, and switched from Perl to Java to Python in 1998. Since then he has worked on some of the largest news portals in Brazil using Python, and taught Python web development in the Brazilian media, banking and government sectors.…


Video Compilation - Available Now

Strata Conference video compilation

Get Your Front-Row Access to Strata Conference

Gain a clear perspective on the future of big data--and all the analytics, architectures, techniques, tools, and technologies you need to use data successfully. With this complete video compilation, you'll get a front-row seat to the keynotes, workshops, and sessions at O'Reilly's Strata Conference Santa Clara 2014.

More about this video >

Data Science Starter Kit

Data Science Books

The tools you need to get started with data—from basic statistics to complex modeling and large-scale analytics.

"'Data Scientist' is now the hottest job title in Silicon Valley."

– Tim O'Reilly

Learn More

Data Webcasts
Learn directly from data experts. Join us for these free, live webcasts.

Tame the firehose with Elasticsearch and Spark
August 12, 2015 - 9AM PT

Apache Kylin from eBay: Extreme OLAP engine for Hadoop
August 13, 2015 - 10AM PT

The connected car: An example of streaming real-time analytics
August 20, 2015 - 10AM PT

Apache Spark Solution for Rank Product
August 25, 2015 - 10AM PT

Easy, reproducible reports with R
August 26, 2015 - 10AM PT

Deep dive into Project Tungsten: Bring Spark closer to bare metal
September 3, 2015 - 10AM PT
