12. Data Science with Hadoop—The Next Frontier

We don’t have better algorithms, we just have more data.

Peter Norvig

Throughout this book, we have seen how Hadoop provides a platform for a broad range of data science applications on large datasets. With Hadoop and its ecosystem of tools such as Spark, Pig, and Hive, typical data science workflows can run efficiently and at scale on much larger datasets than ever before.

But, the availability of ...

