12. Data Science with Hadoop—The Next Frontier

We don’t have better algorithms, we just have more data.

Peter Norvig

Throughout this book we have seen how Hadoop provides a platform that enables a broad set of applications of data science for large datasets. With Hadoop, and utilizing its ecosystem of tools such as Spark, Pig, and Hive, it is possible to run typical data science flows in an efficient and scalable manner on much larger datasets than ever before.

But, the availability of ...

Get Practical Data Science with Hadoop® and Spark: Designing and Building Effective Analytics at Scale now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.