12. Data Science with Hadoop—The Next Frontier

We don’t have better algorithms, we just have more data.

Peter Norvig

Throughout this book we have seen how Hadoop provides a platform for a broad range of data science applications on large datasets. With Hadoop and its ecosystem of tools such as Spark, Pig, and Hive, it is possible to run typical data science workflows efficiently and at scale on much larger datasets than ever before.

But, the availability of ...

