12. Data Science with Hadoop—The Next Frontier
We don’t have better algorithms, we just have more data.
Throughout this book we have seen how Hadoop provides a platform that enables a broad set of applications of data science for large datasets. With Hadoop, and utilizing its ecosystem of tools such as Spark, Pig, and Hive, it is possible to run typical data science flows in an efficient and scalable manner on much larger datasets than ever before.
But, the availability of ...