December 2016
Beginner to intermediate
256 pages
7h 26m
English
We don’t have better algorithms, we just have more data.
Peter Norvig
Throughout this book we have seen how Hadoop provides a platform that enables a broad set of applications of data science for large datasets. With Hadoop, and utilizing its ecosystem of tools such as Spark, Pig, and Hive, it is possible to run typical data science flows in an efficient and scalable manner on much larger datasets than ever before.
But, the availability of ...
Read now
Unlock full access