Learn the basics of how Hadoop works, why it is such an important technology, and how to use it without getting mired in the details.
Donald Miner is the founder of the data science firm Miner & Kasch and specializes in Hadoop enterprise architecture and applying machine learning to real-world business problems. Donald is the author of the O'Reilly book MapReduce Design Patterns and the upcoming O'Reilly book Enterprise Hadoop. He has architected and implemented dozens of mission-critical and large-scale Hadoop systems within the U.S. Government and Fortune 500 companies. He has applied machine learning techniques to analyze data across several verticals, including financial, retail, telecommunications, health care, government intelligence, and entertainment. He earned his PhD at the University of Maryland, Baltimore County, where he focused on machine learning and multi-agent systems. He lives in Maryland with his wife and two young sons.
Learn how to use Python with the Hadoop Distributed File System, MapReduce, the Apache Pig platform and Pig Latin script, and the Apache Spark cluster-computing framework.