B. HDFS Quick Start

This appendix is intended for readers who have little or no experience with the Hadoop Distributed File System (HDFS). It provides minimal background on a few commands that will help you get started with Apache Hadoop. It is not a full description of HDFS and omits many important commands and features. In addition to this quick start, you are strongly advised to consult these two resources:

http://hadoop.apache.org/docs/stable1/hdfs_design.html

http://developer.yahoo.com/hadoop/tutorial/module2.html

The following section is a quick command reference that may help you get started with HDFS. Be aware that there are alternative options for each command, and the examples below are simple use ...
