Practical Data Science with Hadoop® and Spark: Designing and Building Effective Analytics at Scale
by Ofer Mendelevitch, Casey Stella, Douglas Eadline
B. HDFS Quick Start
This appendix is intended for readers who have little or no experience with the Hadoop Distributed File System (HDFS). It provides minimal background on a few commands that will help you get started with Apache Hadoop. It is not a full description of HDFS and omits many important commands and features. In addition to this quick start, you are strongly advised to consult these two resources:
http://hadoop.apache.org/docs/stable1/hdfs_design.html
http://developer.yahoo.com/hadoop/tutorial/module2.html
The following section is a quick command reference that may help you get started with HDFS. Be aware that there are alternative options for each command, and the examples below are simple use ...