O'Reilly logo

Practical Data Science with Hadoop® and Spark: Designing and Building Effective Analytics at Scale by Douglas Eadline, Casey Stella, Ofer Mendelevitch

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

B. HDFS Quick Start

This appendix is intended for those that have little or no experience with the Hadoop Distributed File System (HDFS). The following is intended to provide minimal background on a few commands that will help get you started with Apache Hadoop. It is not a full description of HDFS and is missing many of the important commands and features. In addition to this quick start, you are strongly advised to consult these two resources:

http://hadoop.apache.org/docs/stable1/hdfs_design.html

http://developer.yahoo.com/hadoop/tutorial/module2.html

The following section is a quick command reference that may help you get started with HDFS. Be aware that there are alternative options for each command, and the examples below are simple use ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required