Picking a Distribution and Version of HadoopApache HadoopCloudera’s Distribution Including Apache HadoopVersions and FeaturesWhat Should I Use?Hardware SelectionMaster Hardware SelectionNamenode considerationsSecondary namenode hardwareJobtracker hardwareWorker Hardware SelectionCluster SizingBlades, SANs, and VirtualizationOperating System Selection and PreparationDeployment LayoutSoftwareHostnames, DNS, and IdentificationUsers, Groups, and PrivilegesKernel Tuningvm.swappinessvm.overcommit_memoryDisk ConfigurationChoosing a Filesystemext3ext4xfsMount OptionsNetwork DesignNetwork Usage in Hadoop: A ReviewHDFSMapReduce1 Gb versus 10 Gb NetworksTypical Network TopologiesTraditional treeSpine fabric