This IBM® Redpaper™ publication provides guidance on building an enterprise-grade data lake by using IBM Spectrum Scale™ and Hortonworks Data Platform to perform in-place Hadoop or Spark-based analytics. It covers the benefits of the integrated solution and gives guidance on the available deployment models and the considerations for implementing them.
Hortonworks Data Platform (HDP) is a leading Hadoop and Spark distribution. HDP addresses the complete needs of data-at-rest, powers real-time customer applications, and delivers robust analytics that accelerate decision making and innovation.
IBM Spectrum Scale™ is flexible and scalable software-defined file storage for analytics workloads. Enterprises around the globe have deployed IBM Spectrum Scale to form large data lakes and content repositories for high-performance computing (HPC) and analytics workloads. It scales both performance and capacity without bottlenecks.
Table of contents
- Front cover
- Hortonworks Data Platform with IBM Spectrum Scale: Reference Guide for Building an Integrated Solution
- Hortonworks Data Platform
- IBM Spectrum Scale
- Integrated solution overview
- Component diagram
- Deployment models
- Shared Storage model
- Shared Nothing Storage model
- System configuration
- HDP and IBM Spectrum Scale frequently asked questions
- Additional references
- Now you can become a published author, too!
- Stay connected to IBM Redbooks
- Back cover
- Title: Hortonworks Data Platform with IBM Spectrum Scale: Reference Guide for Building an Integrated Solution
- Release date: June 2018
- Publisher(s): IBM Redbooks
- ISBN: 9780738456966