This IBM® Redpaper™ publication provides guidance on building an enterprise-grade data lake by using IBM Spectrum Scale™ and Hortonworks Data Platform to perform in-place Hadoop or Spark-based analytics. It covers the benefits of the integrated solution, describes the available deployment models, and outlines the considerations for implementing them.
Hortonworks Data Platform (HDP) is a leading Hadoop and Spark distribution. HDP addresses the complete needs of data-at-rest, powers real-time customer applications, and delivers robust analytics that accelerate decision making and innovation.
IBM Spectrum Scale™ is flexible and scalable software-defined file storage for analytics workloads. Enterprises around the globe have deployed IBM Spectrum Scale to form large data lakes and content repositories for high-performance computing (HPC) and analytics workloads. It can scale both performance and capacity without bottlenecks.
Table of contents
- Front cover
- Hortonworks Data Platform with IBM Spectrum Scale: Reference Guide for Building an Integrated Solution
- Hortonworks Data Platform
- IBM Spectrum Scale
- Integrated solution overview
- Component diagram
- Deployment models
- Shared Storage model
- Shared Nothing Storage model
- System configuration
- HDP and IBM Spectrum Scale frequently asked questions
- Additional references
- Now you can become a published author, too!
- Stay connected to IBM Redbooks
- Back cover
- Title: Hortonworks Data Platform with IBM Spectrum Scale: Reference Guide for Building an Integrated Solution
- Release date: June 2018
- Publisher(s): IBM Redbooks
- ISBN: 9780738456966