book

Software Architecture for Big Data and the Cloud

Name: Software Architecture for Big Data and the Cloud
ISBN: 9780128093382

by Ivan Mistrik, Rami Bahsoon, Nour Ali, Maritta Heisel, Bruce Maxim

June 2017

Beginner to intermediate

470 pages

19h 50m

English

Morgan Kaufmann

Read now

Unlock full access

Cover image
Title page
Table of Contents
Copyright
Contributors
About the Editors
Foreword by Mandy Chessell
Amnesia or Progress?
Foreword by Ian Gorton
Preface
IntroductionWhy a New Book on Software Architecture for Big Data and the Cloud?Book OutlinePart I: Concepts and ModelsPart II: Analyzing and EvaluatingPart III: TechnologiesPart IV: Resource ManagementPart V: Looking Ahead
Chapter 1: Introduction. Software Architecture for Cloud and Big Data: An Open Quest for the Architecturally Significant Requirements
Abstract1.1. A Perspective into Software Architecture for Cloud and Big Data1.2. Cloud Architecturally Significant Requirements and Their Design Implications1.3. Big Data Management as Cloud Architecturally Significant RequirementReferences

Part 1: Concepts and Models
Chapter 2: Hyperscalability – The Changing Face of Software Architecture
Abstract2.1. Introduction2.2. Hyperscalable Systems2.3. Principles of Hyperscalable Systems2.4. Related Work2.5. ConclusionsReferences
Chapter 3: Architecting to Deliver Value From a Big Data and Hybrid Cloud Architecture
Abstract3.1. Introduction3.2. Supporting the Analytics Lifecycle3.3. The Role of Data Lakes3.4. Key Design Features That Make a Data Lake Successful3.5. Architecture Example – Context Management in the IoT3.6. Big Data Origins and Characteristics3.7. The Systems That Capture and Process Big Data3.8. Operating Across Organizational Silos3.9. Architecture Example – Local Processing of Big Data3.10. Architecture Example – Creating a Multichannel View3.11. Application Independent Data3.12. Metadata and Governance3.13. Conclusions3.14. Outlook and Future DirectionsReferences
Chapter 4: Domain-Driven Design of Big Data Systems Based on a Reference Architecture
Abstract4.1. Introduction4.2. Domain-Driven Design Approach4.3. Related Work4.4. Feature Model of Big Data Systems4.5. Deriving the Application Architectures and Example4.6. ConclusionReferences
Chapter 5: An Architectural Model-Based Approach to Quality-Aware DevOps in Cloud Applicationsc
Abstract5.1. Introduction5.2. A Cloud-Based Software Application5.3. Differences in Architectural Models Among Development and Operations5.4. The iObserve Approach5.5. Addressing the Differences in Architectural Models5.6. Applying iObserve to CoCoME5.7. Limitations5.8. Related Work5.9. ConclusionReferences
Chapter 6: Bridging Ecology and Cloud: Transposing Ecological Perspective to Enable Better Cloud Autoscaling
AbstractAcknowledgement6.1. Introduction6.2. Motivation6.3. Natural Ecosystem6.4. Transposing Ecological Principles, Theories and Models to Cloud Ecosystem6.5. Ecology-Inspired Self-Aware Pattern6.6. Opportunities and Challenges6.7. Related Work6.8. ConclusionReferences
Part 2: Analyzing and Evaluating
Chapter 7: Evaluating Web PKIs
Abstract7.1. Introduction7.2. An Overview of PKI7.3. Desired Features and Security Concerns7.4. Existing Proposals7.5. Observations7.6. ConclusionReferences
Chapter 8: Performance Isolation in Cloud-Based Big Data Architectures
Abstract8.1. Introduction8.2. Background8.3. Case Study and Problem Statement8.4. Performance Monitoring in Cloud-Based Systems8.5. Application Framework for Performance Isolation8.6. Evaluation of the Framework8.7. Discussion8.8. Related Work8.9. ConclusionReferences
Chapter 9: From Legacy to Cloud: Risks and Benefits in Software Cloud Migration
Abstract9.1. Introduction9.2. Research Method9.3. Results9.4. Discussion9.5. ConclusionReferences
Chapter 10: Big Data: A Practitioners Perspective
Abstract10.1. Big Data Is a New Paradigm – Differences With Traditional Data Warehouse, Pitfalls and Consideration10.2. Product Considerations for Big Data – Use of Open Source Products for Big Data, Pitfalls and Considerations10.3. Use of Cloud for hosting Big Data – Why to Use Cloud, Pitfalls and Consideration10.4. Big Data Implementation – Architecture Definition, Processing Framework and Migration Pattern From Data Warehouse to Big Data10.5. ConclusionReferences
Part 3: Technologies
Chapter 11: A Taxonomy and Survey of Stream Processing Systems
Abstract11.1. Introduction11.2. Stream Processing Platforms: A Brief Background11.3. Taxonomy11.4. A Survey of Stream Processing Platforms11.5. Comparison Study of the Stream Processing Platforms11.6. Conclusions and Future DirectionsReferences
Chapter 12: Architecting Cloud Services for the Digital Me in a Privacy-Aware Environment
Abstract12.1. Introduction12.2. Example12.3. Challenges12.4. Preliminaries12.5. System-of-Systems Approach12.6. Generative Approach12.7. Related Work12.8. Discussion12.9. ConclusionReferences
Chapter 13: Reengineering Data-Centric Information Systems for the Cloud – A Method and Architectural Patterns Promoting Multitenancy
Abstract13.1. Introduction13.2. Context and Problem: Multitenancy in Cloud Computing13.3. Solution Overview: Reengineering Method and Process13.4. Solution Detail 1: Architectural Patterns in the Method13.5. Solution Detail 2: Testing and Code Reviews13.6. Case Study (Implementation)13.7. Discussion13.8. Related Work13.9. Summary and ConclusionsAppendix 13.A. Architectural Refactoring (AR) ReferenceReferences
Chapter 14: Exploring the Evolution of Big Data Technologies
Abstract14.1. Introduction14.2. Big Data in Our Daily Lives14.3. Data Intensive Computing14.4. Apache Hadoop14.5. Apache Spark14.6. The Role of Cloud Computing14.7. The Future of Big Data Platforms14.8. ConclusionReferences
Chapter 15: A Taxonomy and Survey of Fault-Tolerant Workflow Management Systems in Cloud and Distributed Computing Environments
Abstract15.1. Introduction15.2. Background15.3. Introduction to Fault-Tolerance15.4. Taxonomy of Faults15.5. Taxonomy of Fault-Tolerant Scheduling Algorithms15.6. Modeling of Failures in Workflow Management Systems15.7. Metrics Used to Quantify Fault-Tolerance15.8. Survey of Workflow Management Systems and Frameworks15.9. Tools and Support Systems15.10. SummaryReferences
Part 4: Resource Management
Chapter 16: The HARNESS Platform: A Hardware- and Network-Enhanced Software System for Cloud Computing
AbstractAcknowledgements16.1. Introduction16.2. Related Work16.3. Overview16.4. Managing Heterogeneity16.5. Prototype Description16.6. Evaluation16.7. ConclusionProject ResourcesReferences
Chapter 17: Auditable Version Control Systems in Untrusted Public Clouds
Abstract17.1. Motivation and Contributions17.2. Background Knowledge17.3. System and Adversarial Model17.4. Auditable Version Control Systems17.5. Discussion17.6. Other RDIC Approaches for Version Control Systems17.7. Evaluation17.8. ConclusionReferences
Chapter 18: Scientific Workflow Management System for Clouds
Abstract18.1. Introduction18.2. Background18.3. Workflow Management Systems for Clouds18.4. Cloudbus Workflow Management System18.5. Cloud-Based Extensions to the Workflow Engine18.6. Performance Evaluation18.7. Summary and ConclusionsReferences
Part 5: Looking Ahead
Chapter 19: Outlook and Future Directions
Abstract19.1. New or Advanced Applications19.2. Advanced Supporting Technologies19.3. Architecturally Significant Requirements19.4. Challenges for the Architecting Process19.5. Further ReadingReferences
Glossary
Author Index
Subject Index

Overview

Software Architecture for Big Data and the Cloud is designed to be a single resource that brings together research on how software architectures can solve the challenges imposed by building big data software systems. The challenges of big data on the software architecture can relate to scale, security, integrity, performance, concurrency, parallelism, and dependability, amongst others. Big data handling requires rethinking architectural solutions to meet functional and non-functional requirements related to volume, variety and velocity.

The book's editors have varied and complementary backgrounds in requirements and architecture, specifically in software architectures for cloud and big data, as well as expertise in software engineering for cloud and big data. This book brings together work across different disciplines in software engineering, including work expanded from conference tracks and workshops led by the editors.

Discusses systematic and disciplined approaches to building software architectures for cloud and big data with state-of-the-art methods and techniques
Presents case studies involving enterprise, business, and government service deployment of big data applications
Shares guidance on theory, frameworks, methodologies, and architecture for cloud and big data

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Scalable Big Data Architecture: A Practitioner’s Guide to Choosing Relevant Big Data Architecture

Publisher Resources

ISBN: 9780128093382

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills