Big Data

Book description

Big Data: Principles and Paradigms captures the state-of-the-art research on the architectural aspects, technologies, and applications of Big Data. The book identifies potential future directions and technologies that facilitate insight into numerous scientific, business, and consumer applications.

To help realize Big Data’s full potential, the book addresses numerous challenges, offering the conceptual and technological solutions for tackling them. These challenges include life-cycle data management, large-scale storage, flexible processing infrastructure, data modeling, scalable machine learning, data analysis algorithms, sampling techniques, and privacy and ethical issues.

  • Covers computational platforms supporting Big Data applications
  • Addresses key principles underlying Big Data computing
  • Examines key developments supporting next generation Big Data platforms
  • Explores the challenges in Big Data computing and ways to overcome them
  • Contains expert contributors from both academia and industry

Table of contents

  1. Cover image
  2. Title page
  3. Table of Contents
  4. Copyright
  5. List of contributors
  6. About the Editors
  7. Preface
    1. Organization of the Book
    2. Part I: Big Data Science
    3. Part II: Big Data Infrastructures and Platforms
    4. Part III: Big Data Security and Privacy
    5. Part IV: Big Data Applications
  8. Acknowledgments
  9. Part I: Big Data Science
    1. Chapter 1: Big Data Analytics = Machine Learning + Cloud Computing
      1. Abstract
      2. 1.1 Introduction
      3. 1.2 A Historical Review of Big Data
      4. 1.3 Historical Interpretation of Big Data
      5. 1.4 Defining Big Data From 3Vs to 32Vs
      6. 1.5 Big Data Analytics and Machine Learning
      7. 1.6 Big Data Analytics and Cloud Computing
      8. 1.7 Hadoop, HDFS, MapReduce, Spark, and Flink
      9. 1.8 ML + CC → BDA and Guidelines
      10. 1.9 Conclusion
    2. Chapter 2: Real-Time Analytics
      1. Abstract
      2. 2.1 Introduction
      3. 2.2 Computing Abstractions for Real-Time Analytics
      4. 2.3 Characteristics of Real-Time Systems
      5. 2.4 Real-Time Processing for Big Data — Concepts and Platforms
      6. 2.5 Data Stream Processing Platforms
      7. 2.6 Data Stream Analytics Platforms
      8. 2.7 Data Analysis and Analytic Techniques
      9. 2.8 Finance Domain Requirements and a Case Study
      10. 2.9 Future Research Challenges
    3. Chapter 3: Big Data Analytics for Social Media
      1. Abstract
      2. Acknowledgments
      3. 3.1 Introduction
      4. 3.2 NLP and Its Applications
      5. 3.3 Text Mining
      6. 3.4 Anomaly Detection
    4. Chapter 4: Deep Learning and Its Parallelization
      1. Abstract
      2. 4.1 Introduction
      3. 4.2 Concepts and Categories of Deep Learning
      4. 4.3 Parallel Optimization for Deep Learning
      5. 4.4 Discussions
    5. Chapter 5: Characterization and Traversal of Large Real-World Networks
      1. Abstract
      2. Acknowledgments
      3. 5.1 Introduction
      4. 5.2 Background
      5. 5.3 Characterization and Measurement
      6. 5.4 Efficient Complex Network Traversal
      7. 5.5 k-Core-Based Partitioning for Heterogeneous Graph Processing
      8. 5.6 Future Directions
      9. 5.7 Conclusions
  10. Part II: Big Data Infrastructures and Platforms
    1. Chapter 6: Database Techniques for Big Data
      1. Abstract
      2. 6.1 Introduction
      3. 6.2 Background
      4. 6.3 NoSQL Movement
      5. 6.4 NoSQL Solutions for Big Data Management
      6. 6.5 NoSQL Data Models
      7. 6.6 Future Directions
      8. 6.7 Conclusions
    2. Chapter 7: Resource Management in Big Data Processing Systems
      1. Abstract
      2. 7.1 Introduction
      3. 7.2 Types of Resource Management
      4. 7.3 Big Data Processing Systems and Platforms
      5. 7.4 Single-Resource Management in the Cloud
      6. 7.5 Multiresource Management in the Cloud
      7. 7.6 Related Work on Resource Management
      8. 7.7 Open Problems
      9. 7.8 Summary
    3. Chapter 8: Local Resource Consumption Shaping: A Case for MapReduce
      1. Abstract
      2. 8.1 Introduction
      3. 8.2 Motivation
      4. 8.3 Local Resource Shaper
      5. 8.4 Evaluation
      6. 8.5 Related Work
      7. 8.6 Conclusions
      8. Appendix CPU Utilization With Different Slot Configurations and LRS
    4. Chapter 9: System Optimization for Big Data Processing
      1. Abstract
      2. 9.1 Introduction
      3. 9.2 Basic Framework of the Hadoop Ecosystem
      4. 9.3 Parallel Computation Framework: MapReduce
      5. 9.4 Job Scheduling of Hadoop
      6. 9.5 Performance Optimization of HDFS
      7. 9.6 Performance Optimization of HBase
      8. 9.7 Performance Enhancement of Hadoop System
      9. 9.8 Conclusions and Future Directions
    5. Chapter 10: Packing Algorithms for Big Data Replay on Multicore
      1. Abstract
      2. 10.1 Introduction
      3. 10.2 Performance Bottlenecks
      4. 10.3 The Big Data Replay Method
      5. 10.4 Packing Algorithms
      6. 10.5 Performance Analysis
      7. 10.6 Summary and Future Directions
  11. Part III: Big Data Security and Privacy
    1. Chapter 11: Spatial Privacy Challenges in Social Networks
      1. Abstract
      2. Acknowledgments
      3. 11.1 Introduction
      4. 11.2 Background
      5. 11.3 Spatial Aspects of Social Networks
      6. 11.4 Cloud-Based Big Data Infrastructure
      7. 11.5 Spatial Privacy Case Studies
      8. 11.6 Conclusions
    2. Chapter 12: Security and Privacy in Big Data
      1. Abstract
      2. 12.1 Introduction
      3. 12.2 Secure Queries Over Encrypted Big Data
      4. 12.3 Other Big Data Security
      5. 12.4 Privacy on Correlated Big Data
      6. 12.5 Future Directions
      7. 12.6 Conclusions
    3. Chapter 13: Location Inferring in Internet of Things and Big Data
      1. Abstract
      2. Acknowledgements
      3. 13.1 Introduction
      4. 13.2 Device-based Sensing Using Big Data
      5. 13.3 Device-free Sensing Using Big Data
      6. 13.4 Conclusion
  12. Part IV: Big Data Applications
    1. Chapter 14: A Framework for Mining Thai Public Opinions
      1. Abstract
      2. Acknowledgments
      3. 14.1 Introduction
      4. 14.2 XDOM
      5. 14.3 Implementation
      6. 14.4 Validation
      7. 14.5 Case Studies
      8. 14.6 Summary and Conclusions
    2. Chapter 15: A Case Study in Big Data Analytics: Exploring Twitter Sentiment Analysis and the Weather
      1. Abstract
      2. Acknowledgments
      3. 15.1 Background
      4. 15.2 Big Data System Components
      5. 15.3 Machine-Learning Methodology
      6. 15.4 System Implementation
      7. 15.5 Key Findings
      8. 15.6 Summary and Conclusions
    3. Chapter 16: Dynamic Uncertainty-Based Analytics for Caching Performance Improvements in Mobile Broadband Wireless Networks
      1. Abstract
      2. 16.1 Introduction
      3. 16.2 Background
      4. 16.3 Related Work
      5. 16.4 VoD Architecture
      6. 16.5 Overview
      7. 16.6 Data Generation
      8. 16.7 Edge and Core Components
      9. 16.8 INCA Caching Algorithm
      10. 16.9 QoE Estimation
      11. 16.10 Theoretical Framework
      12. 16.11 Experiments and Results
      13. 16.12 Synthetic Dataset
      14. 16.13 Conclusions and Future Directions
    4. Chapter 17: Big Data Analytics on a Smart Grid: Mining PMU Data for Event and Anomaly Detection
      1. Abstract
      2. Acknowledgments
      3. 17.1 Introduction
      4. 17.2 Smart Grid With PMUs and PDCs
      5. 17.3 Improving Traditional Workflow
      6. 17.4 Characterizing Normal Operation
      7. 17.5 Identifying Unusual Phenomena
      8. 17.6 Identifying Known Events
      9. 17.7 Related Efforts
      10. 17.8 Conclusion and Future Directions
    5. Chapter 18: eScience and Big Data Workflows in Clouds: A Taxonomy and Survey
      1. Abstract
      2. 18.1 Introduction
      3. 18.2 Background
      4. 18.3 Taxonomy and Review of eScience Services in the Cloud
      5. 18.4 Resource Provisioning for eScience Workflows in Clouds
      6. 18.5 Open Problems
      7. 18.6 Summary
  13. Index

Product information

  • Title: Big Data
  • Author(s): Rajkumar Buyya, Rodrigo N. Calheiros, Amir Vahid Dastjerdi
  • Release date: June 2016
  • Publisher(s): Morgan Kaufmann
  • ISBN: 9780128093467