Official Google Cloud Certified Professional Data Engineer Study Guide

Book description

The proven Study Guide that prepares you for this new Google Cloud exam

The Google Cloud Certified Professional Data Engineer Study Guide, provides everything you need to prepare for this important exam and master the skills necessary to land that coveted Google Cloud Professional Data Engineer certification. Beginning with a pre-book assessment quiz to evaluate what you know before you begin, each chapter features exam objectives and review questions, plus the online learning environment includes additional complete practice tests. 

Written by Dan Sullivan, a popular and experienced online course author for machine learning, big data, and Cloud topics, Google Cloud Certified Professional Data Engineer Study Guide is your ace in the hole for deploying and managing analytics and machine learning applications. 

•    Build and operationalize storage systems, pipelines, and compute infrastructure

•    Understand machine learning models and learn how to select pre-built models

•    Monitor and troubleshoot machine learning models

•    Design analytics and machine learning applications that are secure, scalable, and highly available. 

This exam guide is designed to help you develop an in depth understanding of data engineering and machine learning on Google Cloud Platform.

Table of contents

  1. Cover
  2. Acknowledgments
  3. About the Author
  4. About the Technical Editor
  5. Introduction
  6. Assessment Test
  7. Answers to Assessment Test
  8. Chapter 1 Selecting Appropriate Storage Technologies
    1. From Business Requirements to Storage Systems
    2. Technical Aspects of Data: Volume, Velocity, Variation, Access, and Security
    3. Types of Structure: Structured, Semi-Structured, and Unstructured
    4. Schema Design Considerations
    5. Exam Essentials
    6. Review Questions
  9. Chapter 2 Building and Operationalizing Storage Systems
    1. Cloud SQL
    2. Cloud Spanner
    3. Cloud Bigtable
    4. Cloud Firestore
    5. BigQuery
    6. Cloud Memorystore
    7. Cloud Storage
    8. Unmanaged Databases
    9. Exam Essentials
    10. Review Questions
  10. Chapter 3 Designing Data Pipelines
    1. Overview of Data Pipelines
    2. GCP Pipeline Components
    3. Migrating Hadoop and Spark to GCP
    4. Exam Essentials
    5. Review Questions
  11. Chapter 4 Designing a Data Processing Solution
    1. Designing Infrastructure
    2. Designing for Distributed Processing
    3. Migrating a Data Warehouse
    4. Exam Essentials
    5. Review Questions
  12. Chapter 5 Building and Operationalizing Processing Infrastructure
    1. Provisioning and Adjusting Processing Resources
    2. Monitoring Processing Resources
    3. Exam Essentials
    4. Review Questions
  13. Chapter 6 Designing for Security and Compliance
    1. Identity and Access Management with Cloud IAM
    2. Using IAM with Storage and Processing Services
    3. Data Security
    4. Ensuring Privacy with the Data Loss Prevention API
    5. Legal Compliance
    6. Exam Essentials
    7. Review Questions
  14. Chapter 7 Designing Databases for Reliability, Scalability, and Availability
    1. Designing Cloud Bigtable Databases for Scalability and Reliability
    2. Designing Cloud Spanner Databases for Scalability and Reliability
    3. Designing BigQuery Databases for Data Warehousing
    4. Exam Essentials
    5. Review Questions
  15. Chapter 8 Understanding Data Operations for Flexibility and Portability
    1. Cataloging and Discovery with Data Catalog
    2. Data Preprocessing with Dataprep
    3. Visualizing with Data Studio
    4. Exploring Data with Cloud Datalab
    5. Orchestrating Workflows with Cloud Composer
    6. Exam Essentials
    7. Review Questions
  16. Chapter 9 Deploying Machine Learning Pipelines
    1. Structure of ML Pipelines
    2. GCP Options for Deploying Machine Learning Pipeline
    3. Exam Essentials
    4. Review Questions
  17. Chapter 10 Choosing Training and Serving Infrastructure
    1. Hardware Accelerators
    2. Distributed and Single Machine Infrastructure
    3. Edge Computing with GCP
    4. Exam Essentials
    5. Review Questions
  18. Chapter 11 Measuring, Monitoring, and Troubleshooting Machine Learning Models
    1. Three Types of Machine Learning Algorithms
    2. Deep Learning
    3. Engineering Machine Learning Models
    4. Common Sources of Error in Machine Learning Models
    5. Exam Essentials
    6. Review Questions
  19. Chapter 12 Leveraging Prebuilt Models as a Service
    1. Sight
    2. Conversation
    3. Language
    4. Structured Data
    5. Exam Essentials
    6. Review Questions
  20. Appendix Answers to Review Questions
    1. Chapter 1: Selecting Appropriate Storage Technologies
    2. Chapter 2: Building and Operationalizing Storage Systems
    3. Chapter 3: Designing Data Pipelines
    4. Chapter 4: Designing a Data Processing Solution
    5. Chapter 5: Building and Operationalizing Processing Infrastructure
    6. Chapter 6: Designing for Security and Compliance
    7. Chapter 7: Designing Databases for Reliability, Scalability, and Availability
    8. Chapter 8: Understanding Data Operations for Flexibility and Portability
    9. Chapter 9: Deploying Machine Learning Pipelines
    10. Chapter 10: Choosing Training and Serving Infrastructure
    11. Chapter 11: Measuring, Monitoring, and Troubleshooting Machine Learning Models
    12. Chapter 12: Leveraging Prebuilt Models as a Service
  21. Index
  22. End User License Agreement

Product information

  • Title: Official Google Cloud Certified Professional Data Engineer Study Guide
  • Author(s): Dan Sullivan
  • Release date: June 2020
  • Publisher(s): Sybex
  • ISBN: 9781119618430