Table of Contents
Preface
Part 1: Getting Started with Data Engineering with GCP
1
Fundamentals of Data Engineering
Understanding the data life cycle
Understanding the need for a data warehouse
Start with knowing the roles of a data engineer
A data engineer versus a data scientist
The focus of data engineers
Going through the foundational concepts for data engineering
ETL concept in data engineering
The difference between ETL and ELT
What is not big data?
A quick look at how big data technologies store data
A quick look at how to process multiple files using MapReduce
Summary
Exercise
Further Reading
2
Big Data Capabilities on GCP
Technical requirements
Understanding what the cloud is
The difference between the cloud and non-cloud era
The on-demand ...
Get Data Engineering with Google Cloud Platform - Second Edition now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.