Data Engineering with Google Cloud Platform

by Adi Wijaya
March 2022
Content level: Beginner to intermediate
440 pages
9h 43m
English
Packt Publishing

Chapter 6: Processing Streaming Data with Pub/Sub and Dataflow

Processing streaming data is becoming increasingly popular because it gives businesses real-time metrics on their operations. This chapter describes which paradigm to use for streaming data, and when. It also covers how to apply transformations to streaming data using Cloud Dataflow, and how to store the processed records in BigQuery for analysis.

Streaming data is easier to learn by doing, so we will practice by building a streaming data pipeline on Google Cloud Platform (GCP). We will use two GCP services, Pub/Sub and Dataflow, both of which are essential to a streaming data pipeline. We will use the same dataset as ...

ISBN: 9781800561328