Google Cloud Platform for Developers

by Ted Hunter, Steven Porter
July 2018
Intermediate to advanced
506 pages
16h 2m
English
Packt Publishing

Executing streaming pipelines

In the previous example, the pipeline operated on a single input file from Cloud Storage. Because this is a bounded input, the pipeline executed as a batch job. We can alternatively configure the pipeline to pull messages from a Cloud Pub/Sub topic, which is an unbounded dataset and hence results in a streaming job.

In many cases, inferences must be made against sets of data with a clear beginning and end. For bounded datasets, these occur naturally as the boundaries of the dataset. Streaming datasets, however, lack such clearly defined boundaries. To address this, many stream processing tools introduce the concept of windowing, or simply imposing a start and ...
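
To make the idea of windowing concrete, the following is a minimal pure-Python sketch of one common strategy, fixed (non-overlapping) windows. It is not the Dataflow or Apache Beam API, and it is not taken from the book: the function names, the sample events, and the 60-second window size are all assumptions chosen for illustration. Each event carries a timestamp in seconds, and the sketch assigns it to the window whose half-open interval [start, end) contains that timestamp.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# Hypothetical helper names; these are illustrative and not part of any SDK.

def fixed_window_start(timestamp: int, size: int) -> int:
    """Return the start of the fixed window that contains `timestamp`."""
    return timestamp - (timestamp % size)


def group_by_fixed_window(
    events: Iterable[Tuple[int, str]], size: int
) -> Dict[Tuple[int, int], List[str]]:
    """Group (timestamp, value) events into half-open windows [start, start + size)."""
    windows: Dict[Tuple[int, int], List[str]] = defaultdict(list)
    for timestamp, value in events:
        start = fixed_window_start(timestamp, size)
        windows[(start, start + size)].append(value)
    return dict(windows)


# Assumed sample data: (event time in seconds, payload).
events = [(5, "a"), (12, "b"), (59, "c"), (60, "d"), (125, "e")]

# Assumed window size of 60 seconds.
windows = group_by_fixed_window(events, size=60)

for (start, end), values in sorted(windows.items()):
    print(f"[{start}, {end}): {values}")
```

In a real streaming job the input never ends, so a processing framework would emit each window's result once it decides the window is complete, rather than collecting everything into a dictionary first. The sketch only shows how a start and end are imposed on otherwise unbounded data.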



Publisher Resources

ISBN: 9781788837675