In the previous example, the pipeline operated on a single input file from Cloud Storage. Because this is a bounded input, the pipeline executed as a batch job. We can alternatively configure the pipeline to pull messages from a Cloud Pub/Sub topic, which is an unbounded dataset and hence results in a streaming job.
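The switch is confined to the source transform and the pipeline's streaming flag. The sketch below uses the Apache Beam Python SDK to illustrate the idea; the project and topic names are hypothetical placeholders, and the book's own example code may differ in language and structure.

```python
import apache_beam as beam
from apache_beam.options.pipeline_options import PipelineOptions, StandardOptions

# Hypothetical topic path; substitute your own project and topic.
TOPIC = 'projects/my-project/topics/my-topic'

options = PipelineOptions()
# Pub/Sub is an unbounded source, so the pipeline must run in streaming mode.
options.view_as(StandardOptions).streaming = True

with beam.Pipeline(options=options) as p:
    messages = (
        p
        # Swapping ReadFromText (bounded, batch) for ReadFromPubSub
        # (unbounded, streaming) is the only change to the source stage;
        # downstream transforms can stay the same.
        | 'ReadMessages' >> beam.io.ReadFromPubSub(topic=TOPIC)
    )
```

Because the rest of the pipeline is expressed against a PCollection rather than a file, the same transforms apply whether the source is bounded or unbounded.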
In many cases, inferences need to be made against sets of data with a clear beginning and end. For bounded datasets, the beginning and end occur naturally as the boundaries of the dataset. Streaming datasets, however, lack such clearly defined beginnings and endings. To address this, many stream processing tools introduce the concept of windowing: imposing a start and an end on an otherwise unbounded stream so that computations can be carried out over finite slices of the data.
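As a rough sketch of what windowing looks like in the Beam Python SDK, the pipeline below divides the Pub/Sub stream into fixed 60-second windows and counts the messages in each window. The topic path, the 60-second window size, and the per-window count are illustrative assumptions, not the book's exact example.

```python
import apache_beam as beam
from apache_beam.options.pipeline_options import PipelineOptions, StandardOptions
from apache_beam.transforms import window

options = PipelineOptions()
options.view_as(StandardOptions).streaming = True

with beam.Pipeline(options=options) as p:
    counts_per_window = (
        p
        | 'ReadMessages' >> beam.io.ReadFromPubSub(
              topic='projects/my-project/topics/my-topic')
        # Impose 60-second fixed windows so downstream aggregations have a
        # clear beginning and end even though the stream itself does not.
        | 'Window' >> beam.WindowInto(window.FixedWindows(60))
        # Count the messages that fall into each window; without_defaults()
        # suppresses the global default output that CombineGlobally would
        # otherwise emit, which is required on an unbounded, windowed input.
        | 'CountPerWindow' >> beam.combiners.Count.Globally().without_defaults()
    )
```

Fixed windows are only one option; sliding and session windows follow the same pattern, differing only in how the start and end of each slice are determined.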