Google Cloud Platform for Developers

by Ted Hunter, Steven Porter
July 2018
Intermediate to advanced
506 pages
16h 2m
English
Packt Publishing
Managing Cloud Dataflow jobs

Once a pipeline is up and running, the options for managing its execution are limited. Currently, developers may either cancel or drain a running job. Canceling a job halts execution almost immediately, which makes it a good option for idempotent pipelines: those where no state is lost when ingestion is interrupted and where reprocessing elements has no side effects. For example, a pipeline that loads a CSV file from Cloud Storage into a BigQuery table using truncate-and-reload can likely be canceled mid-job and run again at a later date.
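As a rough illustration of the two options, the sketch below builds the request body that the Dataflow REST API (v1b3) `projects.locations.jobs.update` method takes to cancel or drain a job. The helper names `stop_request_body` and `stop_job` are hypothetical and not from the book, and `stop_job` assumes that the `google-api-python-client` package is installed and that application default credentials are configured.

```python
from typing import Dict

# Requested-state values used by the Dataflow REST API (v1b3) jobs.update method.
_REQUESTED_STATES = {
    "cancel": "JOB_STATE_CANCELLED",
    "drain": "JOB_STATE_DRAINED",
}


def stop_request_body(action: str) -> Dict[str, str]:
    """Build the jobs.update body for a cancel or drain request (hypothetical helper)."""
    if action not in _REQUESTED_STATES:
        raise ValueError(f"action must be 'cancel' or 'drain', got {action!r}")
    return {"requestedState": _REQUESTED_STATES[action]}


def stop_job(project_id: str, region: str, job_id: str, action: str) -> dict:
    """Send the request to Dataflow (hypothetical helper).

    Assumes google-api-python-client is installed and default credentials exist.
    """
    # Imported lazily so the pure helper above works without the client library.
    from googleapiclient.discovery import build

    dataflow = build("dataflow", "v1b3")
    request = dataflow.projects().locations().jobs().update(
        projectId=project_id,
        location=region,
        jobId=job_id,
        body=stop_request_body(action),
    )
    return request.execute()
```

With placeholder identifiers, a call such as `stop_job("my-project", "us-central1", job_id, "cancel")` would request cancellation of that job. The same operations are available from the command line as `gcloud dataflow jobs cancel` and `gcloud dataflow jobs drain`.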

However, canceling pipelines that consume data destructively, such as those with a PubsubIO source, will likely result in lost data. For cases like this, ...

ISBN: 9781788837675