Skip to Content
Google Cloud Platform for Developers
book

Google Cloud Platform for Developers

by Ted Hunter, Steven Porter
July 2018
Intermediate to advanced
506 pages
16h 2m
English
Packt Publishing
Content preview from Google Cloud Platform for Developers

BigQuery as a Cloud Dataflow Sink

Writing Cloud Dataflow results to BigQuery is a very common pattern for both stream ingestion and batch ETL processes. Dataflow provides a very powerful basis for transforming and conditioning data for storage, and BigQuery provides fast and expressive ad-hoc exploration of that data. Cloud Dataflow provides first-class support for integrating with BigQuery via the BigQueryIO reader and writer.

BigQueryIO automatically adapts how it writes to BigQuery based on whether the pipeline is processing bounded or unbounded data. For bounded datasets, BigQueryIO performs inserts using batch file uploads. For unbounded datasets, inserts are performed using streaming insert API calls. This behavior can be overridden ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Google Cloud Platform in Action

Google Cloud Platform in Action

John J. (JJ) Geewax
Google Cloud Platform for Architects

Google Cloud Platform for Architects

Vitthal Srinivasan, Loonycorn Ravi, Judy Raj

Publisher Resources

ISBN: 9781788837675Supplemental Content