Cloud Dataflow
Google Cloud Dataflow is a managed data transformation service, with a unified data processing model designed to process both unbounded and bounded datasets. Cloud Dataflow is a serverless platform—developers write code in the form of pipelines, and submit those pipelines to Cloud Dataflow for execution. There are no servers or other infrastructure to manage, allowing teams to quickly get up and running with large-scale data transformations. The core design of Cloud Dataflow allows for advanced concepts, such as autoscaling workers and dynamically rebalancing workloads across those workers, greatly lowering execution time while maximizing efficiency.
With integrations across the Google Cloud Platform catalog, Cloud Dataflow ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access