July 2018
Intermediate to advanced
506 pages
16h 2m
English
Datastore is not an ideal platform for analytical workloads. For these cases, it is often ideal to leverage external services. There are generally two paths to getting data off of Datastore for analytics: exporting to Cloud Storage and ingestion using Dataflow. We've already seen how to perform exports to Cloud Storage. Once there, data may be consumed by other services such as BigQuery or Dataproc.
A more streamlined and powerful approach is to leverage the Dataflow Datastore IO integration. This allows data to be processed in a natural form without the need to perform bulk imports/exports. Additionally, Dataflow can insert any resulting data back into Datastore once complete, providing a clear path for extract, transform, ...