April 2018
Beginner to intermediate
282 pages
9h 53m
English
Cloud Dataproc also has built-in integration with other Google Cloud Platform services, such as Cloud Storage, BigQuery, and Bigtable. So, along with a Spark or Hadoop cluster, we can set up a complete data platform. For example, you can use Cloud Dataproc to effortlessly get ETL and terabytes of raw log data directly into BigQuery for business reporting.
Another use case of Cloud Storage object storage is storing data that needs to be processed in Dataproc instead of using Hadoop Distributed File System.