Google BigQuery
While data processing engines such as Cloud Dataflow and Hadoop offer extreme computational power, they do so by following a well-defined execution plan, often with long delays in converting new data into usable insights. For many analytics workflows, this turnaround time is critical. As an example, suppose a marketing executive needs to know the effectiveness of recent changes to a marketing campaign for a given set of regions and a given demographic. Also suppose that the size of data involved is in the order of terabytes. These answers could certainly be determined using the likes of MapReduce or Dataflow, but doing so would involve developing, testing, and validating a new pipeline. If the results prompt further questions, ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access