Skip to Content
Google BigQuery Analytics
book

Google BigQuery Analytics

by Siddartha Naidu, Jordan Tigani
June 2014
Intermediate to advanced
528 pages
13h 54m
English
Wiley
Content preview from Google BigQuery Analytics

Chapter 11Managing Data Stored in BigQuery

The previous chapters cover how BigQuery simplifies analytics over large datasets. BigQuery also has features to simplify data management and the integration of analytics into an application. This chapter covers those features and how to handle common data warehousing tasks using them.

Query Caching

As discussed in Chapter 7, “Running Queries,” BigQuery has an auto-caching feature that enables it to reuse results across identical queries. This feature is convenient because it is transparent to the user but is limited to instances in which the service can guarantee that existing results from a prior query job are identical to the results that would be generated by running the query again, which we will elaborate on below. The application developer, on the other hand, knows a great deal more about the use case. So when the application can trade freshness for execution cost, it is possible to further reduce query costs by directly managing caching. With many data warehousing systems, it is necessary to utilize a separate caching framework, for example Memcached, to reduce load on the query engine or the latency of operations in a front end. With BigQuery it is usually feasible to avoid a separate caching framework for query results by leveraging the feature that query results are actually new tables that can be assigned an explicit name. Different parts of the application can interact with the same query result by accessing the appropriate ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

BigQuery for Data Warehousing: Managed Data Analysis in the Google Cloud

BigQuery for Data Warehousing: Managed Data Analysis in the Google Cloud

Mark Mucchetti
Advanced Analytics with PySpark

Advanced Analytics with PySpark

Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen, Josh Wills

Publisher Resources

ISBN: 9781118824795Purchase book