Chapter 11. Calgary Tools
In this chapter, you will learn about a collection of tools that allow you to tap into data sets up to billions of rows with runtimes in seconds. Yes, billions. Yes, in seconds. You are going to learn what the Calgary tools are and, more importantly, how to use them and why they are so powerful if you are ever working with large data sets.
Calgary is a special type of proprietary data engine from Alteryx that allows you to store and conduct analysis on Calgary databases, which can contain billions of records. In more practical terms, this means that you can load a huge amount of data into a Calgary database—a proprietary file database format that can be stored like any other files on your machine and with indexes—and the time it takes to retrieve that data is orders of magnitude faster than if you were to pull from a standard database.
If you have spent a decent amount of time working with large data sets, you know that nothing is more frustrating than building queries to count items (number of products, customers, locations) from millions (or even billions) of rows of data. It’s painfully slow without the right software. Calgary tools allow you to not only break that barrier but do so with a level of flexibility that allows you to write the specific queries you need (e.g., the number of products with at least $10M in sales and 500 customers).
Alteryx designed the Calgary tools with their users in mind. The UI and configuration of these tools is quite ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access