Chapter 5: Catalog Your Data

Introduction

A project data catalog gives users a clear understanding of your data sets and helps answer the following questions:

● What do the variables in each data set represent?

● What do the values of each variable mean?

● What data is available for this project?

● Where is the data located?

● How are my data sets related to each other?

The codebook that you learned to create with the %TK_codebook macro is an important part of the data catalog and answers the first two questions. The Data Detective’s Toolkit provides two additional macro programs to help you create documentation that answers the last three questions. The first macro data tool, %TK_inventory, will answer the third and fourth questions by automatically ...
