CHAPTER 5

The Raw Data

Hard to Digest Until Processed

Raw data is the expression used to describe the original data before any ­analysis is undertaken. It is not a very palatable phrase. Something like ­“original data” or “new data” would have been more inviting, but I have to stick to convention. The purpose of this chapter is to explain the different kinds of data and present a number of definitions to be used in the chapters that follow. In addition, I will demonstrate how figures can mislead or confuse even before the statistical analysis has started.

Descriptive or Numerical

Data may be descriptive or numerical. Descriptive data, which is also called categorical, can be placed in categories and counted. Recording the way people vote in ...

Get Better Business Decisions from Data now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.