CHAPTER 6
Regression Models with Categorical Variables
After reading this chapter you will understand:
- What a categorical variable is.
- How to handle the inclusion of one or more categorical variables in a regression when they are explanatory variables.
- How to test for the statistical significance of individual dummy variables in a regression and how to employ the Chow test.
- Models that can be used when the dependent variable is a categorical variable: the linear probability model, the logit regression model, and the probit regression model.
- The advantages and disadvantages of each type of model for dealing with situations where the dependent variable is a categorical variable.
Categorical variables are variables that represent group membership. For example, given a set of bonds, the credit rating is a categorical variable that indicates to what category—AAA, AA, A, BBB, BB, and so on—each bond belongs. A categorical variable does not have a numerical value or a numerical interpretation in itself. Thus the fact that a bond is in category AAA or BBB does not, in itself, measure any quantitative characteristic of the bond; though quantitative attributes such as a bond's yield spread can be associated with each category.
Performing a regression on categorical variables does not make sense per se. For example, it does not make sense to multiply a coefficient times AAA or times BBB. However, in a number of cases the standard tools of regression analysis can be applied to categorical ...
Get The Basics of Financial Econometrics: Tools, Concepts, and Asset Management Applications now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.