The following steps will lead us through our investigation:
- Let's look at what the makes and models of cars can tell us about fuel efficiency over time. First, let's look at the frequency of makes and models of cars available in the U.S., concentrating on 4-cylinder cars. To select the 4-cylinder cars, we first list the unique values of the cylinders variable to see what the possible values are:
In [30]: pd.unique(vehicles_non_hybrid.cylinders)
Out[30]: array([ 4., 12., 8., 6., 5., 10., 2., 3., 16., nan])
Both 4.0 and 4 are listed as unique values, which should raise your suspicion. Recall that when we imported the data, pandas warned us that several variables had mixed types, and cylinders was one of them.
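One way to check a mixed-type suspicion like this is to count the Python types actually stored in the column. The following is a minimal sketch on a small made-up Series, not the vehicles data; the name `cylinders` and its values are assumptions chosen only to illustrate the idea:

```python
import pandas as pd

# Hypothetical toy column (NOT the real vehicles data):
# the same quantity stored partly as floats and partly as strings.
cylinders = pd.Series([4.0, 6.0, "4", "8", 8.0], dtype=object)

# pd.unique treats the float 4.0 and the string "4" as distinct values.
unique_values = pd.unique(cylinders)
print(unique_values)

# Count how many entries there are of each Python type.
type_counts = cylinders.map(lambda v: type(v).__name__).value_counts()
print(type_counts)
```

If more than one type shows up in the counts, the column holds mixed types and needs to be brought to a single type before filtering on a value such as 4.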
- Let's convert ...