May 2019
Beginner
528 pages
29h 51m
English
In Chapter 10’s Intro to Data Science section, we performed simple linear regression on a small weather data time series using pandas, Seaborn’s regplot function and the SciPy’s stats module’s linregress function. In the previous section, we reimplemented that same example using scikit-learn’s LinearRegression estimator, Seaborn’s scatterplot function and Matplotlib’s plot function. Now, we’ll perform linear regression with a much larger real-world dataset.
The California Housing dataset7 bundled with scikit-learn has 20,640 samples, each with eight numerical features. We’ll perform a multiple linear regression that uses all eight numerical features to make more ...
Read now
Unlock full access