September 2017
The dataset that we are going to use to illustrate logistic regression describes loans issued by LendingClub. LendingClub publishes this data quarterly, and it can be found in its original form at https://www.lendingclub.com/info/download-data.action. We will work with a trimmed-down and simplified version of this data (available in the code bundles that come with this book) that includes only two columns: FICO.Range (the range containing a loan applicant's credit score as given by Fair, Isaac and Company, or FICO) and Interest.Rate (the interest rate of the loan granted to that applicant). The data looks like this:
$ head loan_data.csv
FICO.Range,Interest.Rate
735-739,8.90%
715-719,12.12%
690-694,21.98%
...
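Both columns arrive as text: FICO.Range is a hyphen-separated pair of integers and Interest.Rate carries a trailing percent sign, so each needs to be converted to a number before it can be used in a model. The following is a minimal sketch of that conversion, not the book's own code. It assumes the three rows shown above, embedded as a string instead of read from loan_data.csv, and the helper names parseFICORange and parseRate are hypothetical.

```go
package main

import (
	"encoding/csv"
	"fmt"
	"log"
	"strconv"
	"strings"
)

// sample holds the header and the three rows shown in the excerpt.
// Assumption: the real file follows the same two-column layout.
const sample = `FICO.Range,Interest.Rate
735-739,8.90%
715-719,12.12%
690-694,21.98%
`

// parseFICORange (hypothetical helper) splits a value such as
// "735-739" into its lower and upper bounds.
func parseFICORange(s string) (low, high int, err error) {
	parts := strings.Split(s, "-")
	if len(parts) != 2 {
		return 0, 0, fmt.Errorf("unexpected FICO range %q", s)
	}
	if low, err = strconv.Atoi(parts[0]); err != nil {
		return 0, 0, err
	}
	if high, err = strconv.Atoi(parts[1]); err != nil {
		return 0, 0, err
	}
	return low, high, nil
}

// parseRate (hypothetical helper) converts a value such as "8.90%"
// into the number 8.9, still expressed in percent.
func parseRate(s string) (float64, error) {
	return strconv.ParseFloat(strings.TrimSuffix(s, "%"), 64)
}

func main() {
	// Read every record from the embedded sample.
	reader := csv.NewReader(strings.NewReader(sample))
	records, err := reader.ReadAll()
	if err != nil {
		log.Fatal(err)
	}

	// Skip the header row, then convert both columns of each record.
	for _, record := range records[1:] {
		low, high, err := parseFICORange(record[0])
		if err != nil {
			log.Fatal(err)
		}
		rate, err := parseRate(record[1])
		if err != nil {
			log.Fatal(err)
		}
		fmt.Printf("FICO %d to %d, interest rate %.2f%%\n", low, high, rate)
	}
}
```

To run the same loop against the actual file, the strings.NewReader(sample) argument could be replaced with a file handle returned by os.Open.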