March 2019
Beginner to intermediate
448 pages
13h 14m
English
In this recipe, we will work with an imbalanced dataset and plot its ROC and precision-recall curves. The data contains:
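library(MASS)
library(caret)
library(PRROC)
library(precrec)

# Fix the random seed so the train/test split is reproducible
set.seed(10)

# Load the data and drop columns 1 and 7
data = read.csv("./approved.csv")
data = data[,-c(1,7)]

# Recode the numeric Approved flag as a two-level factor, Approved_
data$Approved_ = "not_approved"
data$Approved_[data$Approved == 1] <- "approved"
data$Approved_ = as.factor(data$Approved_)

# Drop the first remaining column
data = data[,-1]

# Stratified 75/25 split on the outcome
trainIndex <- createDataPartition(data$Approved_, p = .75, list = FALSE, times = 1)
traindata <- data[trainIndex,]
testdata <- data[-trainIndex,]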
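As a rough illustration of the two curves this recipe is building toward, here is a minimal sketch using the PRROC package. It does not use approved.csv: the data is simulated, the variable names (`sim`, `x`, `y`) are hypothetical, and the logistic regression is only an assumed stand-in for whatever model the recipe fits. The scores are also computed on the same rows the model was fitted to, purely to keep the example short.

```r
library(PRROC)

set.seed(10)

# Simulated imbalanced data (hypothetical stand-in for approved.csv):
# the negative intercept makes the positive class the minority
n <- 1000
x <- rnorm(n)
y <- rbinom(n, 1, plogis(-3 + 1.5 * x))
sim <- data.frame(x = x, y = y)

# Logistic regression as the scorer (an assumption, not the recipe's model)
fit <- glm(y ~ x, data = sim, family = binomial)
scores <- predict(fit, type = "response")

# PRROC takes the scores of the positive class in scores.class0
# and the scores of the negative class in scores.class1
roc <- roc.curve(scores.class0 = scores[sim$y == 1],
                 scores.class1 = scores[sim$y == 0],
                 curve = TRUE)
pr <- pr.curve(scores.class0 = scores[sim$y == 1],
               scores.class1 = scores[sim$y == 0],
               curve = TRUE)

# Each plot shows the curve together with its area under the curve
plot(roc)
plot(pr)
```

With real data, the same two calls would be fed predicted probabilities for `testdata` rather than in-sample scores. On an imbalanced dataset the precision-recall curve is usually the more informative of the two, because precision measures performance on the minority class directly.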