Practical Statistics for Data Scientists, 2nd Edition (Chinese edition: 数据科学中的实用统计学(第2版))

by Peter Bruce, Andrew Bruce, Peter Gedeck
October 2021
Intermediate to advanced
289 pages
8h 31m
Chinese
Posts & Telecom Press
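A tree model is a set of "if-then-else" rules that is easy to understand and simple to implement. Compared with linear regression and logistic regression, trees can discover hidden patterns in the data that correspond to complex interactions. Unlike KNN or naive Bayes, a simple tree model can be expressed in terms of relationships among the predictor variables, which makes it easy to interpret.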
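To make the "if-then-else" description concrete, here is a minimal Python sketch of a two-variable tree written out as nested rules. The variable names are borrowed from the loan example later in this section. The split thresholds (0.575 and 10.0) and the function name `predict_outcome` are hypothetical, chosen only for illustration, and are not splits from a fitted model.

```python
def predict_outcome(borrower_score: float, payment_inc_ratio: float) -> str:
    """Classify a loan with hand-written rules that mimic a small tree.

    All thresholds are hypothetical; a fitted tree would learn them from data.
    """
    if borrower_score >= 0.575:
        # Higher score: this branch ignores payment_inc_ratio entirely.
        return "paid off"
    else:
        if payment_inc_ratio < 10.0:
            # Lower score, but payments are small relative to income.
            return "paid off"
        else:
            # Lower score combined with a heavy payment burden.
            return "default"


if __name__ == "__main__":
    for score, ratio in [(0.70, 12.0), (0.40, 5.0), (0.40, 15.0)]:
        print(score, ratio, predict_outcome(score, ratio))
```

In this sketch `payment_inc_ratio` only matters when `borrower_score` is low. That is the kind of interaction between predictors that a tree can represent directly as a rule.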
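Decision Trees in Operations Research
In decision science and operations research, the term decision tree has a different (and older) meaning: it refers to a human decision analysis process. In this sense, decision points, possible outcomes, and their estimated probabilities are laid out in a branching diagram, and the decision path with the maximum expected value is chosen.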
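The expected-value rule used in this older sense of "decision tree" can be sketched in a few lines of Python. The decisions, probabilities, and payoffs below are made up for illustration, and the helper names `expected_value` and `best_decision` are hypothetical.

```python
# Each decision maps to a list of (probability, payoff) outcomes.
# All numbers here are invented for illustration.
decisions = {
    "launch product": [(0.6, 100.0), (0.4, -50.0)],
    "run pilot": [(0.5, 60.0), (0.5, -10.0)],
    "do nothing": [(1.0, 0.0)],
}


def expected_value(outcomes):
    """Probability-weighted average payoff of one decision branch."""
    return sum(p * payoff for p, payoff in outcomes)


def best_decision(options):
    """Return the name of the decision with the largest expected value."""
    return max(options, key=lambda name: expected_value(options[name]))


if __name__ == "__main__":
    for name, outcomes in decisions.items():
        print(name, expected_value(outcomes))
    print("chosen:", best_decision(decisions))
```

Unlike the statistical tree models in this section, nothing here is learned from data: the analyst supplies the branches and their probabilities.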
6.2.1 A Simple Example
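The two main packages for fitting tree models in R are rpart and tree. Using the rpart package, a model is fit to a sample of 3,000 records of the loan data using the variables payment_inc_ratio and borrower_score (see Section 6.1 for a description of the data):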
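# Fit a classification tree with rpart (loan3000 is the book's loan data sample)
library(rpart)
loan_tree <- rpart(outcome ~ borrower_score + payment_inc_ratio,
                   data=loan3000, control=rpart.control(cp=0.005))
# Draw the tree with uniform branch spacing, then label the splits
plot(loan_tree, uniform=TRUE, margin=0.05)
text(loan_tree)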
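sklearn.tree.DecisionTreeClassifier also provides an implementation of a decision tree. The dmba package includes a convenient function for creating a visual representation of a decision tree in a Jupyter notebook. ...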
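As a rough Python counterpart to the R code above, the following sketch fits a `DecisionTreeClassifier` on two predictors with the same names. Several things here are assumptions rather than the book's setup. The data is synthetic, not the loan3000 sample, and is generated from a made-up labelling rule. `max_depth=3` stands in for R's `cp` setting, which has no direct scikit-learn equivalent. The tree is printed with scikit-learn's `export_text` rather than the dmba plotting helper.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier, export_text

# Synthetic stand-in for the loan data (NOT the book's loan3000 sample).
rng = np.random.default_rng(0)
n = 3000
borrower_score = rng.uniform(0.0, 1.0, n)
payment_inc_ratio = rng.uniform(0.0, 25.0, n)

# Made-up labelling rule, used only so the tree has a pattern to find.
is_default = (borrower_score < 0.5) & (payment_inc_ratio > 10.0)
outcome = np.where(is_default, "default", "paid off")

predictors = ["borrower_score", "payment_inc_ratio"]
X = np.column_stack([borrower_score, payment_inc_ratio])

# max_depth is an assumed complexity limit, not a translation of cp=0.005.
loan_tree = DecisionTreeClassifier(random_state=1, max_depth=3)
loan_tree.fit(X, outcome)

# Text rendering of the fitted if-then-else rules.
rules = export_text(loan_tree, feature_names=predictors)
print(rules)
```

Each indented line in the printed output is one split, so the text can be read directly as a set of nested rules on `borrower_score` and `payment_inc_ratio`.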
Publisher Resources

ISBN: 9787115569028