Skip to Content
面向数据科学家的实用统计学
book

面向数据科学家的实用统计学

by Peter Bruce, Andrew Bruce
October 2018
Beginner to intermediate
238 pages
6h 32m
Chinese
Posts & Telecom Press
Content preview from 面向数据科学家的实用统计学
分类
145
Degrees of Freedom: 45341 Total (i.e. Null); 45330 Residual
Null Deviance: 62860
Residual Deviance: 57510 AIC: 57540
其中,outcome 是响应变量,在还清贷款时值为
0
,在欠贷时值为
1
purpose_ home_
分别表示贷款目的和房屋产权状态的因子变量。在回归中,具有
P
个层级的因子变量可以
P
-
1
个列表示。在
R
中,默认使用了
参考
编码,即将所有的层与参考层做对比(参见
4.4
节)。上述因子的参考层分别是 credit_card MORTGAGE。变量 borrower_score 是一个
位于
[0, 1]
区间的分值,表示了借款者的信誉度范围从“差”到“优秀”。该变量是使用
K
最近邻方法从其他几个变量中创建的,具体方法参见
6.1.6
节。
5.3.3
 广义线性模型
除了回归之外,另一类重要的模型是
广义线性模型
。广义线性模型的特征主要体现在下面
两个方面。
一个概率分布或家族。例如,对于逻辑回归,它是二项分布。
一个将响应映射到预测因子的
连接函数
link function
)。例如,对于逻辑回归,它是
Logit
函数。
到目前为止,广义线性模型最常见的形式是逻辑回归。数据科学家还会看到其他类型的广
义线性模型。有时,我们会发现连接函数使用的是对数函数,而非
Logit
函数。在实践中,
对于大多数应用而言,使用对数连接函数基本不会对结果造成很大的差异。泊松分布常用 ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

C++语言导学(原书第2版)

C++语言导学(原书第2版)

本贾尼 斯特劳斯特鲁普
基于Python的智能文本分析

基于Python的智能文本分析

Benjamin Bengfort, Rebecca Bilbro, Tony Ojeda

Publisher Resources

ISBN: 9787115493668