Skip to Content
数据科学中的实用统计学(第2版)
book

数据科学中的实用统计学(第2版)

by Peter Bruce, Andrew Bruce, Peter Gedeck
October 2021
Intermediate to advanced
289 pages
8h 31m
Chinese
Posts & Telecom Press
Content preview from 数据科学中的实用统计学(第2版)
115
4
回归与预测
统计学中最常见的目标可能就是回答这个问题:“变量
X
(更可能是
X
1
,
,
X
p
)与变量
Y
有关联吗?如果有,它们之间的关系是什么?可以使用
X
来预测
Y
吗?”
统计学与数据科学之间联系最为紧密的领域是预测,具体地说,就是基于预测变量的值来预测
结果(目标)变量。在结果已知的数据上训练一个模型,再把这个模型应用于结果未知的数据
上,这个过程称为
监督学习
。数据科学与统计学之间有重要联系的另一种领域是
异常检测
——
在数据分析中先进行回归诊断,再逐步改进回归模型,然后使用这个模型来检测异常记录。
4.1
 简单线性回归
简单线性回归提供了一个关系模型来反映一个变量与另一个变量的大小之间的关系。例
如,当
X
增大时,
Y
也增大,或者当
X
增大时,
Y
减小。
1
测量两个变量之间如何关联的另
一种方式是相关性,参见
1.7
。二者之间的区别是,相关性测量的是两个变量之间关联
强度
,而回归模型则是对两个变量之间关系的
本质
进行量化。
本节关键术语
响应变量
试图预测的变量。
同义词
因变量、变量
Y
、目标、结果
1
本章内容版权归属:
©
2020 Datastats, LLC, Peter Bruce, Andrew Bruce, and Peter Gedeck
;已获得授权使用。
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Python机器学习案例精解

Python机器学习案例精解

Posts & Telecom Press, Yuxi (Hayden) Liu

Publisher Resources

ISBN: 9787115569028