Practical Statistics for Data Scientists (面向数据科学家的实用统计学)

by Peter Bruce, Andrew Bruce
October 2018
Beginner to intermediate
238 pages
6h 32m
Chinese
Posts & Telecom Press
Content preview from Practical Statistics for Data Scientists (面向数据科学家的实用统计学)
Chapter 1: Exploratory Data Analysis
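Over the past century, statistics has developed considerably as a discipline. Probability theory, the mathematical foundation of statistics, took shape between the 17th and 19th centuries, building on the work of Thomas Bayes, Pierre-Simon Laplace, Carl Gauss, and others. Unlike probability theory, which is purely theoretical in nature, statistics is an applied science concerned with the analysis and modeling of data. Modern statistics is a rigorous science whose roots go back to Francis Galton and Karl Pearson in the late 19th century. In the early 20th century, Ronald Aylmer Fisher became one of the pioneers of modern statistics, introducing key concepts such as experimental design and maximum likelihood estimation. Many other statistical concepts are likewise deeply rooted in data science. The main goal of this book is to help you understand these concepts and to clarify whether they remain important in the context of data science and big data.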
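This chapter focuses on exploring the data, the first step in any data science project. Exploratory data analysis (EDA) is a comparatively new area of statistics. Classical statistics focused almost exclusively on inference, that is, drawing conclusions about a whole population from a small sample, which is often a complex process. In 1962, John Tukey (Figure 1-1) published a famous paper, "The Future of Data Analysis", which prompted a reformation of statistics. In the paper, Tukey proposed a new discipline that he called "data analysis", with statistical inference as one of its components, and in doing so forged links to the engineering and computer science communities (he coined the terms "bit", short for "binary digit", and "software"). Surprisingly, these original ideas have endured and become part of the foundation of data science. Tukey wrote and, in 1977, published Exploratory Data Analysis ...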

Publisher Resources

ISBN: 9787115493668