面向数据科学家的实用统计学

by Peter Bruce, Andrew Bruce
October 2018
Beginner to intermediate
238 pages
6h 32m
Chinese
Posts & Telecom Press
Figure 1-2: Boxplot of US state populations
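The top and bottom of the box are the 75th and 25th percentiles, respectively. The horizontal line inside the box marks the median. The dashed lines, called whiskers, extend from the top and bottom of the box and show the range covered by the bulk of the data.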
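The box elements described above can be computed directly. The following is a minimal sketch, assuming a small made-up vector `pop` (in millions) that stands in for the book's state data, which is not reproduced here. Note that `quantile()` uses R's default interpolation rule, which may differ slightly from the hinges that `boxplot()` draws.

```r
# Made-up populations in millions, for illustration only (not the book's data).
pop <- c(0.6, 1.3, 2.9, 4.6, 5.8, 6.7, 9.9, 12.8, 19.4, 37.3)

# Bottom of the box, line inside the box, and top of the box.
box <- quantile(pop, probs = c(0.25, 0.5, 0.75))
print(box)

# Height of the box: the interquartile range (IQR).
iqr <- unname(box["75%"] - box["25%"])
print(iqr)
```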
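Boxplots come in many variants; see, for example, the R documentation for the boxplot function [5]. By default, the R function extends each whisker to the furthest point beyond the box, but no further than 1.5 times the interquartile range (IQR). Other software may use a different rule. Any data points beyond the whiskers are plotted individually.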
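The whisker rule can be checked without drawing a plot. The sketch below uses `boxplot.stats()` from base R, which returns the statistics that `boxplot()` draws; the vector `pop` is made up for illustration and is not the book's data.

```r
# Made-up populations in millions, for illustration only (not the book's data).
pop <- c(0.6, 1.3, 2.9, 4.6, 5.8, 6.7, 9.9, 12.8, 19.4, 37.3)

# coef = 1.5 is the default multiplier applied to the box height.
bs <- boxplot.stats(pop, coef = 1.5)

# Lower whisker end, lower hinge, median, upper hinge, upper whisker end.
print(bs$stats)

# Points beyond the whiskers, which boxplot() would draw individually.
print(bs$out)
```

Raising `coef` lengthens the whiskers and leaves fewer individual points; setting `coef = 0` extends the whiskers to the data extremes.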
1.5.2 Frequency Table and Histograms
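A frequency table of a variable divides the variable's range into equally spaced segments and reports how many values fall in each segment. In R, the following commands compute the frequency table of US population by state; the result is shown in Table 1-5.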
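# Ten equal-width bins need eleven break points spanning the observed range.
breaks <- seq(from = min(state[["Population"]]),
              to = max(state[["Population"]]), length.out = 11)
# Assign each state to a bin; include.lowest keeps the minimum in the first bin.
pop_freq <- cut(state[["Population"]], breaks = breaks,
                right = TRUE, include.lowest = TRUE)
table(pop_freq)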
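The commands above rely on the book's `state` data frame, which is not included in this excerpt. As a minimal runnable sketch, the version below substitutes a made-up `state` data frame with round population figures (an assumption for illustration, not the book's data) and cross-checks the counts against `hist()` called with the same breaks.

```r
# Made-up stand-in for the book's `state` data frame (illustrative values only).
state <- data.frame(
  Population = c(0.6e6, 1.3e6, 2.9e6, 4.6e6, 5.8e6,
                 6.7e6, 9.9e6, 12.8e6, 19.4e6, 37.3e6)
)
pop <- state[["Population"]]

# Eleven break points give ten equal-width bins over the observed range.
breaks <- seq(from = min(pop), to = max(pop), length.out = 11)

# Bin each value; intervals are closed on the right, and the first bin
# also includes the minimum.
pop_freq <- cut(pop, breaks = breaks, right = TRUE, include.lowest = TRUE)
freq <- table(pop_freq)
print(freq)

# hist() bins the same way by default, so its counts should agree.
h <- hist(pop, breaks = breaks, plot = FALSE)
print(h$counts)
```

Empty bins still appear in the table with a count of zero, which is useful information: it shows gaps in the distribution that a list of nonempty bins would hide.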
[5] R Core Team. R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing (2015).
Table 1-5: Frequency table of US population by state ...

Publisher Resources

ISBN: 9787115493668