Skip to Content
R 语言经典实例(原书第 2 版)
book

R 语言经典实例(原书第 2 版)

by J.D. Long, Paul Teetor
June 2020
Beginner to intermediate
522 pages
9h 6m
Chinese
China Machine Press
Content preview from R 语言经典实例(原书第 2 版)
230
9
#> 5%
#> 0.0451
如果我们将概率的含义理解为相对概率,那么函数 quantile 的文档将该函数的第二个
参数称为“概率”。
在真正的 R 风格中,第二个参数可以是由概率值构成的向量;在这种情况下,quantile
函数返回相应的分位数向量,每个分位数对应一个概率值:
quantile(vec, c(.05, .95))
#> 5% 95%
#> 0.0451 0.9363
上例是一个方便的识别数据中间的 90%观测值的方法。
如果你完全省略第二个参数,那么 R 假定你想要的概率值为 00.250.500.75 1.0
即四分位数:
quantile(vec)
#> 0% 25% 50% 75% 100%
#> 0.000405 0.235529 0.479543 0.737619 0.999379
令人惊讶的是,函数 quantile 实现了九种不同的算法用于计算分位数。在假设默认算
法是最适合你的算法之前,请先研究该函数的帮助页面。
9.6 求分位数的逆
9.6.1 问题
给定数据中的一个观测值
x
,你想知道其相应的分位数。 也就是说,你想知道小于
x
数据的比例。
9.6.2 解决方案
假设数据存储在一个向量 vec 中,把数据与观察值进行比较,然后使用 mean 计算小于
x
的观测值(例如 1.6)的相对频数,如下例所示:
mean(vec < 1.6)
#> [1] 0.948
9.6.3 讨论
表达式vec<
x
vec 向量的每个元素与
x
进行比较并返回逻辑值的向量,如果
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

机器学习实战:基于Scikit-Learn、Keras 和TensorFlow (原书第2 版)

机器学习实战:基于Scikit-Learn、Keras 和TensorFlow (原书第2 版)

Aurélien Géron
大规模数据分析和建模:基于 Spark 与 R

大规模数据分析和建模:基于 Spark 与 R

Javier Luraschi, Kevin Kuo, Edgar Ruiz
管理Kubernetes

管理Kubernetes

Brendan Burns, Craig Tracey

Publisher Resources

ISBN: 9787111656814