Skip to Content
精通数据科学算法
book

精通数据科学算法

by Posts & Telecom Press, David Natingga
May 2024
Intermediate to advanced
181 pages
3h 9m
Chinese
Packt Publishing
Content preview from 精通数据科学算法

附录A 统计

定义AB表示集合A和集合B的交集。交集是集合AB的子集,包含同时存在于AB里的所有元素,即AB := { x : x存在于A并且x存在于B}。

定义AB表示集合A和集合B的并集。并集完全包含了存在于集合A或者集合B的所有元素,即AB := { x : x存在于A或者x存在于B}。

定义AB或者A\B表示集合A和集合B的差集。差集是集合A的子集,包含所有存在于集合A但不存在于集合B的元素,即AB := { x : x存在于A并且x不存在于B}。

求和符号∑表示集合里所有成员之和,譬如:

C:\Users\LL\Desktop\49816\OK-未命名-1\9.png

  • 总体:分析过程中使用的相似数据或项的集合。
  • 样本:总体的一个子集。
  • 集合的算数均值(平均数):该集合的全部数值之和除以集合的大小。
  • 中位数:一个有序集合的中间数值,比如说,当x1<…< x2k+1时,集合{x1, …, x2k+1}的中位数是值xk+1
  • 随机变量:把一组可能的结果对应到一组值(例如实数)的函数。
  • 期望:随机变量的期望是由随机变量给定数值组成的递增集合的平均值极限。
  • 方差:衡量总体分布的平均水平。在数学上,随机变量X的方差是随机变量X与它的平均值μ之差的平方的期望,即Var( X ) = E [(X - μ)2]。
  • 标准差:随机变量X的标准差是变量X的方差的平方根,即SD( X ) = sqrt(Var( X ))。
  • 相关关系:随机变量之间相关性的度量。在数学上,对于随机变量 ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

数据科学原理

数据科学原理

Posts & Telecom Press, Sinan Ozdemir
PyTorch深度学习

PyTorch深度学习

Posts & Telecom Press, Vishnu Subramanian
程序员学数据结构

程序员学数据结构

Posts & Telecom Press, William Smith
可编程网络自动化

可编程网络自动化

Jason Edelman, Scott S. Lowe, Matt Oswalt

Publisher Resources

ISBN: 9781836204596