book

Think Stats 第3版 ―Pythonで学ぶ統計学入門

Name: Think Stats 第3版 ―Pythonで学ぶ統計学入門
ISBN: 9784814401376

by Allen B. Downey, 大橋真也

October 2025

Intermediate to advanced

328 pages

3h 56m

Japanese

O'Reilly Japan, Inc.

Read now

Unlock full access

表紙
大扉
原書大扉
クレジット
はじめに
1章　探索的データ分析
1.1　証拠1.2　家族の成長に関する全国調査1.3　データの読み込み1.4　検証1.5　変換1.6　要約統計量1.7　データの解釈1.8　用語集1.9　練習問題
2章　分布
2.1　度数分布表2.2　NSFGの分布2.3　外れ値2.4　第一子2.5　効果量2.6　結果の報告2.7　用語集2.8　練習問題
3章　確率質量関数
3.1　確率質量関数3.2　PMFを要約する3.3　クラス規模のパラドックス3.4　NSFGデータ3.5　その他の可視化3.6　用語集3.7　練習問題
4章　累積分布関数
4.1　パーセンタイルとパーセンタイル順位4.2　累積分布関数4.3　CDFを比較する4.4　パーセンタイルに基づく統計量4.5　乱数4.6　用語集4.7　練習問題
5章　分布をモデル化する
5.1　二項分布5.2　ポアソン分布5.3　指数分布5.4　正規分布5.5　対数正規分布5.6　なぜモデルなのか5.7　用語集5.8　練習問題

6章　確率密度関数
6.1　分布の比較6.2　確率密度6.3　指数分布のPDF6.4　PMFとPDFの比較6.5　カーネル密度推定6.6　分布のフレームワーク6.7　用語集6.8　練習問題
7章　変数間の関係
7.1　散布図7.2　デシルプロット7.3　相関7.4　相関の強さ7.5　順位相関7.6　相関と因果7.7　用語集7.8　練習問題
8章　推定
8.1　ペンギンの体重測定8.2　頑健性8.3　分散の推定8.4　標本分布8.5　標準誤差8.6　信頼区間8.7　誤差の原因8.8　用語集8.9　練習問題
9章　仮説検定
9.1　コイン投げ9.2　平均の差の検定9.3　その他の検定統計量9.4　相関の検定9.5　比率の検定9.6　用語集9.7　練習問題
10章　最小二乗法
10.1　最小二乗法10.2　決定係数10.3　MSEの最小化10.4　推定10.5　不確実性の可視化10.6　変換10.7　用語集10.8　練習問題
11章　重回帰分析
11.1　統計モデル11.2　重回帰11.3　コントロール変数11.4　非線形関係11.5　ロジスティック回帰11.6　用語集11.7　練習問題
12章　時系列分析
12.1　電力12.2　時系列成分の分解12.3　予測12.4　乗法モデル12.5　自己回帰12.6　移動平均12.7　自己回帰による遡及予測12.8　ARIMA12.9　ARIMAによる予測12.10　用語集12.11　練習問題
13章　生存時間解析
13.1　生存関数13.2　ハザード関数13.3　結婚データ13.4　重み付きブートストラップ13.5　ハザード関数の推定13.6　生存関数の推定13.7　lifelinesパッケージ13.8　信頼区間13.9　期待残存期間13.10　用語集13.11　練習問題
14章　分析手法
14.1　正規確率プロット14.2　正規分布14.3　標本平均の分布14.4　差の分布14.5　中心極限定理14.6　中心極限定理の限界14.7　中心極限定理の適用14.8　相関検定14.9　カイ二乗検定14.10　計算と分析14.11　用語集14.12　練習問題
著者・訳者紹介
奥付

Content preview from Think Stats 第3版 ―Pythonで学ぶ統計学入門

2章分布

この章では、統計学の最も基本的な考え方の1つである分布を紹介します。まず度数分布表（データセット中の値とそれぞれの値の出現回数を表す表）から始め、それを使って「家族の成長に関する全国調査（NSFG）」のデータを調べます。また、外れ値と呼ばれる極端な値や誤った値を探し、その扱い方を考えます。

2.1　度数分布表

変数を表現する1つの方法として、変数の値とその度数、つまり各値が現れる回数を含む度数分布表があります。これは変数の分布と呼ばれます。

分布を表現するには、empiricaldistと呼ばれるライブラリを使います。ここでの「empirical（経験的）」とは、分布が数学的モデルではなくデータに基づいていることを意味します。empiricaldistにはFreqTabというクラスがあり、これを使って度数分布表の計算やプロットができます。これは以下のようにインポートします

from empiricaldist import FreqTab

どのように動作するかを示すために、小さな値のリストから始めることにします。

t = [1.0, 2.0, 2.0, 3.0, 5.0]

FreqTabには、from_seqメソッドがあります。このメソッドはシーケンスを受け取り、FreqTabオブジェクトを作ります。

ftab = FreqTab.from_seq(t)
ftab

	度数
1.0	1
2.0	2
3.0	1
5.0	1

FreqTabオブジェクトはPandasのSeriesの一種で、オブジェクトとその度数を含んでいます。この例では、値1.0は度数1に対応し、値2.0は度数2に対応しています。

FreqTabには、度数分布表を棒グラフとしてプロットする ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Publisher Resources

ISBN: 9784814401376Publisher Website

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Think Stats 第3版 ―Pythonで学ぶ統計学入門

by Allen B. Downey, 大橋真也

2章分布

2.1　度数分布表

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

More than 5,000 organizations count on O’Reilly

Julian F.

Addison B.

Amir M.

Mark W.

You might also like

直感生成AI ―ハンズオンで動かして学ぶ拡散モデル入門

SRE サイトリライアビリティエンジニアリング ―Googleの信頼性を支えるエンジニアリングチーム

MLOps実装ガイド ―本番運用を見据えた開発戦略

生成AI時代の新プログラミング実践ガイド Pythonで学ぶGPTとCopilotの活用ベストプラクティス

Publisher Resources

2章分布

2.1 度数分布表

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,and much more.

More than 5,000 organizations count on O’Reilly

Julian F.

Addison B.

Amir M.

Mark W.

You might also like

直感 生成AI ―ハンズオンで動かして学ぶ拡散モデル入門

SRE サイトリライアビリティエンジニアリング ―Googleの信頼性を支えるエンジニアリングチーム

MLOps実装ガイド ―本番運用を見据えた開発戦略

生成AI時代の新プログラミング実践ガイド Pythonで学ぶGPTとCopilotの活用ベストプラクティス

Publisher Resources

2.1　度数分布表

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

直感生成AI ―ハンズオンで動かして学ぶ拡散モデル入門