book

Think Stats 第3版 ―Pythonで学ぶ統計学入門

Name: Think Stats 第3版 ―Pythonで学ぶ統計学入門
ISBN: 9784814401376

by Allen B. Downey, 大橋真也

October 2025

Intermediate to advanced

328 pages

3h 56m

Japanese

O'Reilly Japan, Inc.

Read now

Unlock full access

表紙
大扉
原書大扉
クレジット
はじめに
1章　探索的データ分析
1.1　証拠1.2　家族の成長に関する全国調査1.3　データの読み込み1.4　検証1.5　変換1.6　要約統計量1.7　データの解釈1.8　用語集1.9　練習問題
2章　分布
2.1　度数分布表2.2　NSFGの分布2.3　外れ値2.4　第一子2.5　効果量2.6　結果の報告2.7　用語集2.8　練習問題
3章　確率質量関数
3.1　確率質量関数3.2　PMFを要約する3.3　クラス規模のパラドックス3.4　NSFGデータ3.5　その他の可視化3.6　用語集3.7　練習問題
4章　累積分布関数
4.1　パーセンタイルとパーセンタイル順位4.2　累積分布関数4.3　CDFを比較する4.4　パーセンタイルに基づく統計量4.5　乱数4.6　用語集4.7　練習問題
5章　分布をモデル化する
5.1　二項分布5.2　ポアソン分布5.3　指数分布5.4　正規分布5.5　対数正規分布5.6　なぜモデルなのか5.7　用語集5.8　練習問題

6章　確率密度関数
6.1　分布の比較6.2　確率密度6.3　指数分布のPDF6.4　PMFとPDFの比較6.5　カーネル密度推定6.6　分布のフレームワーク6.7　用語集6.8　練習問題
7章　変数間の関係
7.1　散布図7.2　デシルプロット7.3　相関7.4　相関の強さ7.5　順位相関7.6　相関と因果7.7　用語集7.8　練習問題
8章　推定
8.1　ペンギンの体重測定8.2　頑健性8.3　分散の推定8.4　標本分布8.5　標準誤差8.6　信頼区間8.7　誤差の原因8.8　用語集8.9　練習問題
9章　仮説検定
9.1　コイン投げ9.2　平均の差の検定9.3　その他の検定統計量9.4　相関の検定9.5　比率の検定9.6　用語集9.7　練習問題
10章　最小二乗法
10.1　最小二乗法10.2　決定係数10.3　MSEの最小化10.4　推定10.5　不確実性の可視化10.6　変換10.7　用語集10.8　練習問題
11章　重回帰分析
11.1　統計モデル11.2　重回帰11.3　コントロール変数11.4　非線形関係11.5　ロジスティック回帰11.6　用語集11.7　練習問題
12章　時系列分析
12.1　電力12.2　時系列成分の分解12.3　予測12.4　乗法モデル12.5　自己回帰12.6　移動平均12.7　自己回帰による遡及予測12.8　ARIMA12.9　ARIMAによる予測12.10　用語集12.11　練習問題
13章　生存時間解析
13.1　生存関数13.2　ハザード関数13.3　結婚データ13.4　重み付きブートストラップ13.5　ハザード関数の推定13.6　生存関数の推定13.7　lifelinesパッケージ13.8　信頼区間13.9　期待残存期間13.10　用語集13.11　練習問題
14章　分析手法
14.1　正規確率プロット14.2　正規分布14.3　標本平均の分布14.4　差の分布14.5　中心極限定理14.6　中心極限定理の限界14.7　中心極限定理の適用14.8　相関検定14.9　カイ二乗検定14.10　計算と分析14.11　用語集14.12　練習問題
著者・訳者紹介
奥付

Content preview from Think Stats 第3版 ―Pythonで学ぶ統計学入門

6章確率密度関数

5章では、二項分布、ポアソン分布、指数分布、正規分布などの理論的分布を使ってデータをモデル化しました。

二項分布とポアソン分布は離散分布であり、これは結果が整数の、ヒット数とミス数、得点のように、とびとびの要素であることが必要であることを意味しています。離散分布では、各結果は確率質量が対応しています。

指数分布と正規分布は、連続的な分布であり、これは結果が可能な値の範囲内のどの点にもなり得ることを意味します。連続分布では、各結果は確率密度に関連付けられています。確率密度は抽象的な概念で、多くの人にとって最初は難しいと感じますが、一歩ずつ進めていきましょう。最初のステップとして、分布の比較についてもう一度考えてみましょう。

6.1　分布の比較

5章では、離散分布を比較するとき、それらの確率質量関数（PMF）を示すために棒グラフを使いました。連続分布を比較するときは、それらの累積分布関数（CDF）を示すために折れ線グラフを使いました。

離散分布については、CDFを使うこともできます。例えば、lam=2.2のポアソン分布のPMFは、NSFGデータにおける世帯人数の分布をよく表しているモデルです。

read_fem_respを使って回答者データファイルを読み込みます。

from nsfg import read_fem_resp

resp = read_fem_resp()

次に、25歳以上の世帯人数を選びます。

older = resp.query("age >= 25")
num_family = older["numfmhh"]

そして、回答者の分布を表すPmfを作成します。

from empiricaldist import Pmf

pmf_family ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Publisher Resources

ISBN: 9784814401376Publisher Website

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Think Stats 第3版 ―Pythonで学ぶ統計学入門

by Allen B. Downey, 大橋真也

6章確率密度関数

6.1　分布の比較

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

More than 5,000 organizations count on O’Reilly

Julian F.

Addison B.

Amir M.

Mark W.

You might also like

直感生成AI ―ハンズオンで動かして学ぶ拡散モデル入門

SRE サイトリライアビリティエンジニアリング ―Googleの信頼性を支えるエンジニアリングチーム

MLOps実装ガイド ―本番運用を見据えた開発戦略

生成AI時代の新プログラミング実践ガイド Pythonで学ぶGPTとCopilotの活用ベストプラクティス

Publisher Resources

6章確率密度関数

6.1 分布の比較

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,and much more.

More than 5,000 organizations count on O’Reilly

Julian F.

Addison B.

Amir M.

Mark W.

You might also like

直感 生成AI ―ハンズオンで動かして学ぶ拡散モデル入門

SRE サイトリライアビリティエンジニアリング ―Googleの信頼性を支えるエンジニアリングチーム

MLOps実装ガイド ―本番運用を見据えた開発戦略

生成AI時代の新プログラミング実践ガイド Pythonで学ぶGPTとCopilotの活用ベストプラクティス

Publisher Resources

6.1　分布の比較

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

直感生成AI ―ハンズオンで動かして学ぶ拡散モデル入門