
Think Stats 第2版 ―プログラマのための統計入門

ISBN: 9784873117355

by Allen B. Downey, 黒川利明, 黒川洋

August 2015

Intermediate to advanced

272 pages

5h 46m

Japanese

O'Reilly Japan, Inc.


Content preview from Think Stats 第2版 ―プログラマのための統計入門

1.4 DataFrame

The data file is a text (ASCII) file compressed with gzip. Each line of the file is one record, which holds the data for one pregnancy.
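
As a minimal sketch of what "one record per line in a gzip-compressed text file" means in practice, the following uses Python's standard gzip module. The sample records and the in-memory buffer are made up for illustration; they are not NSFG data.

```python
import gzip
import io

# Two made-up fixed-width records (not real NSFG data).
sample_text = "           1 1\n           2 1\n"

# Compress the text in memory so the example needs no file on disk.
buffer = io.BytesIO()
with gzip.open(buffer, mode="wt", encoding="ascii") as f:
    f.write(sample_text)

# Read it back: iterating over the file yields one record per line.
buffer.seek(0)
with gzip.open(buffer, mode="rt", encoding="ascii") as f:
    records = [line.rstrip("\n") for line in f]

print(len(records))
```

With a real file on disk, the same reading loop should work by passing the file name to gzip.open instead of the buffer.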

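The file format is documented in 2002FemPreg.dct, a Stata dictionary file. Stata is a commercially sold statistical software system. A "dictionary" in this context is a list of variable names, types, and indices. Each index indicates where in a line the value of the variable is found.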

For example, 2002FemPreg.dct contains the following lines:

infile dictionary {
    _column(1) str12 caseid %12s "RESPONDENT ID NUMBER"
    _column(13) byte pregordr %2f "PREGNANCY ORDER (NUMBER)"
}

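This dictionary describes the following two variables. caseid is a 12-character string that represents the respondent ID. pregordr is a 1-byte integer that indicates which pregnancy this is for the respondent.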
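
To illustrate how the column indices in a dictionary locate values within a line, here is a minimal sketch that parses the two _column entries with a regular expression and slices a made-up record. The helper names parse_dct and parse_record, the sample record, and the record width of 14 are assumptions for illustration; this is not the implementation in thinkstats2.py.

```python
import re

DCT_TEXT = '''infile dictionary {
    _column(1) str12 caseid %12s "RESPONDENT ID NUMBER"
    _column(13) byte pregordr %2f "PREGNANCY ORDER (NUMBER)"
}'''

# Pattern for: _column(start)  type  name  format  "description"
ENTRY = re.compile(r'_column\((\d+)\)\s+(\S+)\s+(\S+)\s+(\S+)\s+"([^"]*)"')


def parse_dct(text):
    """Return a list of (start, vtype, name) tuples; start is 1-based."""
    entries = []
    for line in text.splitlines():
        match = ENTRY.search(line)
        if match:
            start, vtype, name, _fmt, _desc = match.groups()
            entries.append((int(start), vtype, name))
    return entries


def parse_record(line, entries, width):
    """Slice one fixed-width line; each field ends where the next begins."""
    starts = [start for start, _, _ in entries] + [width + 1]
    record = {}
    for (start, vtype, name), end in zip(entries, starts[1:]):
        raw = line[start - 1:end - 1].strip()
        # Keep string-typed fields as text; convert everything else to int.
        record[name] = raw if vtype.startswith('str') else int(raw)
    return record


entries = parse_dct(DCT_TEXT)
# Made-up record: a 12-character ID field followed by a 2-character field.
sample = '           1' + ' 2'
print(parse_record(sample, entries, width=14))
```

The sketch treats the 1-based _column value as the start of a field and the next entry's start as its end, which is why the last field needs an explicit record width.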

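The code you downloaded includes the thinkstats2.py module. In addition to the many classes and functions used in this book, this Python module contains functions for reading Stata dictionaries and NSFG data files. The following shows how the module is used in nsfg.py: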

def ReadFemPreg(dct_file='2002FemPreg.dct', ...
