book

Pythonデータサイエンスハンドブック ―Jupyter、NumPy、pandas、Matplotlib、scikit-learnを使ったデータ分析、機械学習

May 2018

Intermediate to advanced

556 pages

13h 21m

Japanese

Read now

Unlock full access

5.8

詳細：決定木とランダムフォレスト

431

図5-77　データに当てはめたランダムフォレストモデル

5.8.4

事例：ランダムフォレストによる文字認識

手書きの数字データについては既に取り上げました（「5.2 scikit-learnの紹介」を参照）。この問

題に対してランダムフォレスト分類器がどのように使用できるのかを見てみましょう。

In[12]: from sklearn.datasets import load_digits

digits = load_digits()

digits.keys()

Out[12]: dict_keys(['target', 'data', 'target_names', 'DESCR', 'images'])

最初のいくつかのデータを可視化して、どのような問題であったのかを再度確認しましょう（図

5-78）。

In[13]:

# set up the figure

手書き数字画像のセットアップ

fig = plt.figure(figsize=(6, 6)) # figure size in inches

fig.subplots_adjust(left=0, right=1, bottom=0, top=1, hspace=0.05, wspace=0.05)

# plot the digits: each image is 8x8 pixels 8x8

の画像として、各数字をプロット

for i in range(64):

ax = fig.add_subplot(8, ...

Jake VanderPlas, 菊池彰

Andreas C. Muller, Sarah Guido, 中田秀基

Prateek Joshi, 相川愛三

Eve Porcello, Alex Banks, 尾崎沙耶, あんどうやすし

ISBN: 9784873118413Other