Skip to Content
深度學習|內行人的做法
book

深度學習|內行人的做法

by Josh Patterson, Adam Gibson
January 2019
Beginner to intermediate
576 pages
14h 31m
Chinese
GoTop Information, Inc.
Content preview from 深度學習|內行人的做法
機器學習的工作原理
|
29
套入不足和過度套入
我們之前曾提過,最佳化演算法一開始要先解決「套入不足(underfitting)」的問題;
換句話說,我們一開始只是採用了一條與資料之間還不是很近似的直線,然後再設法讓
這條直線與資料更接近一點。假設我們想把資料散佈圖中不同類別的資料切分開來,如
1-7 所示,用直線來切分的話,就成了一個套入不足的好例子。
套入不足 不錯的套入結果 過度套入
1-7 機器學習中「套入不足」和「過度套入」的例子
如果套入的曲線把資料切分得太過完美,很可能就是遇到了相反的問題,稱為「過度套
入(overfitting)」。解決套入不足當然是優先要考慮的問題,但在機器學習方面,有很
多努力其實都花費在解決過度套入的問題。如果我們說某個模型過度套入整組資料,意
思就是它有可能在面對訓練資料時錯誤率很低,但面對我們感興趣的母體資料時,通用
性卻很差。
另一種解釋過度套入的方式,就是可以考慮一下資料的分佈情況。我們利用訓練組資料
畫出了一條具有預測能力的線,但訓練組資料只不過是從一整組更大的未知資料集取樣
的結果,如果我們希望畫出來的線能具備預測的能力,這條線面對更大的整組資料集時
也必須具有同樣的套入效果才行。因此我們必須假設,我們的樣本組必須是更大一整組
資料集的一個鬆散的代表。
最佳化
前面曾提過,我們可以利用調整權重的方式,讓資料的預測越來越準確,這整個過程就
叫做「
參數最佳化
parameter optimization
)」。你可以把這整個過程,想像成一種科學
方法。首先可以用某種公式做出假設 ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

高效能網站建置指南

高效能網站建置指南

Steve Souders
初探深度學習|使用TensorFlow

初探深度學習|使用TensorFlow

Reza Zadeh, Bharath Ramsundar
深度学习实战

深度学习实战

Douwe Osinga

Publisher Resources

ISBN: 9789865020262