Skip to Content
深度學習|內行人的做法
book

深度學習|內行人的做法

by Josh Patterson, Adam Gibson
January 2019
Beginner to intermediate
576 pages
14h 31m
Chinese
GoTop Information, Inc.
Content preview from 深度學習|內行人的做法
深度學習在自然語言處理的應用
|
239
深度學習在自然語言處理的應用
深度學習在自然語言處理(NLP)領域已被證明是相當有效的一種做法。諸如「詞性標
記(POS tagging)」
13
、「 字元生成(character generation
http://bit.ly/2sUs2PU
」與
「單詞內嵌(word embeddings)」等技術,都是深度學習的常見應用。以下是本章打算
重點討論的一些自然語言處理應用:
使用 Word2Vec 學習單詞內嵌
14
使用段落向量做為句子的分散式表達方式
15
用段落向量進行文件分類
我們會在後面的章節中,看到以上每一種應用方式的介紹。
使用 Word2Vec,學習單詞內嵌
Word2Vec 是利用圍繞在單詞周圍的前後文,學習偵測出單詞與單詞之間的數學相似
性。Word2Vec 所建立的單詞向量,其實是單詞特徵(隱含於單詞前後文)的一種分散
式數值表達方式。Word2Vec 是把大量文字語料素材當成輸入資料進行訓練,並生成單
詞向量(或稱為「單詞內嵌(word embedding)」)的一個列表,以做為模型的輸出。我
們稍後就會看到,單詞內嵌其中所包含的單詞含義與關係,會在空間中以編碼的方式呈
現,而這種編碼方式同時也具有一些相當好用的特性(例如可進行向量算術運算)。
Word2Vec 模型與演算法
這個演算法首先會根據輸入訓練資料建立一個詞彙表(vocabulary),然後為每個單詞打
造專屬的表達方式。一開始,我們並不是像其他向量化技術一樣,只針對所要處理的文 ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

高效能網站建置指南

高效能網站建置指南

Steve Souders
初探深度學習|使用TensorFlow

初探深度學習|使用TensorFlow

Reza Zadeh, Bharath Ramsundar
深度学习实战

深度学习实战

Douwe Osinga

Publisher Resources

ISBN: 9789865020262