Skip to Content
精通特征工程
book

精通特征工程

by Alice Zheng, Amanda Casari
April 2019
Intermediate to advanced
172 pages
4h 39m
Chinese
Posts & Telecom Press
Content preview from 精通特征工程
108
8
自动特征生成:图像特征提取和
深度学习
影像和声音是人类固有的感官输入。我们的大脑天生适合快速发展处理视觉和听觉信号的
能力,有些系统甚至在出生之前就可以对刺激做出反应(
Eliot, 2000
)。另一方面,语言
能力则是靠学习得到的,它需要几个月来发展,而完全掌握则需要好几年。很多人的视
觉和听觉能力的发展都是自然而然的,但所有人都必须有意地训练自己的大脑来理解和
使用语言。
有趣的是,对于机器学习来说,情况则正好相反。我们在文本分析应用方面取得的进展要
远远多于图像和音频应用。以搜索问题为例,人们已经享受了多年在信息检索和文本搜索
方面的成果,而图像和音频搜索还在走向成熟的途中(然而在过去的
5
年中,深度学习模
型取得了突破性发展,这可能预示着在图像和语音分析领域会出现人们期待已久的革命性
成果)。
进展中的困难与从图像和音频数据中提取有意义特征的难度直接相关。机器学习模型需要
语义上有意义的特征来做出语义上有意义的预测。在文本分析中,尤其是在像英语这样语
义上有意义的基本单位(单词)很容易提取的语言中,进展可以非常快速。另一方面,图
像和声音是以数字像素或波形来记录的。图像中的单个“原子”是一个像素。在音频数据
中,基本单位是对波形密度的一次测量。这些单位包含的语义信息要比文本数据的基本单
位(单词)少。因此,与文本相比,图像和音频上的特征提取和特征工程要困难得多。
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

精通機器學習

精通機器學習

Aurélien Géron

Publisher Resources

ISBN: 9787115509680