Skip to Content
深度学习:核心原理与案例分析
book

深度学习:核心原理与案例分析

by Posts & Telecom Press, Ahmed Menshawy
May 2024
Intermediate to advanced
389 pages
6h 49m
Chinese
Packt Publishing
Content preview from 深度学习:核心原理与案例分析

第8章 目标检测——CIFAR-10示例

在介绍了卷积神经网络(CNN)背后的基础知识和直觉(动机)之后,本章将在物体检测方面最著名的数据集上进行演示。同样地,你将会看到CNN前面的一些层是如何提取关于物体的一些基本特征的,而最后的卷积层将会提取更多的语义级特征,这些特征都是从前面层的基本特征中构建而来的。

本章将包含以下两个主题。

  • 目标检测。
  • CIFAR-10图像目标检测——构建与训练模型。

维基百科指出:

“目标检测——计算机视觉领域的技术,旨在查找和识别位于图像或视频序列中的对象。尽管当平移或旋转物体时,物体的图像可能有不同的视点、不同的大小和尺度,但是人类都能够很容易地识别出图像中的多个物体。甚至当物体被部分遮挡时,人类都能够识别出这些物体。但这项任务对于计算机视觉系统来说仍然是一项挑战。几十年来,完成这项任务的很多方法都已经实现了。”

图像分析是深度学习中最突出的领域之一。图像易于生成和处理,它们也是机器学习中的正确数据类型:对于人类容易理解,但对于计算机很难。所以图像分析在深度神经网络历史中发挥了关键作用也是不足为奇的。

随着自动驾驶汽车、面部检测、智能视频监控和人口统计解决方案的兴起,迫切需要快速、准确的目标检测系统。这些系统不仅包括图像中的目标识别和分类,还可以通过在它们周围绘制适当的方框来定位每一个目标。这也使得目标检测比传统的计算机视觉前身——图像分类任务更加困难。

本章的重点是目标检测,即找到图像中有哪些物体。例如,如图8.1所示,一辆自动驾驶汽车需要检测道路上的其他汽车。目标检测有很多复杂的算法,它们通常需要庞大的数据集、非常深的卷积网络和很长的训练时间。

图8.1 目标检测的例子(图片来源:B. C. Russell, A. ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

scikit-learn机器学习(第2版)

scikit-learn机器学习(第2版)

Posts & Telecom Press, Gavin Hackeling
自然语言处理与计算语言学

自然语言处理与计算语言学

Posts & Telecom Press, Bhargav Srinivasa-Desikan
Python机器学习案例精解

Python机器学习案例精解

Posts & Telecom Press, Yuxi (Hayden) Liu
Python数据分析(第2版)

Python数据分析(第2版)

Posts & Telecom Press, Armando Fandango

Publisher Resources

ISBN: 9781836201212