Skip to Content
深度学习:核心原理与案例分析
book

深度学习:核心原理与案例分析

by Posts & Telecom Press, Ahmed Menshawy
May 2024
Intermediate to advanced
389 pages
6h 49m
Chinese
Packt Publishing
Content preview from 深度学习:核心原理与案例分析

第1章 数据科学——鸟瞰全景

数据科学或机器学习是一个使机器能够在不被告知数据或编程的情况下从数据集中学习知识的过程。例如,编写一个能够将手写数字作为输入图像并根据输入的图像输出值为0~9的程序非常困难。这同样适用于将收到的电子邮件分为垃圾邮件或非垃圾邮件的任务。为了解决这些问题,数据科学家使用数据科学或机器学习领域的学习方法和工具,通过向计算机提供一些可以区分一位数字和另一位数字的解释性特征,教会计算机如何自动识别数字。对于垃圾邮件/非垃圾邮件问题也是如此,我们可以通过特定的学习算法教会计算机如何区分垃圾邮件和非垃圾邮件,而不是使用正则表达式并编写数百条规则来对收到的电子邮件进行分类。

 

对于垃圾邮件过滤程序,你可以通过基于规则的方法对它进行编码,但它不会用于生产中,比如邮件服务器中的程序。所以建立一个学习系统是一个理想的解决方案。

用户可能每天都在使用数据科学应用程序,却不知道它就是数据科学应用程序。例如,某机构可能使用某些系统来检测大家发布的信件的邮政编码,以便自动将它们转发到正确的区域。如果用户使用亚马逊网站,它们通常会推荐用户购买一些东西,亚马逊就是通过了解用户经常搜索或购买哪些东西做到这一点的。

建立一个训练的机器学习算法需要一些基础的历史数据样本,从中学习如何区分不同的例子,并从这些数据中了解一些知识和趋势。之后,训练算法可用于对未知数据进行预测。学习算法将使用原始历史数据,并将尝试从该数据中了解一些知识和趋势。

本章将全面介绍数据科学,包括数据科学如何像一个黑盒子一样工作,以及数据科学家每天面临的挑战。本章具体讨论以下主题。 ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

scikit-learn机器学习(第2版)

scikit-learn机器学习(第2版)

Posts & Telecom Press, Gavin Hackeling
自然语言处理与计算语言学

自然语言处理与计算语言学

Posts & Telecom Press, Bhargav Srinivasa-Desikan
Python机器学习案例精解

Python机器学习案例精解

Posts & Telecom Press, Yuxi (Hayden) Liu
Python数据分析(第2版)

Python数据分析(第2版)

Posts & Telecom Press, Armando Fandango

Publisher Resources

ISBN: 9781836201212