Skip to Content
Web机器学习
book

Web机器学习

by Posts & Telecom Press, Andrea Isoni
May 2024
Intermediate to advanced
234 pages
3h 58m
Chinese
Packt Publishing
Content preview from Web机器学习

第3章 有监督机器学习

本章讨论最常用的回归和分类技术。这类算法背后的机制是相同的。通常而言,有监督学习算法指的是分类和回归。本章,我们会依次讨论线性回归、朴素贝叶斯、决策树和支持向量机算法。我们将用这些算法解决一个分类问题和一个回归问题,以帮助你理解它们的使用方法。前言部分也曾讲过,有监督学习要用标注好的训练集训练模型,找到合适的参数值。跟之前一样,本章代码也已放到我的GitHub主页本章文件夹中,地址是https://github.com/ai2010/machine_learning_for_the_web/tree/master/chapter_3/

本章最后,我们将介绍另一种也可以实现分类的算法(隐马尔可夫模型),虽然它不是专门用来处理分类问题的。我们现在先来解释这些方法在预测数据集标签这类问题上常见的出错原因。

我们前面讲过,用训练好的模型去预测新数据的标签,预测结果的质量取决于模型的泛化能力,即正确预测在训练数据中未出现的数据的能力。该问题的研究文献很多,一般涉及两个概念:输出的偏差(bias)和方差(variance)。偏差是指由算法的错误假设导致的错误。给定标签为{{y}_{t}}的数据点{{x}^{(t)}},如果用不同的训练集训练,模型就会有偏差,预测结果将总是不同于。而方差误差(variance ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

人工智能技术与大数据

人工智能技术与大数据

Posts & Telecom Press, Anand Deshpande, Manish Kumar
神经网络算法与Java编程

神经网络算法与Java编程

Posts & Telecom Press, Fabio M. Soares, Alan M. F. Souza
Python图像处理实战

Python图像处理实战

Posts & Telecom Press, Sandipan Dey
面向MapReduce的Hadoop优化

面向MapReduce的Hadoop优化

Posts & Telecom Press, Khaled Tannir

Publisher Resources

ISBN: 9781836203612