Skip to Content
Python机器学习手册:从数据预处理到深度学习
book

Python机器学习手册:从数据预处理到深度学习

by Chris Albon
July 2019
Intermediate to advanced
365 pages
8h 13m
Chinese
Publishing House of Electronics Industry
Content preview from Python机器学习手册:从数据预处理到深度学习
v
在过去的几年中,机器学习已经渗透到企业、非营利组织和政府的日常运作中。随着机
器学习热度的增加,在对机器学习从业者的指导方面,涌现出一批高质量的文献。这些
文献培养了整整一代的数据科学家和机器学习工程师。这些文献提供了学习资源,为人
们讲解机器学习是什么及其工作原理。尽管这种方法富有成效,但却遗漏了一部分内容:
机器学习日常开发中的细节。这就是笔者写本书的动机——本书不是写给学生读者学习
机器学习理论的大部头,而是写给专业人士的“扳手型”工具书。我希望你把它放在书
桌上,把你感兴趣的某些页折起来,在日常开发中需要解决实际问题时就拿过来翻一翻。
更具体地说,本书采用基于任务的方式来介绍机器学习,有近
200
个独立的解决方案(你
可以复制并粘贴这些代码,它们都是可以正常运行的),针对的都是数据科学家或机器
学习工程师在构建模型时可能遇到的常见任务。
本书的最终目标是成为人们在构建真实的机器学习系统时的参考书。例如,假设你有一
JSON
文件,其中包含
1000
个数据分类特征和数值型特征,并且目标向量的分类不均
衡,你想得到一个可解释的模型,那么使用本书提供的解决方案可以帮助你解决如下问
题:
y
加载
JSON
文件(
2.5
节)
y
对特征进行标准化(
4.2
节)
y
对特征字典编码(
5.3
节)
y
填充缺失的分类值(
5.4
节)
y
使用主成分进行特征降维(
9.1
节)
y
使用随机搜索选择最佳模型(
12.2
节)
y
训练随机森林分类器(
14.4
节)
y
选择随机森林中的重要特征(
14.7
节)
vi
本书的目标是让你
1
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

精通特征工程

精通特征工程

Alice Zheng, Amanda Casari
精通機器學習

精通機器學習

Aurélien Géron
Python数据分析基础

Python数据分析基础

Clinton W. Brownley

Publisher Resources

ISBN: 9787121369629