Introduction to Deep Learning 4: Reinforcement Learning (深度学习入门4:强化学习)

by 斋藤康毅 (Koki Saito)
August 2024
Beginner to intermediate
333 pages
9h 15m
Chinese
Posts & Telecom Press

Overview

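Following the style of the "fish book" series, this book provides working code so that you learn by doing, and it implements the foundational techniques behind reinforcement learning from scratch, without relying on external libraries.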

What makes this book distinctive?

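  • Grasping what changes and what stays constant amid the trends
  • In the fast-moving field of deep learning, some things change and some do not. Some fade with the trends, while others are passed on. The book moves naturally from foundational reinforcement learning methods, such as Markov decision processes, the Bellman equation, Monte Carlo methods, and temporal-difference learning, to neural networks and deep learning, and selects from a cutting-edge perspective the reinforcement learning theory and methods most worth readers' attention.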

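  • Rich content, explained simply and clearly
  • As the fourth title in the hugely popular "fish book" series, this book continues the series' style of writing and explanation. Abundant figures, tables, and code examples, together with relaxed, concise explanations, help readers understand convincingly how the various reinforcement learning methods relate to one another, and become proficient almost without noticing.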

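  • Equal emphasis on theory and practice
  • The book aims to ensure that readers gain a firm grasp of the distinctive theory of reinforcement learning. Following the principle that "you only truly understand something once you have built it," it explains every component of the subject in detail from both the theoretical and the practical side, and encourages readers to try things hands-on. Unlike books that explain theory through mathematical formulas alone, this one lets readers gain many striking insights by actually running its code.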

Publisher Resources

ISBN: 9787115649171