Hands-On LLM Serving and Optimization (Chinese Edition)

by Chi Wang, Peiheng Hu
May 2026
Intermediate
374 pages
5h 3m
Chinese
O'Reilly Media, Inc.

Overview

This work has been translated using AI. Feedback and comments are welcome: translation-feedback@oreilly.com

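Large language models (LLMs) are the reasoning engines of modern AI. An important inflection point has now arrived: as the world races to deploy AI at scale, model inference has become the core of the AI stack. Welcome to the age of inference.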

Without proper optimization, however, serving LLMs can be both expensive and slow. Hands-On LLM Serving and Optimization is a comprehensive guide to the complexities of deploying and optimizing LLMs at scale.

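In this engineering-focused book, authors Chi Wang and Peiheng Hu combine practical examples, code, and strategies for building robust, high-performance, low-cost AI token factories. Whether you are building LLM inference infrastructure or building applications that consume it, a deep understanding of LLM serving will make you a more effective, future-ready engineer as AI changes the way we work and build.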

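  • Understand the fundamentals of model serving through core concepts, design paradigms, and industry best practices
  • Learn the common challenges of hosting LLMs at scale
  • Balance latency and throughput to meet the needs of AI applications and business requirements
  • Host LLMs cost-effectively using practical, code-backed techniques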

Publisher Resources

ISBN: 0642572383695