Python for Excel (Chinese Edition), 2nd Edition

Name: Python for Excel (Chinese Edition), 2nd Edition
Author: Felix Zumstein
ISBN: 0642572396008

by Felix Zumstein

May 2026

Intermediate

418 pages

5h 50m

Chinese

O'Reilly Media, Inc.


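Preface
How This Book Is Organized; What's New in the Second Edition; Who This Book Is For; Why I Wrote This Book; Python and Excel Versions; Conventions Used in This Book; Using Code Examples; O'Reilly Online Learning; How to Contact Us; Acknowledgments
I. Introduction to Python
1. Why Python for Excel?
Python Core Principles; General-Purpose Programming Language; Readability Counts; Batteries Included; Cross-Platform; Scientific Computing; Modern Language Features; Text Processing; Array Support; Web APIs; Error Handling; Tooling; Package Manager; Testing; Version Control; Conclusion
2. Development Environment
Companion Repository; Using the Terminal; Installing Python; Installing uv; Installing Python and Packages; Python REPL; Jupyter Notebooks; Running Jupyter Notebooks; Notebook Cells; Edit Mode vs. Command Mode; Run Order Matters; Visual Studio Code; Installation and Configuration; Running a Python Script; Debugger; Running Jupyter Notebooks with VS Code; Conclusion
3. Getting Started with Python
Data Types; Objects; Numeric Types; Comments; Booleans; Strings; Indexing and Slicing; Indexing; Slicing; Data Structures; Lists; Dictionaries; Tuples; Sets; Control Flow; Code Blocks and the pass Statement; The if Statement and Conditional Expressions; The for and while Loops; List Comprehensions; Code Organization; Functions; Modules and the import Statement; The datetime Class; Code Formatting, Linting, and Type Hints; PEP 8: Style Guide for Python Code; Ruff; Type Hints; Conclusion
II. Introduction to pandas
4. NumPy Foundations
Getting Started with NumPy; NumPy Array; Vectorization and Broadcasting; Universal Functions (ufunc); Creating and Manipulating Arrays; Getting and Setting Array Elements; Useful Array Constructors; View vs. Copy; Conclusion
5. Data Analysis with pandas
DataFrame and Series; Index; Columns; Data Manipulation; Selecting Data; Setting Data; Missing Data; Duplicate Data; Arithmetic Operations; Text Processing; Combining DataFrames; Concatenating; Joining and Merging; Descriptive Statistics and Data Aggregation; Descriptive Statistics; Grouping; Pivot Tables; Plotting; Matplotlib; Plotly; Importing and Exporting DataFrames; Exporting CSV Files; Importing CSV Files; Modern Alternatives; Conclusion
III. Python in Excel
6. Getting Started with Python in Excel
Requirements; Writing Python Code in Excel; Python Cells; Output Types; The xl Function; Working with pandas DataFrames; Cell Organization; Python Editor; Plotting; Python in Excel Under the Hood; Initialization Script; Row-Major Cell Execution Order; DataFrame Round Trip; Settings; Keyboard Shortcuts; Limitations; Conclusion
7. Time Series Analysis with pandas
DatetimeIndex; Creating a DatetimeIndex; Filtering a DatetimeIndex; Common Time Series Manipulations; Shifting and Percentage Changes; Rebasing and Correlation; Resampling; Rolling Windows; Loading Data with Power Query; Loading a Single File; Loading Files from a Folder; Conclusion
IV. xlwings
8. Excel Automation
Getting Started with xlwings; Using Excel as a Data Viewer; The Excel Object Model; Running VBA Code; Converters, Options, and Collections; Working with DataFrames; Converters and Options; Charts, Pictures, and Defined Names; Reporting Case Study (First Version); Advanced xlwings Topics; Improving Performance; Missing Functionality; Conclusion
9. Python-Powered Excel Tools
Using Excel as a Frontend; xlwings Add-in; Quickstart Command; Run Main; Running Python Functions; Deployment; Python Dependency; Standalone Workbooks; xlwings Configuration; Conclusion
10. Python Package Tracker
Trying It Out; Core Functionality; Web APIs; Databases; Exceptions; Application Walkthrough; Frontend; Backend; Debugging; Conclusion
11. Custom Functions
Getting Started with Custom Functions; Prerequisites; Hello World; Working with DataFrames; Converters and Options; Fetching Data from the Web; Plotting; Debugging; Performance Optimization; Minimizing Cross-Application Calls; Caching; Script Decorator; Conclusion
V. xlwings Lite
12. Scripting with xlwings Lite
Getting Started with xlwings Lite; How xlwings Lite Works; Installation; Tour of the User Interface; Running Your First Script; Sheet Buttons; Analyzing the NYC Taxi Dataset; Exploring the Dataset with DuckDB; Analyzing Trips and Distance by Hour; Platform Features; Sending Web API Requests; Installing Python Packages; Limitations; Conclusion
13. Custom Functions with xlwings Lite
Modern Custom Functions; Hello World; Type Hints as Converters; Error Handling; Organizing Functions with Namespaces; Building an AI Function with OpenAI; Managing Credentials with Environment Variables; The AI Function; Sentiment Analysis with Hugging Face; Machine Learning Terminology; The CLASSIFY Function; Plotting Weather Data with Open-Meteo; Handling Dates; Plotting Temperature over Time; Conclusion
VI. Reading and Writing Excel Files Without Excel
14. Excel File Manipulation with pandas
Reporting Case Study (Second Version); Reading and Writing Excel Files; The read_excel Function and ExcelFile Class; Reading Sheets in Parallel; The to_excel Method and ExcelWriter Class; Limitations; Conclusion
15. Excel File Manipulation with Reader and Writer Packages
Reading and Writing Excel Files; Calamine-Based Excel Readers; python-calamine; fastexcel; XlsxWriter; Getting Started with XlsxWriter; Writing Large Files with XlsxWriter; OpenPyXL; Reading with OpenPyXL; Reading Large Files with OpenPyXL; Writing with OpenPyXL; Writing Large Files with OpenPyXL; Editing with OpenPyXL; More Reader and Writer Packages; Excelize; pyxlsb; xlrd, xlwt, and xlutils; Writing Formatted DataFrames to Excel; Formatting a DataFrame's Index and Headers; Formatting a DataFrame's Data Part; Reporting Case Study (Third Version); Conclusion
A. The uv Package Manager
Creating a New Project; Adding and Removing Packages; Upgrading Packages and uv; Working Outside the Project Directory; Other Package Managers
B. Copilot in Excel
The Copilot Family; Prompt Engineering; Getting Started with Copilot in Excel; Requirements; Starting a Conversation
C. Advanced Python Concepts
Classes and Objects; Working with Time-Zone-Aware datetime Objects; Mutable vs. Immutable Python Objects; Calling Functions with Mutable Objects as Arguments; Functions with Mutable Objects as Default Arguments
Index
About the Author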

Content preview from Python for Excel (Chinese Edition), 2nd Edition

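Chapter 5. Data Analysis with pandas

This work has been translated using AI. We welcome your feedback and comments: translation-feedback@oreilly.com

This chapter introduces you to pandas, the Python Data Analysis Library, or, as I like to put it, the Python-based spreadsheet with superpowers. pandas makes tasks that are particularly painful in Excel easier, faster, and less error-prone. Some of these tasks include getting datasets from external sources and working with statistics, time series, and interactive charts. If you are familiar with Power Query in Excel, pandas covers similar functionality but with more flexibility. pandas' most important superpowers are vectorization and data alignment. As we saw in the previous chapter with NumPy arrays, vectorization allows you to write concise, array-based code, while data alignment makes sure that values are matched by their labels, so there is no data mismatch when you work with multiple datasets.

This chapter takes you through a complete data analysis journey: it starts with cleaning and preparing data, then shows how to make sense of larger datasets through aggregation, descriptive statistics, and visualization. At the end of the chapter, we'll see how to import and export data with pandas. But first things first: let's start with pandas' main data structures, DataFrame and Series!

DataFrame and Series

DataFrame and Series are the core data structures in pandas. In this section, I introduce the main components of a DataFrame: index, columns, and data. A DataFrame is similar to a two-dimensional NumPy array, but it comes with column and row labels, and each column can hold a different data type. By extracting a single column or row from a DataFrame, you get a one-dimensional Series. Likewise, a Series is similar to a one-dimensional NumPy array with labels. When you look at the structure of a DataFrame in Figure 5-1, it's not hard to see that DataFrames are going to be your Python-based spreadsheets.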
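To make the two "superpowers" concrete, here is a minimal sketch. The two Series and their fruit labels are made-up illustration data, not taken from the book. The example shows an element-wise operation without a loop (vectorization) and an addition that matches values by label rather than by position (data alignment).

```python
import pandas as pd

# Hypothetical unit sales for two quarters; the labels only partly overlap
# and appear in a different order.
q1 = pd.Series({"apples": 10, "pears": 5, "plums": 2})
q2 = pd.Series({"plums": 3, "apples": 4, "kiwis": 7})

# Vectorization: the multiplication applies to every element, no loop needed.
q1_doubled = q1 * 2

# Data alignment: values are matched by label, not by position.
# Labels that exist in only one Series produce NaN in the result.
total = q1 + q2

# Treat a label missing from one side as 0 instead of producing NaN.
total_filled = q1.add(q2, fill_value=0)

print(q1_doubled)
print(total)
print(total_filled)
```

Whether NaN or a fill value is the right choice for unmatched labels depends on the analysis; the `fill_value=0` variant is only reasonable when a missing label really means "nothing sold".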

Diagram comparing a pandas Series and DataFrame, highlighting their structure with indices, columns, and data orientation along axis 0 and axis 1.
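The three components named above (index, columns, and data) can be inspected directly. The following is a small sketch with hypothetical values chosen only to illustrate the structure. It also shows that selecting a single column or a single row returns a Series.

```python
import pandas as pd

# Hypothetical values, used only to illustrate the structure of a DataFrame
df = pd.DataFrame(
    data=[["Anna", 1990, 7.5], ["Ben", 1985, 6.0], ["Cara", 2001, 8.2]],
    columns=["name", "born", "score"],
    index=[10, 11, 12],
)

print(df.index)    # row labels (axis 0)
print(df.columns)  # column labels (axis 1)
print(df.dtypes)   # each column carries its own data type

# Selecting one column returns a Series labeled by the row index
name_column = df["name"]

# Selecting one row by its label returns a Series labeled by the column names
first_row = df.loc[10]

print(type(name_column).__name__, type(first_row).__name__)
```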

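To show you how easy it is to go from a spreadsheet to a DataFrame, consider the Excel table in Figure 5-2, which shows the scores of students in an online course. You'll find the corresponding file students.xlsx in the xl folder of the companion repository.

Excel table showing user IDs, names, birth years, countries, scores, and continents for online course students.

To bring this Excel table into Python, start by importing pandas, then use its read_excel function, which returns a DataFrame: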

In [1]: import pandas as pd

In [2]: pd.read_excel("xl/students.xlsx", engine="calamine")

Out[2]:    user_id   name  born  country  score  continent
        0     1001   Mark  1966    Italy    4.5     Europe
        1     1000   John  1988      USA    6.7    America
        2     1002    Tim  1980      USA    3.9    America
        3     1003  Jenny  2009  Germany    9.0     Europe
        ...
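If you run the call outside the companion repository, the relative path xl/students.xlsx may not resolve. The sketch below wraps the same read_excel call in a small helper; the name load_students and the choice to return None for a missing file are assumptions made for illustration, not part of the book.

```python
from pathlib import Path

import pandas as pd


def load_students(path="xl/students.xlsx"):
    """Return the workbook's first sheet as a DataFrame, or None if the file is missing.

    Hypothetical helper: the function name and the None fallback are
    illustrative choices, not the book's code.
    """
    file = Path(path)
    if not file.exists():
        return None
    # Same call as in the text; engine="calamine" needs the python-calamine
    # package to be installed alongside a recent pandas version.
    return pd.read_excel(file, engine="calamine")


df = load_students()
if df is None:
    print("students.xlsx not found; run this from the companion repository folder")
else:
    print(df.head())
```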

Figure 5-1. pandas Series and DataFrame

Figure 5-2. students.xlsx
