Skip to Content
For Enterprise
For Government
For Higher Ed
For Individuals
For Marketing
For Enterprise
For Government
For Higher Ed
For Individuals
For Marketing
Explore Skills
Cloud Computing
Microsoft Azure
Amazon Web Services (AWS)
Google Cloud
Cloud Migration
Cloud Deployment
Cloud Platforms
Data Engineering
Data Warehouse
SQL
Apache Spark
Microsoft SQL Server
MySQL
Kafka
Data Lake
Streaming & Messaging
NoSQL Databases
Relational Databases
Data Science
Pandas
R
MATLAB
SAS
D3
Power BI
Tableau
Statistics
Exploratory Data Analysis
Data Visualization
AI & ML
Generative AI
Machine Learning
Artificial Intelligence (AI)
Deep Learning
Reinforcement Learning
Natural Language Processing
TensorFlow
Scikit-Learn
Hyperparameter Tuning
MLOps
Programming Languages
Java
JavaScript
Spring
Python
Go
C#
C++
C
Swift
Rust
Functional Programming
Software Architecture
Object-Oriented
Distributed Systems
Domain-Driven Design
Architectural Patterns
IT/Ops
Kubernetes
Docker
GitHub
Terraform
Continuous Delivery
Continuous Integration
Database Administration
Computer Networking
Operating Systems
IT Certifications
Security
Network Security
Application Security
Incident Response
Zero Trust Model
Disaster Recovery
Penetration Testing / Ethical Hacking
Governance
Malware
Security Architecture
Security Engineering
Security Certifications
Design
Web Design
Graphic Design
Interaction Design
Film & Video
User Experience (UX)
Design Process
Design Tools
Business
Agile
Project Management
Product Management
Marketing
Human Resources
Finance
Team Management
Business Strategy
Digital Transformation
Organizational Leadership
Soft Skills
Professional Communication
Emotional Intelligence
Presentation Skills
Innovation
Critical Thinking
Public Speaking
Collaboration
Personal Productivity
Confidence / Motivation
Features
All features
Verifiable skills
AI Academy
Courses
Certifications
Interactive learning
Live events
Superstreams
Answers
Insights reporting
Radar Blog
Buy Courses
Plans
Sign In
Try Now
O'Reilly Platform
book
Python语言及其应用(第2版)
by
Bill Lubanovic
March 2022
Intermediate to advanced
522 pages
13h 52m
Chinese
Posts & Telecom Press
Content preview from
Python语言及其应用(第2版)
187
第
12
章
数据处理
只要你把数据折磨得够狠,大自然都会招供。
——
Ronald Coase
迄今为止,本书主要讨论的是
Python
语言本身
,即它的数据类型、代码结构、语法,等
等。本书余下部分主要关注
Python
如何应用于现实问题。
本章将介绍很多数据处理的实践技术。有时候,数据处理也被称作
数据整理
(
data
munging
)
,或者是数据库世界中更商业化的
ETL
(
extract/transform/load
,提取
/
转换
/
加
载)。尽管编程类图书通常并不会专门介绍该主题,但程序员往往要花大量时间将数据整
理成符合要求的形式。
数据科学
专业在过去几年中已经变得颇为流行。《哈佛商业评论》(
Harvard Business
Review
)的一篇文章称数据科学家是“
21
世纪最性感的工作”
。如果这意味着需求旺盛、
报酬丰厚,那还算好,但苦差事同样也少不了。数据库的
ETL
无法满足
数据科学的需求,
其中往往要用到
机器学习
来挖掘出人眼无法看到的深刻见解。
本章会从基本的数据格式开始,一路讲解到对数据科学最有用的新工具。
数据格式粗略地分为两类:
文本
和
二进制
。
Python
字符串
用于文本数据,本章包括了前面
略过的字符串相关内容:
•
Unicode
字符;
•
正则表达式模式匹配。
然后,本章会介绍二进制数据以及另外两种
Python
内建数据类型:
•
bytes
类型,用于不可变的
8
位值;
•
bytearray
类型,用于可变的
8
位值。
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial
You might also like
Python编程入门与实战
Posts & Telecom Press, Fabrizio Romano
Python实用技能学习指南
Posts & Telecom Press, Robert Smallshire, Austin Bingham
Python技术基础视频教程
保罗·J·戴特尔
Python面向对象编程指南
Posts & Telecom Press, Steven F. Lott
Publisher Resources
ISBN: 9787115586223