Skip to Content
R在数据科学中的应用,第2版
book

R在数据科学中的应用,第2版

by Hadley Wickham, Mine Cetinkaya-Rundel, Garrett Grolemund
May 2025
Intermediate to advanced
578 pages
8h 9m
Chinese
O'Reilly Media, Inc.
Content preview from R在数据科学中的应用,第2版

第一部分: 整个游戏

本书这一部分的目标是让你快速了解数据科学的主要工具:导入Tidy转换可视化数据如图 I-1 所示。我们希望向你展示数据科学的 "全局",让你对所有主要部分有足够的了解,这样你就可以处理真实的、简单的数据集。本书的后半部分将更深入地讨论这些主题中的每一个,从而扩大你可以应对的数据科学挑战的范围。

A diagram displaying the data science cycle: Import -> Tidy -> Understand  (which has the phases Transform -> Visualize -> Model in a cycle) -> Communicate. Surrounding all of these is Program Import, Tidy, Transform, and Visualize is highlighted.

图 I-1. 在本书的这一部分,你将学习如何导入、Tidy、转换和可视化数据。

其中四章重点介绍数据科学工具:

  • 可视化是开始学习 R 编程的绝佳起点,因为它的回报是显而易见的:你可以绘制优雅而翔实的图表,帮助你理解数据。在第 1 章中,你将深入学习可视化,了解 ggplot2 图形的基本结构,以及将数据转化为图形的强大技术。

  • 仅有可视化通常是不够的,因此在第 3 章中,您将学习一些关键动词,以便选择重要变量、筛选出关键观测值、创建新变量并计算摘要。

  • 第 5 章中,你将学习 Tidy 数据,这是一种一致的数据存储方式,能让转换、可视化和建模变得更容易。你将了解其基本原理以及如何将数据转化为整洁的形式。

  • 在转换和可视化数据之前,您需要先将数据导入 R。在第 7 章中,您将学习将.csv 文件导入 R 的基础知识。

在这几章中,还有四章是关于 R 工作流程的。在第 2 章第 4 章第 6 章中,你将学习到编写和组织 R 代码的良好工作流程实践。从长远来看,这将为你的成功奠定基础,因为它们将为你提供在处理实际项目时保持条理清晰的工具。最后,第 8 章将教你如何获得帮助并不断学习。

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

R深度学习权威指南

R深度学习权威指南

Posts & Telecom Press, Joshua F. Wiley
AI工程

AI工程

Chip Huyen
Raku学习手册

Raku学习手册

brian d foy

Publisher Resources

ISBN: 9798341657304