Skip to Content
大规模数据分析和建模:基于 Spark 与 R
book

大规模数据分析和建模:基于 Spark 与 R

by Javier Luraschi, Kevin Kuo, Edgar Ruiz
July 2020
Intermediate to advanced
262 pages
5h 34m
Chinese
China Machine Press
Content preview from 大规模数据分析和建模:基于 Spark 与 R
162
9
使用配置文件的另一个好处是,系统管理员可以通过更改 R_CONFIG_ACTIVE 环境变
量的值来更改默认配置。若想获取更多信息,请参阅 GitHub rstudio/config 仓库。
9.8
小结
本章介绍了 Spark 内部机制和具体设置的诸多内容,以帮助你加速计算,并启用高计算
负载。它可以为理解常见配置的瓶颈和指导提供基础。然而,调试 Spark 是一个广泛的
话题,需要更多的章节来详尽介绍。因此,在对 Spark 的性能和可扩展性进行故障排除、
搜索网络资源和咨询在线社区时,通常还需要对你的特定环境进行调试。
10 章会介绍 R 中提供的 Spark 扩展的生态系统。大多数扩展都是高度专业化的,但
是它们对于特定情况以及有特殊需求的读者来说都是非常有用的。例如,它们可以处理
嵌套数据、执行图分析、使用不同的建模库(如 H20 中的 rsparkling)。此外,接下
来的几章会介绍许多高级数据分析和建模话题。这些话题是掌握 R 语言大规模计算所必
需的。
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

机器学习实战:基于Scikit-Learn、Keras 和TensorFlow (原书第2 版)

机器学习实战:基于Scikit-Learn、Keras 和TensorFlow (原书第2 版)

Aurélien Géron
数字化转型:企业破局的34 个锦囊

数字化转型:企业破局的34 个锦囊

Gary O’Brien, Xiao Guo, Mike Mason

Publisher Resources

ISBN: 9787111661016