Skip to Content
Spark:权威指南
book

Spark:权威指南

by Bill Chambers, Matei Zaharia
May 2025
Intermediate to advanced
606 pages
7h 38m
Chinese
O'Reilly Media, Inc.

Overview

本作品已使用人工智能进行翻译。欢迎您提供反馈和意见:translation-feedback@oreilly.com

通过这本由开源集群计算框架的创建者撰写的综合指南,了解如何使用、部署和维护 Apache Spark。作者 Bill Chambers 和 Matei Zaharia 重点介绍了 Spark 2.0 的改进和新功能,并将 Spark 主题分为多个部分,每个部分都有独特的主题。

您将探索 Spark 结构化 API 的基本操作和常见功能,以及用于构建端到端流应用程序的新高级 API——结构化流。开发人员和系统管理员将学习监控、调整和调试 Spark 的基础知识,并探索机器学习技术和使用 Spark 可扩展机器学习库 MLlib 的场景。

  • 轻松了解大数据和 Spark
  • 通过实例学习 DataFrames、SQL 和 Datasets(Spark 的核心 API)
  • 深入探讨 Spark 的低级 API、RDD 以及 SQL 和 DataFrames 的执行
  • 了解 Spark 在集群上的运行方式
  • 调试、监控和调整 Spark 集群和应用程序
  • 掌握结构化流处理(Structured Streaming)——Spark的流处理引擎
  • 学习如何将 MLlib 应用于各种问题,包括分类或推荐
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

设计数据密集型应用程序

设计数据密集型应用程序

Martin Kleppmann
Kafka权威指南(第2版)

Kafka权威指南(第2版)

Gwen Shapira, Todd Palino, Rajini Sivaram, Krit Petty
低代码AI

低代码AI

Gwendolyn Stripling, Michael Abel

Publisher Resources

ISBN: 9798341656932