Skip to Content
Spark快速大数据分析(第2版)
book

Spark快速大数据分析(第2版)

by Jules S. Damji, Brooke Wenig, Tathagata Das, Denny Lee
November 2021
Intermediate to advanced
340 pages
10h 46m
Chinese
Posts & Telecom Press
Content preview from Spark快速大数据分析(第2版)
译者序
Spark
开创至今,已经走过了近
12
年。
12
年间,时代的脚步不断前进,我们看到互联网不
断发展,各种初创公司崭露头角,在公司日常业务中需要处理的数据量也飞速增长。数据
中心也从云下逐渐迁往云上,从单一云走向多云,批处理和流计算逐渐融合,数据仓库逐
渐走向湖仓一体,集群资源调度也越来越轻量化。现在,
Spark
将发布
3.2
版本。从问世
至今,
Spark
不断增强
,在大数据蓬勃发展的浪潮中占据越来越重要的位置。
Spark 3.0
的发
布标志着
Spark
进入了一个
全新的时代,本书的第
2
版正是根据
Spark 3.0
编写的,兼顾旧
版本保留的基本原理与大数据发展的新趋势,相信新老用户都可以从本书中得到新的收获。
8
年前
,刚刚从大学毕业的我,有幸误打误撞地走进了大数据这个领域,第一次接触到各
种各样的大数据软件。业界的这些大数据软件基本上是开源的,在大数据这个领域似乎商
业软件完全无法望其项背。
2014
,我开始参与
Spark
社区的开发,当时
1.0
版本尚未发
布,我对大数据也没有特别深入的认知,当时纯粹以自己掌握的数据库和编译原理的皮毛
知识参与其中。在这个过程中,我看到了很多牛人的代码,也结识了很多社区大佬,渐渐
Spark
有了一些了解
。后来,我加入阿里云。在大量的客户支持工作中,我才逐渐对整
个大数据生态有了一定的了解,也见证了
Spark
被越来越多的客户使用
,替换原有的技术
栈。毫无疑问,
Spark
在开源软件中是比较成功的
,活跃的开源社区为
Spark
贡献了非常
多的重要功能和改进,日益好用的
Spark
也正是开源社区给所有人的回馈 ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

数据驱动力:企业数据分析实战

数据驱动力:企业数据分析实战

Carl Anderson
数据压缩入门

数据压缩入门

Colt McAnlis, Aleks Haecky
解密金融数据

解密金融数据

Justin Pauley

Publisher Resources

ISBN: 9787115576019