Skip to Content
Spark快速大数据分析(第2版)
book

Spark快速大数据分析(第2版)

by Jules S. Damji, Brooke Wenig, Tathagata Das, Denny Lee
November 2021
Intermediate to advanced
340 pages
10h 46m
Chinese
Posts & Telecom Press
Content preview from Spark快速大数据分析(第2版)
94
4
-rw-r--r-- 1 jules wheel 0 May 17 11:54 _SUCCESS
-rw-r--r-- 1 jules wheel 526 May 17 11:54 part-00000-ffdf70f4-<...>-c000.avro
4. Avro
数据源选项
4-5
列出了
DataFrameReader
DataFrameWriter
的常用选项。完整的选项列表请参见
文档。
4-5
DataFrameReader
DataFrameWriter
中的Avro选项
属 性 名 默 认 值 含  义 作用范围
avroSchema
可选参数,用户可以使用
JSON
格式提供
Avro
的字
段定义。
Avro
记录的字段名和数据类型必须与
Avro
输入数据或
Catalyst
数据(
Spark
内部数据类型)保
持一致,否则会导致操作失败
/
recordName topLevelRecord
所写结果的顶层记录名,这是
Avro
记录规范要求的
recordNamespace ""
所写结果的命名空间
ignoreExtension true
如果开启本选项,则会加载所有文件(不论扩展名
是否为
.avro
),否则会忽略没有
.avro
扩展名的文件
compression snappy
允许指定写文件时使用的压缩编码。目前支持设为
uncompressed
snappy
deflate
bzip2
xz
。如
果没有设置这个选项,那么会使用
spark.sql.avro.
compression
对应的值 ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

数据驱动力:企业数据分析实战

数据驱动力:企业数据分析实战

Carl Anderson
数据压缩入门

数据压缩入门

Colt McAnlis, Aleks Haecky
解密金融数据

解密金融数据

Justin Pauley

Publisher Resources

ISBN: 9787115576019