-rw-r--r-- 1 jules wheel 526 May 17 11:54 part-00000-ffdf70f4-<...>-c000.avro
4. Avro data source options
Table 4-5 lists the commonly used options for DataFrameReader and DataFrameWriter. For the complete list of options, see the documentation.
Table 4-5: Avro options for DataFrameReader and DataFrameWriter
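| Property name | Default value | Meaning | Scope |
|---|---|---|---|
| avroSchema | None | Optional. An Avro schema that the user supplies in JSON format. The field names and data types of the Avro record must match the Avro input data or the Catalyst data (Spark's internal data types); otherwise the operation fails. | Read/write |
| recordName | topLevelRecord | Top-level record name in the written output, which the Avro specification requires. | Write |
| recordNamespace | "" | Record namespace in the written output. | Write |
| ignoreExtension | true | If enabled, all files are loaded regardless of whether they have the .avro extension; otherwise, files without the .avro extension are ignored. | Read |
| compression | snappy | Compression codec to use when writing files. Currently supported values are uncompressed, snappy, deflate, bzip2, and xz. If this option is not set, the value of spark.sql.avro.compression.codec is used ... | |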
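The options in Table 4-5 can be passed to a reader or writer as ordinary key/value pairs. The following is a minimal PySpark sketch, not a definitive implementation. The helper names (`avro_write_options`, `write_avro`, `read_avro`) and the `Flight` schema are hypothetical and introduced only for illustration, and the Spark calls assume that the spark-avro package is available to the session.

```python
import json

# Codecs listed in Table 4-5 for the `compression` option.
SUPPORTED_CODECS = {"uncompressed", "snappy", "deflate", "bzip2", "xz"}

# Hypothetical Avro schema, used only for illustration.
EXAMPLE_SCHEMA = {
    "type": "record",
    "name": "Flight",
    "namespace": "com.example",
    "fields": [
        {"name": "origin", "type": "string"},
        {"name": "delay", "type": "int"},
    ],
}


def avro_write_options(record_name="topLevelRecord", record_namespace="",
                       compression="snappy", avro_schema=None):
    """Build a dict of Avro writer options (hypothetical helper, not a Spark API)."""
    if compression not in SUPPORTED_CODECS:
        raise ValueError(f"unsupported Avro compression codec: {compression!r}")
    options = {
        "recordName": record_name,
        "recordNamespace": record_namespace,
        "compression": compression,
    }
    if avro_schema is not None:
        # The avroSchema option expects the schema as a JSON string.
        options["avroSchema"] = json.dumps(avro_schema)
    return options


def write_avro(df, path, **kwargs):
    """Write a DataFrame as Avro (assumes spark-avro is on the classpath)."""
    opts = avro_write_options(**kwargs)
    df.write.format("avro").options(**opts).mode("overwrite").save(path)


def read_avro(spark, path, ignore_extension=True):
    """Read Avro files; ignoreExtension controls whether non-.avro files are loaded."""
    return (spark.read.format("avro")
            .option("ignoreExtension", str(ignore_extension).lower())
            .load(path))
```

With an active SparkSession named `spark` and a DataFrame `df` (both assumed here), a call such as `write_avro(df, "/tmp/flights_avro", compression="deflate")` followed by `read_avro(spark, "/tmp/flights_avro")` would exercise the write-scoped and read-scoped options respectively. Validating the codec before calling Spark is a design choice of this sketch: it reports an unsupported value early instead of leaving the failure to the write job.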