July 2017
Beginner to intermediate
378 pages
10h 26m
English
Avro is an open source data serialization system from the Apache project. It writes data to a container file that has several useful properties. It uses a binary data format, which keeps files compact and also results in faster read times.
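
To make the compactness claim concrete, here is a minimal sketch of how Avro's binary encoding lays out one record, compared with the same record as JSON text. It hand-rolls the variable-length zig-zag integer encoding that the Avro specification describes for `int` and `long` values, and the length-prefixed UTF-8 encoding for strings. The record (`name`, `age`) and the helper names are hypothetical, chosen only for illustration. The sketch covers the record body only, not the container file header or sync markers.

```python
import json


def encode_long(n: int) -> bytes:
    """Encode a signed integer as a zig-zag varint (Avro int/long encoding)."""
    z = (n << 1) ^ (n >> 63)  # zig-zag: small magnitudes map to small unsigned values
    out = bytearray()
    while True:
        byte = z & 0x7F
        z >>= 7
        if z:
            out.append(byte | 0x80)  # high bit set: more bytes follow
        else:
            out.append(byte)
            return bytes(out)


def encode_string(s: str) -> bytes:
    """Encode a string as its byte length (as a long) followed by UTF-8 bytes."""
    data = s.encode("utf-8")
    return encode_long(len(data)) + data


# Hypothetical record: fields are written in schema order, with no field names.
record = {"name": "Alice", "age": 30}
avro_body = encode_string(record["name"]) + encode_long(record["age"])
json_body = json.dumps(record).encode("utf-8")

print(len(avro_body), "bytes as Avro binary")
print(len(json_body), "bytes as JSON text")
```

The binary form is smaller largely because field names and punctuation are not repeated per record; they live once in the schema.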

The structure of the data is stored in the container file itself, and rich data structures are supported. Avro file structures, or schemas, are defined using JSON. This is a schema-on-write process as far as file creation is concerned. Because the schema travels with the data, client applications do not have to generate code to define the data structure. This makes using Avro files for analytics ...
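
As a rough illustration of a JSON-defined schema and of reading data without generated code, the sketch below parses a schema at runtime and uses it to decode a binary record body. The `User` schema, its fields, and the helper functions are assumptions made up for this example. Only the `string`, `int`, and `long` primitive types are handled, and the container file framing is left out. A real application would use an Avro library rather than this hand-written decoder.

```python
import json

# Hypothetical schema, written in JSON as Avro schemas are.
SCHEMA_JSON = """
{
  "type": "record",
  "name": "User",
  "fields": [
    {"name": "name", "type": "string"},
    {"name": "age",  "type": "int"}
  ]
}
"""


def read_long(buf: bytes, pos: int):
    """Decode one zig-zag varint starting at pos; return (value, next_pos)."""
    acc = 0
    shift = 0
    while True:
        b = buf[pos]
        pos += 1
        acc |= (b & 0x7F) << shift
        if not (b & 0x80):  # high bit clear: last byte of this number
            break
        shift += 7
    return (acc >> 1) ^ -(acc & 1), pos  # undo zig-zag


def decode_record(schema: dict, buf: bytes) -> dict:
    """Walk the schema's fields in order and decode each value from buf."""
    pos = 0
    result = {}
    for field in schema["fields"]:
        if field["type"] in ("int", "long"):
            value, pos = read_long(buf, pos)
        elif field["type"] == "string":
            length, pos = read_long(buf, pos)
            value = buf[pos:pos + length].decode("utf-8")
            pos += length
        else:
            raise ValueError("unsupported type in this sketch: %r" % field["type"])
        result[field["name"]] = value
    return result


schema = json.loads(SCHEMA_JSON)
encoded = b"\x0aAlice\x3c"  # length-prefixed "Alice", then the int 30
print(decode_record(schema, encoded))
```

The decoder contains no `User` class: field names and types come entirely from the parsed schema, which is the property the paragraph above describes.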