July 2017
Beginner to intermediate
378 pages
10h 26m
English
You will see SerDe mentioned often in Hadoop-related documentation. SerDe is short for Serialization/Deserialization. The serialization and deserialization refers to how files are transformed from the saved state into a standardized readable format. Serialization is part of creation of the file when writing and deserialization happens when a file is read. This allows files to be compressed and structured. Metadata about the file contents and data structure can also be saved as part of the file.
It is a way of abstracting away the details of decoding a file format from the client applications. It also allows multiple different formats to work seamlessly in the same environment. Some of these formats can ...