October 2016
Intermediate to advanced
618 pages
10h 44m
English
This recipe will take a look at how to parse and write large amount of data. Since XML is probably the most verbose of data formats, we will look briefly at how to generate large amount of junk data and then how to process those in a memory efficient way.
The data.xml library is our library of choice here, so let's add it to the project.clj if it is not already:
[org.clojure/data.xml "0.1.0-beta1"]
We will also make use of the core Java streaming and zipping functions that are available in the JVM. We know we can access host functions just by importing them in the current namespace, so nothing more to prepare here.
So that's it. Let's go for a large dump of data onto our file ...
Read now
Unlock full access