
358 High Performance Parallel I/O
FIGURE 32.4: Using on-memory deduplication provides extra performance
without any administration cost (i.e., installing a new file system).
becoming more important. Deduplication is a technique that has been mainly
used to reduce the size of data in various cases, such as in file systems
(ZFS [11]), virtual machines (KVM, XEN [5], VMware [12]), and special
tagged memory zones (KSM [2]). However, using deduplication to improve
I/O caching by effectively increasing the size of the I/O cache incurs signifi-
cant CPU overhead due to the cost of deduplication techniques. On the other
hand, with the increasing number of cores, ...