434 Large Scale and Big Data
37. N. Tolia, M. Kozuch, M. Satyanarayanan, B. Karp, T.C. Bressoud, and A. Perrig.
Opportunistic use of content addressable storage for distributed le systems. In USENIX,
pages 127–140, 2003.
38. J.S. Vitter. Random sampling with a reservoir. ACM Trans. Math. Software, 11(1):37–
57, 1985.
39. M. Weis and F. Naumann. Dogmatrix tracks down duplicates in xml. In Proc. ACM
SIGMOD, pages 431–442, 2005.