Big Data

by James Warren, Nathan Marz
April 2015
Beginner to intermediate
328 pages
11h 1m
English
Manning Publications

Chapter 7. Batch layer: Illustration

This chapter covers

  • Sources of complexity in data-processing code
  • JCascalog as a practical implementation of pipe diagrams
  • Applying abstraction and composition techniques to data processing

In the last chapter you saw how pipe diagrams are a natural and concise way to specify computations that operate over large amounts of data. You also saw that pipe diagrams can be executed as a series of MapReduce jobs, which gives them parallelism and scalability.

In this illustration chapter, we’ll look at a tool that maps fairly directly onto pipe diagrams: JCascalog. There’s a lot to cover in JCascalog, so this chapter is more involved than the previous illustration chapters. As always, you can still learn the full ...


ISBN: 9781617290343