Summary

In this chapter we covered in depth the various sources which you can use to ingest log data into Flume, including the following:

  • The exec source
  • Syslog sources (UDP, TCP, and multiport TCP)

We discussed replicating the old TailSource functionality in Flume 0.9 and the problems using tail semantics in general.

We also covered channel selectors and how to send events to one of more channels. Specifically:

  • The replicating channel selector
  • The multiplexing channel selector

Optional channels were also discussed as a way to only fail a Channel put transaction for only some of the Channels when more than one is used.

In the next chapter, we'll introduce Interceptors that will allow in-flight inspection and transformation of Events. Used in conjunction ...

Get Apache Flume: Distributed Log Collection for Hadoop now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.