Samza is an open source distributed stream processing framework originally developed at LinkedIn. It has the following features:
Some concepts in Samza are described in the following sections.
Samza processes streams of data—for example, website clickstreams, server logs, or any other event data. Messages can be added and read from a data stream. Multiple frameworks can access the same data stream and can partition the data based on the keys present in the message.
A Samza job is the computation logic that ...