Creating a Data Pipeline for Consistent Data Collection, Processing, and Dissemination

In a data-intensive application, data travels in two directions and in two different forms. One form is the data returned to end users as part of a request. Gathering that data is usually a synchronous process, and in a distributed system it typically draws on a variety of data sources. Imagine we are building a context service that tells us everything we know about a given IP address attempting to access our secured network. The use case is to block any IP address that we believe belongs to a known malicious user. To keep the example simple, we would typically do the following:
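
As a minimal sketch of the synchronous, fan-out style of gathering described above (the source names, fields, and helper functions here are illustrative assumptions, not the book's exact steps), a context lookup that queries several sources and merges the results before answering the request might look like this:

```python
import concurrent.futures

# Hypothetical data sources; in a real deployment each of these would be a
# separate service or database queried over the network.
def geo_lookup(ip: str) -> dict:
    return {"country": "US"}           # placeholder result

def reputation_lookup(ip: str) -> dict:
    return {"known_malicious": False}  # placeholder result

def history_lookup(ip: str) -> dict:
    return {"previous_logins": 3}      # placeholder result

def build_ip_context(ip: str) -> dict:
    """Synchronously gather context about an IP from several sources."""
    sources = (geo_lookup, reputation_lookup, history_lookup)
    context = {"ip": ip}
    # Fan the lookups out in parallel, but block until all of them return,
    # because the caller's request cannot be answered without the full context.
    with concurrent.futures.ThreadPoolExecutor() as pool:
        for result in pool.map(lambda fn: fn(ip), sources):
            context.update(result)
    return context

if __name__ == "__main__":
    ctx = build_ip_context("203.0.113.7")
    if ctx["known_malicious"]:
        print(f"Blocking {ctx['ip']}")
    else:
        print(f"Allowing {ctx['ip']}: {ctx}")
```

The key property is that the request path waits for every source before responding, which is what makes this direction of data flow synchronous.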
