Table of Contents
Preface
Section 1: Apache Beam: Essentials
Chapter 1: Introduction to Data Processing with Apache Beam
Technical requirements
Why Apache Beam?
Writing your first pipeline
Running our pipeline against streaming data
Exploring the key properties of unbounded data
Measuring event time progress inside data streams
States and triggers
Timers
Assigning data to windows
Defining the life cycle of a state in terms of windows
Pane accumulation
Unifying batch and streaming data processing
Summary
Chapter 2: Implementing, Testing, and Deploying Basic Pipelines
Technical requirements
Setting up the environment for this book
Installing Apache Kafka
Making our code accessible from minikube
Installing Apache Flink
Reinstalling the complete ...