With the basics defined, let's create a simple pipeline that accepts user posts from an imaginary social media platform, extracts word frequencies, and determines which words are trending for any given time. The following example code is available in this book's source repository under chapter_15/example_01. Getting started with Cloud Dataflow pipelines requires importing either the Cloud Dataflow or Apache Beam SDK into your Java project. For Maven projects, this is done by adding the core SDK to the Maven POM file:
<dependency> <groupId>com.google.cloud.dataflow</groupId> <artifactId>google-cloud-dataflow-java-sdk-all</artifactId> <version>2.5.0</version></dependency>
You'll also likely want to integrate ...