Creating and executing pipelines

With the basics defined, let's create a simple pipeline that accepts user posts from an imaginary social media platform, extracts word frequencies, and determines which words are trending at any given time. The following example code is available in this book's source repository under chapter_15/example_01.

Getting started with Cloud Dataflow pipelines requires importing either the Cloud Dataflow SDK or the Apache Beam SDK into your Java project. For Maven projects, this is done by adding the core SDK as a dependency in the Maven POM file:

<dependency>
    <groupId>com.google.cloud.dataflow</groupId>
    <artifactId>google-cloud-dataflow-java-sdk-all</artifactId>
    <version>2.5.0</version>
</dependency>
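Before any SDK code is written, it can help to pin down what the pipeline is meant to compute. The following is a minimal sketch in plain Java, deliberately not using the Cloud Dataflow or Apache Beam SDK, of the word-frequency and trending-word logic described above. The class name TrendingWordsSketch, its method names, and the sample posts are all hypothetical and are not taken from the book's example code; a real pipeline would express the same steps as distributed transforms and would also account for time.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical, SDK-free illustration of the computation only.
public class TrendingWordsSketch {

    // Counts how often each word appears across all posts, ignoring case.
    public static Map<String, Long> wordFrequencies(List<String> posts) {
        Map<String, Long> counts = new HashMap<>();
        for (String post : posts) {
            // Split on runs of characters that are not letters, digits, or apostrophes.
            String[] words = post.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}']+");
            for (String word : words) {
                if (!word.isEmpty()) {
                    counts.merge(word, 1L, Long::sum);
                }
            }
        }
        return counts;
    }

    // Returns up to n words, most frequent first; ties are broken alphabetically.
    public static List<String> topWords(Map<String, Long> counts, int n) {
        List<Map.Entry<String, Long>> entries = new ArrayList<>(counts.entrySet());
        entries.sort((a, b) -> {
            int byCount = Long.compare(b.getValue(), a.getValue());
            return byCount != 0 ? byCount : a.getKey().compareTo(b.getKey());
        });
        List<String> top = new ArrayList<>();
        for (Map.Entry<String, Long> entry : entries.subList(0, Math.min(n, entries.size()))) {
            top.add(entry.getKey());
        }
        return top;
    }

    public static void main(String[] args) {
        // Made-up posts standing in for the imaginary platform's input.
        List<String> posts = Arrays.asList(
            "Dataflow makes pipelines easy",
            "Pipelines, pipelines everywhere",
            "I love Dataflow");
        Map<String, Long> counts = wordFrequencies(posts);
        System.out.println(topWords(counts, 2));
    }
}
```

This sketch treats every post as belonging to a single batch. Determining what is trending at a given time would additionally require grouping posts by timestamp, which this sketch leaves out.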

You'll also likely want to integrate ...
