Book description
Learn to process massive real-time data streams using Storm and Python - no Java required!
About This Book
- Learn to use Apache Storm and the Python Petrel library to build distributed applications that process large streams of data
- Explore sample applications in real-time and analyze them in the popular NoSQL databases MongoDB and Redis
- Discover how to apply software development best practices to improve performance, productivity, and quality in your Storm projects
Who This Book Is For
This book is intended for Python developers who want to benefit from Storm's real-time data processing capabilities. If you are new to Python, you'll benefit from the attention to key supporting tools and techniques such as automated testing, virtual environments, and logging. If you're an experienced Python developer, you'll appreciate the thorough and detailed examples
What You Will Learn
- Install Storm and learn about the prerequisites
- Get to know the components of a Storm topology and how to control the flow of data between them
- Ingest Twitter data directly into Storm
- Use Storm with MongoDB and Redis
- Build topologies and run them in Storm
- Use an interactive graphical debugger to debug your topology as it's running in Storm
- Test your topology components outside of Storm
- Configure your topology using YAML
In Detail
Big data is a trending concept that everyone wants to learn about. With its ability to process all kinds of data in real time, Storm is an important addition to your big data ?bag of tricks.?
At the same time, Python is one of the fastest-growing programming languages today. It has become a top choice for both data science and everyday application development. Together, Storm and Python enable you to build and deploy real-time big data applications quickly and easily.
You will begin with some basic command tutorials to set up storm and learn about its configurations in detail. You will then go through the requirement scenarios to create a Storm cluster. Next, you'll be provided with an overview of Petrel, followed by an example of Twitter topology and persistence using Redis and MongoDB. Finally, you will build a production-quality Storm topology using development best practices.
Style and approach
This book takes an easy-to-follow and a practical approach to help you understand all the concepts related to Storm and Python.
Table of contents
-
Building Python Real-Time Applications with Storm
- Table of Contents
- Building Python Real-Time Applications with Storm
- Credits
- About the Authors
- About the Reviewers
- www.PacktPub.com
- Preface
- 1. Getting Acquainted with Storm
- 2. The Storm Anatomy
- 3. Introducing Petrel
- 4. Example Topology – Twitter
- 5. Persistence Using Redis and MongoDB
- 6. Petrel in Practice
- A. Managing Storm Using Supervisord
- Index
Product information
- Title: Building Python Real-Time Applications with Storm
- Author(s):
- Release date: December 2015
- Publisher(s): Packt Publishing
- ISBN: 9781784392857
You might also like
book
Data Science with Raspberry Pi: Real-Time Applications Using a Localized Cloud
Implement real-time data processing applications on the Raspberry Pi. This book uniquely helps you work with …
article
Run Llama-2 Models Locally with llama.cpp
Llama is Meta’s answer to the growing demand for LLMs. Unlike its well-known technological relative, ChatGPT, …
book
Real-Time IoT Imaging with Deep Neural Networks: Using Java on the Raspberry Pi 4
This book shows you how to build real-time image processing systems all the way through to …
article
Use Github Copilot for Prompt Engineering
Using GitHub Copilot can feel like magic. The tool automatically fills out entire blocks of code--but …