Chapter 1. Mining Twitter: Exploring Trending Topics, Discovering What People Are Talking About, and More
Since this is the first chapter, we’ll take our time acclimating to our journey in social web mining. However, given that Twitter data is so accessible and open to public scrutiny, Chapter 9 further elaborates on the broad number of data mining possibilities by providing a terse collection of recipes in a convenient problem/solution format that can be easily manipulated and readily applied to a wide range of problems. You’ll also be able to apply concepts from future chapters to Twitter data.
Tip
Always get the latest bug-fixed source code for this chapter (and every other chapter) on GitHub. Be sure to also take advantage of this book’s virtual machine experience, as described in Appendix A, to maximize your enjoyment of the sample code.
Overview
In this chapter, we’ll ease into the process of getting situated with a minimal (but effective) development environment with Python, survey Twitter’s API, and distill some analytical insights from tweets using frequency analysis. Topics that you’ll learn about in this chapter include:
-
Twitter’s developer platform and how to make API requests
-
Tweet metadata and how to use it
-
Extracting entities such as user mentions, hashtags, and URLs from tweets
-
Techniques for performing frequency analysis with Python
-
Plotting histograms of Twitter data with the Jupyter Notebook
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access
Why Is Twitter All the Rage?
Most chapters won’t open ...