Chapter 6

Extracting Information from Social Media with GATE

K. Bontcheva; L. Derczynski, University of Sheffield, Sheffield, UK

Abstract

Information extraction from social media content has only recently become an active research topic, following early experiments that showed this genre to be extremely challenging for state-of-the-art algorithms. Unlike carefully authored news text and other longer content, social media content poses a number of new challenges because of its shortness, noise, strong contextual anchoring, and highly dynamic nature.

This chapter provides a thorough analysis of these problems and describes the most recent GATE algorithms developed specifically for extracting information from social media content. Comparisons against ...
