Skip to Main Content
Working with Text
book

Working with Text

by Emma Tonkin, Gregory J.L Tourte
July 2016
Intermediate to advanced content levelIntermediate to advanced
344 pages
10h 11m
English
Chandos Publishing
Content preview from Working with Text
Chapter 7

Newton

Building an Authority-Driven Company Tagging and Resolution System

M. Thomas*; H. Bretz*; T. Vacek*; B. Hachey; S. Singh*; F. Schilder*    * Thomson Reuters, NYC, NY, USA University of Sydney, Sydney, Australia

Abstract

We describe an entity detection and resolution system called Newton that is being used to identify company names in Reuters news articles and ground the mention text to a company authority database. The system is required to be fast and precise on arbitrary web news sources. We introduce an infrastructure for authority-driven lookup-tagging followed by joint mention and disambiguation classification using a support vector machine. Performance on a corpus of 70k automatically annotated documents from the ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Communicate with Teams More Effectively

Communicate with Teams More Effectively

Charles Humble

Publisher Resources

ISBN: 9781780634302