book

Multilingual Natural Language Processing Applications: From Theory to Practice

Name: Multilingual Natural Language Processing Applications: From Theory to Practice
ISBN: 9780137047833

by Daniel Bikel, Imed Zitouni

May 2012

Beginner to intermediate

640 pages

21h 52m

English

IBM Press

Read now

Unlock full access

Title Page
Copyright Page
Register Your Book
Contact us
Dedication
Contents
Preface
Acknowledgments
About the Authors
Part I. In Theory
Chapter 1. Finding the Structure of Words
1.1. Words and Their Components1.2. Issues and Challenges1.3. Morphological Models1.4. SummaryAcknowledgmentBibliography

Chapter 2. Finding the Structure of Documents
2.1. Introduction2.2. Methods2.3. Complexity of the Approaches2.4. Performances of the Approaches2.5. Features2.6. Processing Stages2.7. Discussion2.8. SummaryBibliography
Chapter 3. Syntax
3.1. Parsing Natural Language3.2. Treebanks: A Data-Driven Approach to Syntax3.3. Representation of Syntactic Structure3.4. Parsing Algorithms3.5. Models for Ambiguity Resolution in Parsing3.6. Multilingual Issues: What Is a Token?3.7. SummaryAcknowledgmentsBibliography
Chapter 4. Semantic Parsing
4.1. Introduction4.2. Semantic Interpretation4.3. System Paradigms4.4. Word Sense4.5. Predicate-Argument Structure4.6. Meaning Representation4.7. SummaryBibliography
Chapter 5. Language Modeling
5.1. Introduction5.2. n-Gram Models5.3. Language Model Evaluation5.4. Parameter Estimation5.5. Language Model Adaptation5.6. Types of Language Models5.7. Language-Specific Modeling Problems5.8. Multilingual and Crosslingual Language Modeling5.9. SummaryBibliography
Chapter 6. Recognizing Textual Entailment
6.1. Introduction6.2. The Recognizing Textual Entailment Task6.3. A Framework for Recognizing Textual Entailment6.4. Case Studies6.5. Taking RTE Further6.6. Useful Resources6.7. SummaryBibliography
Chapter 7. Multilingual Sentiment and Subjectivity Analysis
7.1. Introduction7.2. Definitions7.3. Sentiment and Subjectivity Analysis on English7.4. Word- and Phrase-Level Annotations7.5. Sentence-Level Annotations7.6. Document-Level Annotations7.7. What Works, What Doesn’t7.8. SummaryAcknowledgmentsBibliography
Part II. In Practice
Chapter 8. Entity Detection and Tracking
8.1. Introduction8.2. Mention Detection8.3. Coreference Resolution8.4. SummaryBibliography
Chapter 9. Relations and Events
9.1. Introduction9.2. Relations and Events9.3. Types of Relations9.4. Relation Extraction as Classification9.5. Other Approaches to Relation Extraction9.6. Events9.7. Event Extraction Approaches9.8. Moving Beyond the Sentence9.9. Event Matching9.10. Future Directions for Event Extraction9.11. SummaryBibliography
Chapter 10. Machine Translation
10.1. Machine Translation Today10.2. Machine Translation Evaluation10.3. Word Alignment10.4. Phrase-Based Models10.5. Tree-Based Models10.6. Linguistic Challenges10.7. Tools and Data Resources10.8. Future Directions10.9. SummaryBibliography
Chapter 11. Multilingual Information Retrieval
11.1. Introduction11.2. Document Preprocessing11.3. Monolingual Information Retrieval11.4. CLIR11.5. MLIR11.6. Evaluation in Information Retrieval11.7. Tools, Software, and Resources11.8. SummaryAcknowledgmentsBibliography
Chapter 12. Multilingual Automatic Summarization
12.1. Introduction12.2. Approaches to Summarization12.3. Evaluation12.4. How to Build a Summarizer12.5. Competitions and Data Sets12.6. SummaryBibliography
Chapter 13. Question Answering
13.1. Introduction and History13.2. Architectures13.3. Source Acquisition and Preprocessing13.4. Question Analysis13.5. Search and Candidate Extraction13.6. Answer Scoring13.7. Crosslingual Question Answering13.8. A Case Study13.9. Evaluation13.10. Current and Future Challenges13.11. Summary and Further ReadingAcknowledgmentsBibliography
Chapter 14. Distillation
14.1. Introduction14.2. An Example14.3. Relevance and Redundancy14.4. The Rosetta Consortium Distillation System14.5. Other Distillation Approaches14.6. Evaluation and Metrics14.7. SummaryBibliography
Chapter 15. Spoken Dialog Systems
15.1. Introduction15.2. Spoken Dialog Systems15.3. Forms of Dialog15.4. Natural Language Call Routing15.5. Three Generations of Dialog Applications15.6. Continuous Improvement Cycle15.7. Transcription and Annotation of Utterances15.8. Localization of Spoken Dialog Systems15.9. SummaryBibliography
Chapter 16. Combining Natural Language Processing Engines
16.1. Introduction16.2. Desired Attributes of Architectures for Aggregating Speech and NLP Engines16.3. Architectures for Aggregation16.4. Case Studies16.5. Lessons Learned16.6. Summary16.7. Sample UIMA CodeBibliography
Index

Content preview from Multilingual Natural Language Processing Applications: From Theory to Practice

Chapter 8. Entity Detection and Tracking

Xiaoqiang Luo and Imed Zitouni

8.1. Introduction

Information extraction (IE) is the task of identifying and extracting useful textual information from natural language documents. While “usefulness” is user- and application-dependent, we often care about “who did what to whom at when and/or for what reason (why)” from the input document. Clearly, the scope of information extraction can be arbitrarily broad, and sometimes it may even require world knowledge. To make problems tractable, we focus on two subtasks in this chapter:

1. detecting mentions from a document and identifying mentions’ attributes: a mention is a text chunk identifying a physical object (e.g., a person or an organization);

2. grouping ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Transfer Learning for Natural Language Processing

Publisher Resources

ISBN: 9780137047833Purchase book

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Multilingual Natural Language Processing Applications: From Theory to Practice

by Daniel Bikel, Imed Zitouni