Chapter 7 Content Management

Introduction

Content Categorization

Types of Taxonomy

Statistical Categorizer

Rule-Based Categorizer

Comparison of Statistical versus Rule-Based Categorizers

Determining Category Membership

Concept Extraction

Contextual Extraction

CLASSIFIER Definition

SEQUENCE and PREDICATE_RULE Definitions

Automatic Generation of Categorization Rules Using SAS Text Miner

Differences between Text Clustering and Content Categorization

Summary

Appendix

References

Introduction

In Chapter 2, we discussed how to extract content from a variety of data sources, such as websites, blogs, feeds, and local files. In this chapter, we focus on how to organize and manage the collected data based on its content. Why is content management ...
