Text Mining and Analysis by Satish Garla, Murali Pagolu, Dr. Goutam Chakraborty

Chapter 7 Content Management

Introduction

Content Categorization

Types of Taxonomy

Statistical Categorizer

Rule-Based Categorizer

Comparison of Statistical versus Rule-Based Categorizers

Determining Category Membership

Concept Extraction

Contextual Extraction

CLASSIFIER Definition

SEQUENCE and PREDICATE_RULE Definitions

Automatic Generation of Categorization Rules Using SAS Text Miner

Differences between Text Clustering and Content Categorization

Summary

Appendix

References

Introduction

In Chapter 2, we discussed how to extract content from a variety of data sources, such as websites, blogs, feeds, and local files. In this chapter, we focus on how to organize and manage the collected data based on its content. Why is content management ...
