Classifying Documents with Respect to “Earnings” and Then Making a Predictive Model for the Target Variable Using Decision Trees, MARSplines, Naïve Bayes Classifier, and K-Nearest Neighbors with STATISTICA Text Miner
Contents
Introduction: Automatic Text Classification
Data File with File References
Saving the Extracted Word Frequencies to the Input File
Introduction: Automatic Text Classification
This example is based on the “classic” Reuters collection of documents. Specifically, 5,000 documents were selected from the Reuters-21578 database, which is a collection ...
Get Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.