
Text Classification Using Python 219
FIGURE 9.12: SVM scikit-learn model most informative features.
9.4 Conclusions
This chapter was a whistlestop tour through binary text classification using Python,
NLTK, and scikit-learn. Although testing was not exhaustive, it was found that the scikit-
learn Linear SVC classifier provided results that were slightly better than NLTK and scikit-
learn Na¨ıve Bayes.
Only the surface of these tools has been scratched and hopefully this chapter encourages
you to explore each tool in detail.
Bibliography
[1] The official home of the python programming language.
[2] The natural language toolkit.
[3] The official home of numpy: