- Can you also tweak other hyperparameters, such as the max_df and min_df parameters in CountVectorizer? What are their optimal values?
- Practice makes perfect—another great project to deepen your understanding could be sentiment (positive/negative) classification for movie review data, which can be downloaded directly at http://www.cs.cornell.edu/people/pabo/movie-review-data/review_polarity.tar.gz, or from the page at http://www.cs.cornell.edu/people/pabo/movie-review-data/.
Exercise
Get Python Machine Learning By Example - Second Edition now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.