Naïve Bayes and text mining

The multinomial Naïve Bayes classifier is particularly suited for text mining. Naïve Bayes is used to classify the following entities:

  • E-mails as legitimate versus spam
  • Business news stories
  • Movie reviews and scoring
  • Technical papers as per field of expertise

This third use case consists of predicting the direction of a stock, Tesla Motors Inc, (ticker symbol: TSLA) give the financial news. The features are the frequency of occurrence of some specific terms related to the stock. It is unclear how fast the investor or trader reacts to the news and influence, if any, of the value of a stock. Therefore, the delayed response time, as depicted in the following chart, should be a feature of the proposed model:

The feature market ...

Get Scala: Guide for Data Science Professionals now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.