Chapter 8. Sentiment Analyser Application for Movie Reviews

In this chapter, we describe an application to determine the sentiment of movie reviews using algorithms and methods described throughout the book. In addition, the Scrapy library will be used to collect reviews from different websites through a search engine API (Bing search engine). The text and the title of the movie review is extracted using the newspaper library or following some pre-defined extraction rules of an HTML format page. The sentiment of each review is determined using a naive Bayes classifier on the most informative words (using the X2 measure) in the same way as in Chapter 4, Web Mining Techniques. Also, the rank of each page related to each movie query is calculated ...

Get Machine Learning for the Web now with the O’Reilly learning platform.

O’Reilly members experience live online training, plus books, videos, and digital content from nearly 200 publishers.