O'Reilly logo

Learning Real-time Processing with Spark Streaming by Sumit Gupta

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Data loading from distributed and varied sources

In large enterprises, log file analysis is one of the popular use cases. Architects/business analysts and all other stake holders always want to analyze the logs of various activities like events, security, access, and so on and uncover the hidden patterns. For example, the web logs from a popular user interfacing application (a website or portal) can easily provide you with the following information:

  • Most popular pages: Frequently visited pages
  • Type of browsers or user agent used by consumers to visit the website
  • Origin of the user: Users' referrer
  • Final status of the user request (HTTP status codes): Successful (200), broken links (404), redirection (301), and many more

Consider another example where ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required