O'Reilly logo

Hadoop for Finance Essentials by Rajiv Tiwari

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 6. Getting Experienced

If you bought this book from Amazon, then you may remember how other books on similar topics were recommended. Ever wondered how does Amazon do this in real time? They have a recommendation system based on clustering and it recommends similar or related items for sale.

In the financial industry, there is a use case that is just the opposite of this and that is fraud detection, which is to identify outliers or anything that doesn't belong to a cluster. I will discuss it in more detail in this chapter as a project.

In this chapter, I will explain low latency or real-time analytics and cover the full data lifecycle of the fraud detection project.

  • Data collection—ingesting data using Kafka, Storm, and Spark
  • Data transformation—using ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required