If you bought this book from Amazon, you may remember seeing recommendations for other books on similar topics. Ever wondered how Amazon does this in real time? It uses a recommendation system based on clustering, which suggests similar or related items for sale.
In the financial industry, there is a use case that is just the opposite: fraud detection, which identifies outliers, that is, anything that doesn't belong to a cluster. I will discuss it in more detail in this chapter as a project.
In this chapter, I will explain low-latency, or real-time, analytics and cover the full data lifecycle of the fraud detection project.