Creating our fraud detection model

HDFS is a system designed for storing massive volumes of data. In our case, we start with the 3-year banking transaction history of a fictitious customer of a bank. Our dataset includes 2,191 transactions that have resulted in the transfer of money from the customer's account to other accounts. These transactions happened using a variety of methods, such as payments at a POS terminal, direct debits, transfers from internet banking, and so on. The result of these transactions is that money leaves the account of the customer and gets credited to another account. All the times, the customer's bank wants to ensure that the money only leaves the account of the customer when the customer has authorized it. Otherwise, ...

Get Hadoop Blueprints now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.