Ian Howells, Billie Rinaldi, Arshak Navruzyan

Beating Billion Dollar Fraud Using Anomaly Detection

A Signal Processing Approach using Argyle Data on the Hortonworks Data Platform with Accumulo

Date: This event took place live on October 8, 2014

Presented by: Ian Howells, Billie Rinaldi, Arshak Navruzyan

Duration: Approximately 60 minutes.

Cost: Free

Questions? Please send email to


The Association of Certified Fraud Examiners (ACFE) found that a typical organization loses 5% of revenues to fraud each year. The more connected the industry, the more potential there is for fraud. The mobile industry alone loses over $46 billion annually to fraud. When it comes to detecting fraud in the financial services industry, 16% take up to 8 hours to detect it and 22% do not know the average time for detection.

Traditional approaches to combating fraud often bring together a patchwork quilt of database, ETL (Extract, Transform and Load) and BI technologies working in batch mode. The problem is that this delivers insights that are hours or days behind. Unfortunately, fraudsters work in real time and often complete their job in under a day.

This presentation will review the approach Argyle Data has taken to develop a real-time fraud analytics application that uses anomaly detection at scale, building on open source technology developed at the NSA (Accumulo) and Facebook (Prestodb) and running on the Hortonworks Data Platform.

The presentation will review:


  • The Financial Scale of Fraud
  • Types of Mobile Fraud
  • The Scalability Required to Track Fraud in a Mobile Network


  • What is Accumulo
  • Accumulo on the Hortonworks Data Platform

Argyle Data

  • Creating a Packet Level Data Lake
  • Using a Signal Processing Approach for Anomaly Detection at Petabyte Scale

About Dr. Ian Howells, Chief Marketing Officer, Argyle Data

Ian Howells is a passionate technologist and serial entrepreneur with over 25 years of experience building successful technology companies through IPO and acquisition. Prior to joining Argyle Data, Ian held senior marketing roles at Documentum from its early days through to IPO, SeeBeyond through its IPO and acquisition by Sun, Alfresco from a startup to the largest private open source company in the world, StorSimple, acquired by Microsoft, and Couchbase through a period of 400% growth. Ian has a Ph.D. in distributed databases and has published a number of papers and contributed to books on related topics. Ian has a passion for applying big data analytics techniques to marketing and open source. He previously penned the blog "Open Source Hearts and Minds" for Computerworld.

About Dr. Billie Rinaldi, Sr. Member of Technical Staff, Hortonworks

Billie Rinaldi currently prototypes new features related to application monitoring and deployment in the Apache Hadoop ecosystem in her position at Hortonworks, Inc. She is co-authoring the O'Reilly book "Accumulo: Application Development, Table Design, and Best Practices" that is now in early release. Prior to August 2012, Billie engaged in big data science and research at the National Security Agency. Since 2008, she has been providing technical leadership regarding the software that is now Apache Accumulo. Billie is the VP of Apache Accumulo, the Accumulo Project Management Committee Chair, and a member of the Apache Software Foundation. She holds a Ph.D. in applied mathematics from Rensselaer Polytechnic Institute.

About Arshak Navruzyan, VP Product Management at Argyle Data

Arshak Navruzyan is a passionate technologist with a background in distributed systems and machine learning. Prior to joining Argyle, Arshak served as vice president of product management at Alpine Data Labs, where he focused on bringing modern machine learning techniques to Hadoop. Previously, Arshak held senior technical and product management roles at Endeca and Oracle. Arshak founded and runs the popular San Francisco Machine Learning meetup.