Pranav RastogiMark Balkenende

Sponsored by


How to use machine learning to scale data quality

Date: This event took place live on November 16 2017

Presented by: Pranav Rastogi, Mark Balkenende

Duration: Approximately 60 minutes.

Cost: Free

Questions? Please send email to


Free - Register Now

Machine learning helps pinpoint errors in large datasets for cleansing before entering the analytics pipeline. This webcast shows you how to set it up.

Big data brings tremendous opportunity to better target customers and improve operations. Yet, data-driven insights are only as good and trusted as the data going into them.

Find out how you can build data quality into your structured, semi-structured, or unstructured data on Microsoft Azure Data Lake Store and HDInsight using Talend's native support for Spark machine learning algorithms.

Join Microsoft and Talend to see how to:

  • Process data faster using Talend's native support for Spark on HDInsight
  • Quickly import bulk data into Azure Data Lake Store
  • Deploy Spark machine learning to match and dedupe records at scale
  • Enable best practices for data quality using Talend Data Stewardship

About Mark Balkenende, Sales Solution Architects Manager at Talend

Mark Balkenende is a Sales Solution Architects Manager at Talend. Prior to joining Talend, Mark has had a long career of mastering and integrating data at a number of companies, including Motorola, Abbott Labs, and Walgreens. Mark holds an Information Systems Management degree and is also an extreme cycling enthusiast.

About Pranav Rastogi, Program Manager at Microsoft Azure

Microsoft – Pranav Rastogi is a Program Manager in Microsoft Azure. He focusses on Azure HDInsight, a managed cloud Apache Hadoop & Spark offering that gives you optimized open-source analytical clusters for running open source projects. He spends most of his time making it easier for customers to leverage the big data ecosystem to complete big data solutions to meet their enterprise needs.

Free - Register Now