Description:

Machine learning helps pinpoint errors in large datasets for cleansing before entering the analytics pipeline. This webcast shows you how to set it up.

Big data brings tremendous opportunity to better target customers and improve operations. Yet, data-driven insights are only as good and trusted as the data going into them.

Find out how you can build data quality into your structured, semi-structured, or unstructured data on Microsoft Azure Data Lake Store and HDInsight using Talend's native support for Spark machine learning algorithms.

Join Microsoft and Talend to see how to:

Process data faster using Talend's native support for Spark on HDInsight
Quickly import bulk data into Azure Data Lake Store
Deploy Spark machine learning to match and dedupe records at scale
Enable best practices for data quality using Talend Data Stewardship

About Mark Balkenende, Sales Solution Architects Manager at Talend

Mark Balkenende is a Sales Solution Architects Manager at Talend. Prior to joining Talend, Mark has had a long career of mastering and integrating data at a number of companies, including Motorola, Abbott Labs, and Walgreens. Mark holds an Information Systems Management degree and is also an extreme cycling enthusiast.

About Pranav Rastogi, Program Manager at Microsoft Azure

Microsoft – Pranav Rastogi is a Program Manager in Microsoft Azure. He focusses on Azure HDInsight, a managed cloud Apache Hadoop & Spark offering that gives you optimized open-source analytical clusters for running open source projects. He spends most of his time making it easier for customers to leverage the big data ecosystem to complete big data solutions to meet their enterprise needs.

Description:

About Mark Balkenende, Sales Solution Architects Manager at Talend

About Pranav Rastogi, Program Manager at Microsoft Azure

About O'Reilly

Community

Partner Sites

Shop O'Reilly