Ben SharmaVikram Sreekanti

Sponsored by

Zaloni

Understanding Metadata: Why it's essential to your big data solution and how to manage it well

Date: This event took place live on June 21 2016

Presented by: Ben Sharma, Vikram Sreekanti

Duration: Approximately 60 minutes.

Questions? Please send email to

Description:

Metadata is essential for managing, migrating, accessing, and deploying a big data solution. Without it, enterprises have limited visibility into the data itself and cannot trust in its quality—negating the value of data in the first place. Creating end-to-end data visibility allows you to keep track of data, enable search and query across big data systems, safeguards your data, and reduces risk.

In this O'Reilly webcast, Ben Sharma (cofounder and CEO of Zaloni) and Vikram Sreekanti (software engineer in the AMPLab at UC Berkeley) discuss the value of collecting and analyzing metadata, and its potential to impact your big data solution and your business.

Attendees will learn how access to your data's lineage allows you to know where data has come from, where it is, and how it is being used. We'll also take a deep dive into a new open-source project under development at U.C. Berkeley — Ground. Ground is a data context system that enables users to uncover what data they have, where the data is flowing to and from, who is using the data, and when and how it changes. We'll explore how data context stretches the bounds of what we have traditionally considered metadata.

Key topics will include:

  • The role of metadata in data analysis
  • Key considerations for managing metadata
  • How to establish data lineage and provenance, in order to create a repeatable process
  • How Ground is making an impact on a wide range of data tasks, including data inventory, usage tracking, model-specific interpretation, reproducibility, interoperability, and collective governance.
  • Initial work on Ground, and how this data context system is making an impact on a wide range of data tasks, including: data inventory, usage tracking, model-specific interpretation, reproducibility, interoperability, and collective governance.

About Ben Sharma, Co-Founder & CEO – Zaloni

Ben Sharma is CEO and co-founder of Zaloni. He is a passionate technologist and thought leader in big data, analytics and enterprise infrastructure solutions. Having previously worked in technology leadership at NetApp, Fujitsu and others, Ben's expertise ranges from business development to production deployment in a wide array of technologies including Hadoop, HBase, databases, virtualization and storage. Ben is co-author of Architecting Data Lakes and Java in Telecommunications. He holds two patents.

About Vikram Sreekanti, Software Engineer – AMPLab, UC Berkeley

Vikram Sreekanti is a software engineer working on research in the AMPLab at UC Berkeley. A graduate of Berkeley's computer science department, he will begin his Ph.D. in Fall 2016, working with Joe Hellerstein.