Skip to Content
Getting Data Right
book

Getting Data Right

by Shannon Cutt
September 2015
Beginner to intermediate content levelBeginner to intermediate
52 pages
1h 51m
English
O'Reilly Media, Inc.

Overview

Over the last 20 years, companies have invested roughly $3-4 trillion in enterprise software. These investments have been primarily focused on the development and deployment of single systems, applications, functions, and geographies targeted at the automation and optimization of key business processes. Companies are now investing heavily in big data analytics ($44 billion alone in 2014) in an effort to begin analyzing all of the data being generated from their process automation systems. But companies are quickly realizing that one of their key bottlenecks is Data Variety—the silo’d nature of the data that is a natural result of internal and external source proliferation.

The problem of big data variety has crept up from the bottom—and the cost of variety is only appreciated when companies attempt to ask simple questions across many business silos (divisions, geographies, functions, etc.). Current top-down, deterministic data unification approaches (such as ETL, ELT, and MDM) were simply not designed to scale to the variety of hundreds or thousands or even tens of thousands of data silos.

Download this free eBook to learn about the fundamental challenges that Data Variety poses to enterprises looking to maximize the value of their existing investments—and how new approaches promise to help organizations embrace and leverage the fundamental diversity of data. Readers will also find best practices for designing bottom-up and probabilistic methods for finding and managing data; principles for doing data science at scale in the big data era; preparing and unifying data in ways that complement existing systems; optimizing data warehousing; and how to use “data ops” to automate large-scale integration.

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Data Lake Maturity Model

Data Lake Maturity Model

Scott Gidley, Andy Oram
Application of Big Data for National Security

Application of Big Data for National Security

Petra Saskia Bayerl, Andrew Staniforth, Richard Hill, Hamid R. Arabnia, Gregory B. Saathoff, Babak Akhgar
Ready, Set, Curate: 8 Learning Experts Tell You How

Ready, Set, Curate: 8 Learning Experts Tell You How

Editor Ben Betts, Editor Allison Anderson

Publisher Resources

ISBN: 9781491935361Errata Page