Skip to Content
The AI Ladder
book

The AI Ladder

by Rob Thomas, Paul Zikopoulos
April 2020
Intermediate to advanced
223 pages
7h 8m
English
O'Reilly Media, Inc.
Content preview from The AI Ladder

Chapter 6. Collect Your Data

In the preceding chapter we talked about the process of modernizing your entire data infrastructure to make it one integrated, efficient platform. A consistently audited, asset-optimized, and instrumented-in-IT infrastructure is essential for flexible and cost-efficient operation in the AI-centric world.

You may have noticed that there were a few important things we didn’t talk about in that chapter. For example, we haven’t talked about how you acquire data. Nor have we talked about the quality of the data that resides in that platform, or how to make it available to the AI processes and programs that may require it or benefit from it. We certainly haven’t talked about how to get the data you already have under control. Many companies think “Of course we have data,” only to find that there are many reasons why this data isn’t really accessible. Those reasons may be technical, political, regulatory, or some combination of the three—but they’re real. We always tell people that big data without analytics is…well…just a bunch of data. Never forget: data may be an asset, but it’s a valueless asset if you can’t use it.

In this chapter we will talk about getting access to all relevant data and evaluating its utility. We’ll look at ways to consolidate data sources, because at far too many companies data resides in departmental silos that prevent it from being used effectively. We’ll discuss the pros and cons of combining data sources into “data lakes” and ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

What Is Generative AI?

What Is Generative AI?

Kyle Stratis
AI Engineering

AI Engineering

Chip Huyen

Publisher Resources

ISBN: 9781492073420Errata Page