Skip to Content
The Self-Service Data Roadmap
book

The Self-Service Data Roadmap

by Sandeep Uttamchandani
September 2020
Beginner to intermediate
284 pages
7h 40m
English
O'Reilly Media, Inc.
Content preview from The Self-Service Data Roadmap

Chapter 3. Search Service

So far, given a dataset, we are able to gather the required metadata details to correctly interpret the properties and meaning of the attributes. The next challenge is, given thousands of datasets across enterprise silos, how we effectively locate the attributes required to develop the insight. For instance, when developing a revenue dashboard, how do we locate datasets of existing customers, products they use, pricing and promotions, activity, usage profiles, and so on? Further, how do we locate artifacts such as metrics, dashboards, models, ETLs, and ad hoc queries that can be reused in building the dashboard? This chapter focuses on finding the relevant datasets (tables, views, schema, files, streams, and events) and artifacts (metrics, dashboards, models, ETLs, and ad hoc queries) during the iterative process of developing insights.

A search service simplifies the discovery of datasets and artifacts. With a search service, data users express what they are looking for using keywords, wildcard searches, business terminology, and so on. Under the hood, the service does the heavy lifting of discovering sources, indexing datasets and artifacts, ranking results, ensuring access governance, and managing continuous change. Data users get a list of datasets and artifacts that are most relevant to the input search query. The success criteria for such a service is reducing the time to find. Speeding up time to find significantly improves time to insight, as ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Data Management at Scale

Data Management at Scale

Piethein Strengholt
Data Mesh

Data Mesh

Zhamak Dehghani
The Enterprise Data Catalog

The Enterprise Data Catalog

Ole Olesen-Bagneux

Publisher Resources

ISBN: 9781492075240Errata Page