Semantic Modeling for Data

by Panos Alexopoulos
August 2020
Beginner to intermediate
328 pages
9h 38m
English
O'Reilly Media, Inc.

Chapter 10. Bad Application

Computers are useless. They can only give you answers.

Pablo Picasso

So far we have seen pitfalls mainly related to the development of a semantic model. In this chapter, we switch perspective and look at a frequent mistake made when applying such a model in an application: assuming that, just because the model was designed for the same domain or kind of data that the application operates on, its semantics are directly applicable and beneficial to it. In reality, it can be that:

  • The application’s semantic needs seem to be covered by the model’s elements, yet there are subtle but crucial differences between them that make the model useless or even harmful

  • The application’s semantic needs are covered by the model’s elements, but the model contains additional elements that are not just redundant but actually harmful to the application

In what follows, we see how these two issues can arise in two common applications of semantic models, namely entity resolution and semantic relatedness calculation, and how we can tackle them in each case.

Bad Entity Resolution

Entity resolution is an information-extraction task that involves detecting mentions of entities within texts and mapping them to their corresponding entities in a given semantic model. For example, consider the following text from an IMDb review of the 1997 film Steel:

How’s this for diminishing returns? In BATMAN AND ROBIN, George Clooney battled Arnold Schwarzenegger. ...
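
The two steps in the definition above, detecting mentions and mapping them to entities in a model, can be sketched in Python. This is only a minimal illustration under stated assumptions: the toy model, its entity identifiers and labels, and the function name `resolve_entities` are all hypothetical, and matching is done by plain case-insensitive label lookup rather than by any technique the book prescribes.

```python
import re

# Hypothetical toy semantic model: entity IDs mapped to their labels.
# The IDs and labels are illustrative assumptions, not from a real model.
MODEL = {
    "film/batman_and_robin": ["Batman and Robin", "Batman & Robin"],
    "person/george_clooney": ["George Clooney"],
    "person/arnold_schwarzenegger": ["Arnold Schwarzenegger"],
}


def resolve_entities(text, model):
    """Return (mention, start_offset, entity_id) tuples ordered by position."""
    matches = []
    for entity_id, labels in model.items():
        for label in labels:
            # Whole-word, case-insensitive match of each label in the text.
            pattern = r"\b" + re.escape(label) + r"\b"
            for m in re.finditer(pattern, text, flags=re.IGNORECASE):
                matches.append((m.group(0), m.start(), entity_id))
    return sorted(matches, key=lambda t: t[1])


text = ("How’s this for diminishing returns? In BATMAN AND ROBIN, "
        "George Clooney battled Arnold Schwarzenegger.")

mentions = resolve_entities(text, MODEL)
for mention, start, entity_id in mentions:
    print(f"{start:3d}  {mention!r} -> {entity_id}")
```

A lookup like this treats every label match as a correct mapping. It has no way to choose between several entities that share a label, so it should be read as a starting point only.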



ISBN: 9781492054269