11. Metadata in Unstructured Data

A level of abstraction sits above all information processing, including the processing of unstructured data. That level is commonly called metadata, and it sits above unstructured data just as it sits above structured data. Metadata lets us see the “larger picture” of what is going on in the systems and components of information processing.

A simplistic definition of metadata, one that has been around for as long as there have been information systems, is “data about data.” Although that definition gives a flavor of what metadata is, it is not a good one. A somewhat better description is that metadata is an abstraction or a classification of data.
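To make the idea of metadata as a classification of data concrete, here is a minimal sketch in Python. It is an illustration only, not a method taken from the text: the `TextMetadata` class, the `describe` function, and the `CATEGORY_KEYWORDS` map are all hypothetical names invented for this example, and the keyword matching is deliberately naive.

```python
from dataclasses import dataclass, field

# Hypothetical keyword-to-category map (an assumption for illustration);
# a real classification scheme would be far richer.
CATEGORY_KEYWORDS = {
    "finance": {"invoice", "payment", "refund"},
    "support": {"complaint", "error", "help"},
}


@dataclass
class TextMetadata:
    """Metadata describing one piece of unstructured text."""
    source: str                      # where the text came from
    word_count: int                  # a simple descriptive measure
    categories: list[str] = field(default_factory=list)  # classification


def describe(text: str, source: str) -> TextMetadata:
    """Derive metadata from raw text without altering the text itself."""
    # Normalize each token: strip surrounding punctuation and lowercase it.
    words = [w.strip(".,;:!?\"'").lower() for w in text.split()]
    words = [w for w in words if w]
    # A category applies when any of its keywords appears in the text.
    categories = sorted(
        name
        for name, keywords in CATEGORY_KEYWORDS.items()
        if keywords & set(words)
    )
    return TextMetadata(source=source, word_count=len(words), categories=categories)


email = "I need help: the refund for my last invoice never arrived."
meta = describe(email, source="email")
print(meta)
```

The text of the email is the data. The `TextMetadata` record sits one level above it: it says nothing the email does not already contain, but it abstracts the email into a source, a size, and a set of categories that can be compared across many documents.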

Metadata in the ...
