Chapter 5. Content Management and Unstructured Data

Beyond the realm of information held in traditional databases lies the world of unstructured data. The information we deal with comes in these forms, among others:

  • Notes

  • E-mails

  • Faxes

  • Word processing documents

  • Spreadsheets

  • Images and other graphics

  • Invoices and other hard-copy output

  • Web pages

  • Audio

  • Video

We have all worked with various portals and end-user interfaces to this critical volume of information. Suppose we see an article about a competitor that has had some success, such as an increase in market share or a new product. Up goes the warning flag, and we try to ferret out more information from our competitive database. Oops, we don't have one!

A search of several sources that publish ...
