Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon

Reading unstructured data means handling data stored in many types of files. Common file types that can serve as input to the unstructured integration engine include:

· PowerPoint files - .ppt
· Portable document format files - .pdf
· Text files - .txt
· Document files - .doc
· Excel spreadsheets - .xls
· Email files

In addition to the above common file types that contain unstructured data, there are many file types that can be read as a .txt file.
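
As a rough illustration of how an input stage might recognize the file types listed above, the following minimal Python sketch maps file extensions to a type label and offers a plain-text fallback reader. The names `KNOWN_TYPES`, `classify_input`, and `read_as_text` are hypothetical and are not taken from the book. Email files are left out of the lookup because the list gives no extension for them, and the UTF-8 decoding in the fallback is an assumption.

```python
from pathlib import Path

# Extensions come from the list above; the labels are illustrative only.
KNOWN_TYPES = {
    ".ppt": "PowerPoint",
    ".pdf": "Portable document format",
    ".txt": "Text",
    ".doc": "Document",
    ".xls": "Excel spreadsheet",
}


def classify_input(filename: str) -> str:
    """Return a type label for a file based on its extension (hypothetical helper)."""
    suffix = Path(filename).suffix.lower()
    return KNOWN_TYPES.get(suffix, "Unknown")


def read_as_text(path: str) -> str:
    """Read a file as plain text (assumes UTF-8), replacing undecodable bytes."""
    return Path(path).read_text(encoding="utf-8", errors="replace")
```

A file whose extension is not in the table comes back as "Unknown"; a caller could then try `read_as_text` on it, in the spirit of treating other file types as .txt input.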

In many file types, unstructured data is only one component; the file holds other kinds of data as well. PowerPoint files contain a good example of this separation of the file type ...