Building the Unstructured Data Warehouse: Architecture, Analysis, and Design by Krish Krishnan, W. H. Inmon

In the preceding section, we looked in depth at the processing of structured data and its associated challenges. Let us now look at the challenges of processing unstructured data, which can be classified into Data, Acquisition, Processing, Storage, Integration, Usage, Volume, and Workload:

·         Data. In the unstructured data world, data needs to be processed from acquisition through integration, just as it is in the structured data world.

·         Acquisition. In the unstructured world, data can be sourced from microblogs (such as Twitter), documents, emails, manuals, notes, speech-to-text, video-to-text, and much more. Based on the source of the data, there will be data quality issues arising from conversions ...
