The second step in building the unstructured data warehouse is to identify all sources and potential sources of unstructured data. Here, the analyst/designer can be creative. News articles, research, hospital visits, police records, insurance claims, and so forth are all potential sources for an unstructured data warehouse, depending on its business context. Considerations at this stage include:
· The volume of data to be input
· Whether the input is available at the outset or will be arriving over time
· The structure of the input data – whether it is repetitive or nonrepetitive, how robust it is, whether it is in well-structured sentences ...
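One way to keep track of these considerations is to record them per source in a small inventory. The following is only a minimal sketch in Python; the names `SourceProfile`, `Arrival`, and `summarize`, the choice of fields, and the document counts are all hypothetical illustrations, not part of any prescribed method.

```python
from dataclasses import dataclass
from enum import Enum


class Arrival(Enum):
    """When the input becomes available (hypothetical categories)."""
    AT_OUTSET = "available at the outset"
    OVER_TIME = "arriving over time"


@dataclass(frozen=True)
class SourceProfile:
    """One candidate source of unstructured data and its observed traits."""
    name: str
    estimated_documents: int          # rough volume of data to be input
    arrival: Arrival                  # available now, or arriving over time
    repetitive: bool                  # repetitive vs. nonrepetitive structure
    well_structured_sentences: bool   # is the text in well-formed sentences?


def summarize(sources):
    """Return the total estimated volume and the names of sources
    that will keep arriving over time."""
    total = sum(s.estimated_documents for s in sources)
    ongoing = [s.name for s in sources if s.arrival is Arrival.OVER_TIME]
    return total, ongoing


# Illustrative figures only; real volumes come from the analyst's survey.
inventory = [
    SourceProfile("news articles", 50_000, Arrival.OVER_TIME, False, True),
    SourceProfile("insurance claims", 200_000, Arrival.AT_OUTSET, True, False),
]

total, ongoing = summarize(inventory)
print(total, ongoing)
```

An inventory of this kind lets the designer see at a glance how much data must be loaded initially and which sources will need a recurring load.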