One important type of unstructured data is the email. The email is common and is a part of everyday life in most modern parts of the world. Emails have certain characteristics:
· There are no rules as to what the content of an email may be (you can write anything that you want in an email)
· Most (but not all) emails are short
· Many emails are personal and have little or nothing to do with business or commerce.
In addition, emails can contain attachments. In those attachments, oftentimes, there are many useful and interesting items of information. In truth, from a business perspective, the attachments are usually more interesting than the emails themselves.