Skip to Content
Win with Advanced Business Analytics: Creating Business Value from Your Data
book

Win with Advanced Business Analytics: Creating Business Value from Your Data

by Jean Paul Isson, Jesse Harriott
October 2012
Beginner to intermediate
398 pages
10h 22m
English
Wiley
Audiobook available
Content preview from Win with Advanced Business Analytics: Creating Business Value from Your Data

WHAT IS UNSTRUCTURED DATA ANALYTICS?

Before we discuss unstructured data analytics in more detail, let’s first define what is meant by “unstructured data.” Unstructured data or unstructured information refers to information that does not have a predefined data model and/or does not fit well into relational database tables. Unstructured data typically have no identifiable structure and may include bitmap images/objects, text, and other data types that are not part of a typical database. Unstructured information is frequently text-heavy but may contain data such as dates, numbers, and facts as well. This results in irregularities and ambiguities, making it difficult to understand through the use of traditional computer programs, as compared to data stored in fielded forms in traditional relational databases or annotated (semantically tagged) in documents. Unstructured data cannot easily be analyzed with traditional analytics techniques.2

Unstructured data analytics first emerged in the late 1990s as “text mining.” Early approaches treated and analyzed text as a bag of words. Text mining evolved early to use basic shallow linguistics to handle variant word forms, such as abbreviations, plurals, and conjugations, as well as multiword terms known as n-grams. N-grams are a contiguous sequence of items from a sequence of text or speech. The items in question can be phonemes, syllables, letters, or words, depending on the application. An n-gram text analytics model is a type of probabilistic ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Data Driven Business Transformation

Data Driven Business Transformation

Peter Jackson, Caroline Carruthers
Data Driven Business Transformation

Data Driven Business Transformation

Peter Jackson, Caroline Carruthers
Reporting, Predictive Analytics, and Everything in Between

Reporting, Predictive Analytics, and Everything in Between

Brett Stupakevich, David Sweenor, Shane Swiderek

Publisher Resources

ISBN: 9781118417089Purchase book