Chapter 10.1

Nonrepetitive Data


Nonrepetitive analytics begins with the contextualization of the nonrepetitive data. Unlike repetitive data, the context of nonrepetitive data is difficult to determine. The context of nonrepetitive big data is determined by textual disambiguation. In textual disambiguation, there are algorithms that relate to stop word resolution, stemming, homographic resolution, inline contextualization, taxonomy/ontology resolution, custom variable resolution, acronym resolution, and so forth. Nonrepetitive analytics is very relevant to business value. Some typical forms of nonrepetitive analytics include the analysis of medical records, warranty analysis, insurance claim analysis, and call center analysis.

Get Data Architecture: A Primer for the Data Scientist, 2nd Edition now with the O’Reilly learning platform.

O’Reilly members experience live online training, plus books, videos, and digital content from nearly 200 publishers.