The Data Infrastructure
Abstract
In corporate data there are two types of repetitive data - DBMS managed repetitive data and in Big Data there is repetitive data found inside an unstructured block of data. Repetitive data managed under a standard DBMS contains records and rows of data accessed by indexes. Repetitive data managed by Big data contains unstructured blocks of data in which rows of data are logically stored. Repetitive data managed by a DBMS optimizes the speed with which data can be accessed and repetitive data managed by Big data optimizes the ease with which data can be stored.
Keywords
Get Data Architecture: A Primer for the Data Scientist now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.