1.2

The Data Infrastructure

Abstract

In corporate data there are two types of repetitive data - DBMS managed repetitive data and in Big Data there is repetitive data found inside an unstructured block of data. Repetitive data managed under a standard DBMS contains records and rows of data accessed by indexes. Repetitive data managed by Big data contains unstructured blocks of data in which rows of data are logically stored. Repetitive data managed by a DBMS optimizes the speed with which data can be accessed and repetitive data managed by Big data optimizes the ease with which data can be stored.

Keywords

DBMS
repetitive data
Big Data
records
blocks
parsing optimal access of data versus optimal ease of adding data
If there is any secret ...

Get Data Architecture: A Primer for the Data Scientist now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.