4Digital Archiving and Big Data
4.1. Introduction
In recent years, we have witnessed an explosion of digital data and documents that is constantly evolving in all areas of activity. The rate at which this data is produced is frantic.
Every day, companies, especially with the expansion of the number of Internet users and the emergence of the Internet of Things as a hypernetwork connecting actors, artifacts, writings and concepts [SAL 18] (see Figure 4.1), produce large amounts of data that can be measured in zetta or petabytes1.
Indeed, “individuals, threatened by infobesity, interconnected and interacting with objects, are generating an exponential amount of data and digital traces via computers, smartphones, tablets, cameras, sensors, implants, electric counters, autonomous cars, connected toys, cryptocurrency, etc.”. “Campaigns to digitize heritage sources (paper, photographic, cinematographic and radio archives, etc.) are contributing to this boom” [BOY 17].
Although some companies today are able to store this voluminous data, their integration, handling, management and operation remain very complicated. This has forced researchers to find new methods, new infrastructures and new orders of magnitude for acquiring, storing, analyzing, sharing, retrieving and presenting this data. This is how “Big Data” was born.
Figure 4.1. Exponential expansion of the data exchanged over ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access