Appendix 2: Glossary

3NF.
Third normal form (3NF) is a database schema design approach for relational databases which uses normalizing principles to reduce the duplication of data, avoid data anomalies, ensure referential integrity, and simplify data management.
ACID model.
A model applied to data for atomicity, consistency, isolation, and durability.
aggregation.
Collecting data from various databases for the purpose of data processing or analysis.
algorithm.
A mathematical formula placed in software that performs analysis on a set of data.
analytics.
Use of statistical algorithms to derive insights from data.
API (application program interface).
A set of programming standards and instructions for accessing or building web-based software applications.
big data.
A term for data sets that is so large or complex that traditional data processing applications are inadequate to deal with them.
business intelligence (BI).
The general term used for the identification, extraction, and analysis of data.
CDO.
The senior executive who bears responsibility for the firm's enterprise-wide data.
cloud computing.
A broad term that refers to any internet-based application or service that is hosted remotely.
data.
A set of values of qualitative or quantitative variables.
data catalog.
An organized inventory of data assets in an organization.
data custodian.
The person responsible for the database structure and the technical environment, including the storage of data.
data democratization. ...

Get Data Quality now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.