Data Lakes

by Anne Laurent, Dominique Laurent, Cédrine Madera
June 2020
Beginner to intermediate
244 pages
5h 40m
English
Wiley-ISTE

Glossary

This glossary gathers definitions of terms used in this book. These definitions are borrowed from the bibliographical sources and from the corporate terminology in use at IBM.

Business Intelligence

Business intelligence (BI) refers to the strategies and technologies that organizations use to analyze business information. Historically, it was synonymous with data warehousing; today, however, it more often refers to the analysis and reporting of data once it has been curated in a data warehouse or other analytical data platform. Similarly, BI tools are the software that business functions primarily use for reporting, visualization, dashboard creation and data analysis. These tools are typically used against data that has already been prepared for reporting and analysis (data marts); in contrast, data science involves a measure of data manipulation and, in some cases, acquiring data before analysis. In addition to statistical data analysis, data science may also involve aspects of machine learning.

Data Architecture

Data architecture plays an increasingly important role and has evolved to consider all areas of data management, not just relational database storage. It defines an organization’s data strategy, covering decisions on the different types of data store (relational, NoSQL), data integration strategies (messaging, streaming, API, batch files) and data security. Data architecture also encompasses the design of data stores (data modeling) and defines ...

Publisher Resources

ISBN: 9781786305855