Chapter 7. The Real-Time Hub
As organizational data grows, users face challenges in locating the information they need, which leads to reduced productivity, data duplication, and the introduction of errors. To mitigate these issues, you need to create a centralized repository for data discovery. You can do this with a metadata catalog, or simply a data catalog, which provides a comprehensive record of all metadata within the data platform.
A well-maintained data catalog provides valuable context on data origin, usage, and relationships with other datasets. Additionally, it can support data governance, to ensure compliance with regulatory requirements and enhance data security by controlling access to sensitive information.
In Microsoft Fabric, the Real-Time hub is the designated place for discovering all of your organization’s data-in-motion. It provides a unified view of real-time data streams that centralizes data discovery to improve responsiveness and agility in decision-making processes.
The Real-Time hub also has two other functions. First of all, it serves as a starting point for users to understand which sources they can connect to, and it also provides quick access so users can get started capturing and analyzing data-in-motion. Second, the Real-Time hub enables an event-driven architecture (EDA), in which the flow of data through the layers of the data platform (such as ETL pipelines) is driven by events (such as user actions, sensor outputs, and messages from other ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access