Chapter 3
Understanding Networking Data Sources
This chapter begins to examine the complexities of networking data. Understanding and preparing all the data coming from the IT infrastructure is part of the data engineering process within analytics solution building. Data engineering involves the setup of data pipelines from the data source to the centralized data environment, in a format that is ready for use by analytics tools. From there, data may be stored, shared, or streamed into dedicated environments where you perform data science analysis. In most cases, there is also a process of cleaning up or normalizing data at this layer. ETL (Extract, Transform, Load) is a carryover acronym from database systems that were commonly used at the data ...
Get Data Analytics for IT Networks: Developing Innovative Use Cases, First Edition now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.