Chapter 6

Stage 3: Data Load

Summary

This chapter discusses the Data Load stage of the Guerrilla Analytics workflow. Data Load involves getting data from a receipt location (generally the file system) and loading it into the Data Manipulation Environment (DME). In this chapter, you will learn about the various activities that take place at Data Load. You will learn about the pitfalls and risks in these activities. You will then learn a number of practice tips to mitigate those risks.

Keywords

Data Load
Raw Files
Data Manipulation Environment
Unique Identifiers
Plain Text
Data Load Challenges

6.1. Guerrilla Analytics Workflow

Figure 16 shows the Guerrilla Analytics workflow. Data Load involves getting raw data from its storage location ...

Get Guerrilla Analytics now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.