Chapter 6

Stage 3: Data Load

Summary

This chapter discusses the Data Load stage of the Guerrilla Analytics workflow. Data Load involves getting data from a receipt location (generally the file system) and loading it into the Data Manipulation Environment (DME). In this chapter, you will learn about the various activities that take place at Data Load. You will learn about the pitfalls and risks in these activities. You will then learn a number of practice tips to mitigate those risks.

Keywords

Data Load
Raw Files
Data Manipulation Environment
Unique Identifiers
Plain Text
Data Load Challenges

6.1. Guerrilla Analytics Workflow

Figure 16 shows the Guerrilla Analytics workflow. Data Load involves getting raw data from its storage location ...

Get Guerrilla Analytics now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.