Index

A
Access, command line, 230–232
  append, files, 231
  availability of, 231
  chain together commands, 231
  count, number of words and lines in file, 231
  download web pages, 231
  edit large text files, 231
  examine, beginning or end of file, 231
  find patterns, 231
  find, duplicate rows in file, 231
  join, files horizontally, 231
  list files and directories, 231
  sort data files, 231
Analytics code, See Analytics coding
Analytics coding, 29, 78, 126, 137
  break up data flows into data steps, 85–86
  clean data minimum of locations in data flow, 93–94
  cleaning a data field, keep original raw field, 94–95
  clearly label running order of code files, 83–84
  consistent cleaning, 126
  create unique identifier for records, 98–99
  data provenance, 91
  don’t jump in ...