Just got a client list from a defunct competitorThe dirty secret of data analysisHead First Head Hunters wants the list for their sales teamCleaning messy data is all about preparationOnce you’re organized, you can fix the data itselfUse the # sign as a delimiterExcel split your data into columns using the delimiterUse SUBSTITUTE to replace the carat characterYou cleaned up all the first namesThe last name pattern is too complex for SUBSTITUTEHandle complex patterns with nested text formulasR can use regular expressions to crunch complex data patternsThe sub command fixed your last namesNow you can ship the data to your clientMaybe you’re not quite done yet...Sort your data to show duplicate values togetherThe data is probably from a relational databaseRemove duplicate namesYou created nice, clean, unique recordsHead First Head Hunters is recruiting like gangbusters!Leaving town...It’s been great having you here in Dataville!