November 2017
Beginner to intermediate
366 pages
7h 59m
English
Let's look at the dedup function invocation once again:
> rec.pairs <- compare.dedup(RLdata500+ ,blockfld = list(1, 5:7)+ ,strcmp = c(2,3,4)+ ,strcmpfun = levenshteinSim).
The strcmp and strcmpfun parameters dictate on which fields we need to do string comparison and what kind of string comparison we need to apply. We pass a vector indicating the column IDs to strcmp. We need to do string comparisons in columns 2, 3, and 4. We want to use the Levenshtein distance to find the similarity between two strings.
Read now
Unlock full access