August 2014
Beginner to intermediate
304 pages
7h 10m
English
Swapping the words around a verb can eliminate the passive voice from particular phrases. For example, the book was great can be transformed into the great book. This kind of normalization can also help with frequency analysis, by counting two apparently different phrases as the same phrase.
In transforms.py is a function called swap_verb_phrase(). It swaps the right-hand side of the chunk with the left-hand side, using the verb as the
pivot point. It uses the first_chunk_index() function defined in the previous recipe to find the verb to pivot around.
def swap_verb_phrase(chunk): def vbpred(wt): word, tag = wt return tag != 'VBG' and tag.startswith('VB') and len(tag) > 2 vbidx = first_chunk_index(chunk, vbpred) ...Read now
Unlock full access