image

Thus the multistage decision property can be written in general as

Entropy(p,q,r)=entropy(p,q+r)+(q+r)entropy(qq+r,rq+r)

image

where p+q+r=1image.

Because of the way the log function works, you can calculate the information measure without having to work out the individual fractions:

Info([2,3,4])=2/9×log2/93/9×log3/94/9×log4/9=[2log23log34log4+9log9]/9.

This is the way that the information measure is usually calculated in practice. So the information ...

Get Data Mining, 4th Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.