10. Theory

Source code differentiation is the measure of the similarity of two sets of source code based on the number of lines of code that match completely as a fraction of the total number of lines of code. There are other ways of measuring the similarities and differences between sets of source code, and each way has its own advantages and disadvantages. Source code correlation, for example, is discussed in detail in Part V of this book. Source code differentiation is particularly useful for finding and measuring the amount of code that has been directly copied from one program to another, or one program version to another version, without modification.

In addition to measuring the similarity of two sets of code, source code differentiation ...

Get The Software IP Detective’s Handbook: Measurement, Comparison, and Infringement Detection now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.