3.5 Real Data-Based ROC Analysis

In real applications only a limited number of samples are available for data analysis, referred to as the power of the test. In this case, the data sample pool is generally not sufficiently large to constitute reliable statistics that can be used to characterize the LRT Λ(r) implemented by a detector. Under such a circumstance there is no effective means of producing Λ(r) and the ROC analysis must be carried out with data samples rather than statistics, p0(r) and p1(r).

3.5.1 How to Generate ROC Curves from Real Data

In what follows, we define

N = total number of data samples used for a particular detection method (technique)

Nsignal = total number of data samples with presence of a signal (according to ground truth)

Nno-signal = total number of data samples with absence of a signal (according to ground truth)

ND = total number of data samples with presence of a signal which is actually detected by the method

NF = total number of data samples with absence of a signal, but claimed to have an signal detected by the method

NM = total number of data samples with presence of a signal which is not detected by the method

NTN = total number of data samples with presence of a signal and also claimed to have no signal detected by the method.

False alarm or false positive rate/probability is defined by

(3.10) equation

False negative or miss rate/probability:

(3.11) ...

Get Hyperspectral Data Processing: Algorithm Design and Analysis now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.