3.5 Real Data-Based ROC Analysis
In real applications only a limited number of samples are available for data analysis, referred to as the power of the test. In this case, the data sample pool is generally not sufficiently large to constitute reliable statistics that can be used to characterize the LRT Λ(r) implemented by a detector. Under such a circumstance there is no effective means of producing Λ(r) and the ROC analysis must be carried out with data samples rather than statistics, p0(r) and p1(r).
3.5.1 How to Generate ROC Curves from Real Data
In what follows, we define
N = total number of data samples used for a particular detection method (technique)
Nsignal = total number of data samples with presence of a signal (according to ground truth)
Nno-signal = total number of data samples with absence of a signal (according to ground truth)
ND = total number of data samples with presence of a signal which is actually detected by the method
NF = total number of data samples with absence of a signal, but claimed to have an signal detected by the method
NM = total number of data samples with presence of a signal which is not detected by the method
NTN = total number of data samples with presence of a signal and also claimed to have no signal detected by the method.
False alarm or false positive rate/probability is defined by
False negative or miss rate/probability:
(3.11) ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access