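Let's assume that there are two target classes: $C_1$ and $C_2$. By using Bayes' theorem, the posterior probability of $C_1$ can be described as follows:

$$p(C_1 \mid x) = \frac{p(x \mid C_1)\,p(C_1)}{p(x \mid C_1)\,p(C_1) + p(x \mid C_2)\,p(C_2)} = \frac{1}{1 + \exp(-a)} = \sigma(a)$$

In this case, we are using a logistic sigmoid function, $\sigma(a) = \frac{1}{1 + \exp(-a)}$, and defining $a$ as follows:

$$a = \ln \frac{p(x \mid C_1)\,p(C_1)}{p(x \mid C_2)\,p(C_2)}$$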
The inverse of the logistic sigmoid function, $a = \ln\left(\frac{\sigma}{1 - \sigma}\right)$, is called the logit function, which ...
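
As a rough numerical check of the relationship described above, the following sketch computes the posterior for the first class in two ways: directly from Bayes' theorem, and by passing the log-odds through the logistic sigmoid. The Gaussian class-conditional densities, their parameters, the prior, and all function names are assumptions chosen only for illustration; they do not come from the text.

```python
import math


def sigmoid(a):
    """Logistic sigmoid: maps a real number to the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-a))


def logit(p):
    """Inverse of the logistic sigmoid: the log-odds of a probability p."""
    return math.log(p / (1.0 - p))


def gaussian_pdf(x, mean, std):
    """Density of a 1-D Gaussian (an assumed class-conditional model)."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def posterior_c1_bayes(x, prior_c1, pdf_c1, pdf_c2):
    """Posterior of class 1 computed directly from Bayes' theorem."""
    joint_c1 = pdf_c1(x) * prior_c1
    joint_c2 = pdf_c2(x) * (1.0 - prior_c1)
    return joint_c1 / (joint_c1 + joint_c2)


def log_odds(x, prior_c1, pdf_c1, pdf_c2):
    """The quantity a: log of the ratio of the two joint probabilities."""
    joint_c1 = pdf_c1(x) * prior_c1
    joint_c2 = pdf_c2(x) * (1.0 - prior_c1)
    return math.log(joint_c1) - math.log(joint_c2)


# Hypothetical setup: two unit-variance Gaussians and a prior of 0.3 for class 1.
pdf_c1 = lambda x: gaussian_pdf(x, mean=0.0, std=1.0)
pdf_c2 = lambda x: gaussian_pdf(x, mean=2.0, std=1.0)
x, prior_c1 = 0.5, 0.3

a = log_odds(x, prior_c1, pdf_c1, pdf_c2)
p_bayes = posterior_c1_bayes(x, prior_c1, pdf_c1, pdf_c2)
p_sigmoid = sigmoid(a)

print(p_bayes, p_sigmoid)   # the two posteriors should agree up to rounding
print(a, logit(p_sigmoid))  # logit should recover a from the posterior
```

Under these assumptions the two posteriors should match up to floating-point rounding, and applying `logit` to the posterior should return the log-odds `a`, which is one way to see that the logit function undoes the sigmoid.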