![]() Server time : 2010-03-10 02:46
|
KDD Cup 2009 |
![]() |
| Prediction | |||
|---|---|---|---|
| Class +1 | Class -1 | ||
| Truth | Class +1 | tp | fn |
| Class -1 | fp | tn | |
We define the sensitivity (also called true positive rate or hit rate) and the specificity (true negative rate)
as:
Sensitivity = tp/pos
Specificity = tn/neg
where pos=tp+fn is the total number of positive examples and neg=tn+fp the total number of negative examples.
The results will be evaluated with the so-called Area Under Curve (AUC). It corresponds to the area under the curve obtained by plotting sensitivity against specificity by varying a threshold on the prediction values to determine the classification result. The AUC is related to the area under the lift curve and the Gini index used in marketing (Gini=2 AUC -1). The AUC is calculated using the trapezoid method. In the case when binary scores are supplied for the classification instead of discriminant values, the curve is given by {(0,1),(tn/(tn+fp),tp/(tp+fn)),(1,0)} and the AUC is just the Balanced ACcuracy BAC.