Positive Predictive Value

Measure to compare true observed labels with predicted labels in binary classification tasks.

Usage

ppv(truth, response, positive, sample_weights = NULL, na_value = NaN, ...)

precision(
  truth,
  response,
  positive,
  sample_weights = NULL,
  na_value = NaN,
  ...
)

Arguments

truth: (factor())
True (observed) labels. Must have the exactly same two levels and the same length as response.
response: (factor())
Predicted response labels. Must have the exactly same two levels and the same length as truth.
positive: (character(1))
Name of the positive class.
sample_weights: (numeric())
Vector of non-negative and finite sample weights. Must have the same length as truth. The vector gets automatically normalized to sum to one. Defaults to equal sample weights.
na_value: (numeric(1))
Value that should be returned if the measure is not defined for the input (as described in the note). Default is NaN.
...: (any)
Additional arguments. Currently ignored.

Value

Performance value as numeric(1).

Details

The Positive Predictive Value is defined as $$ \frac{\mathrm{TP}}{\mathrm{TP} + \mathrm{FP}}. $$ Also know as "precision".

This measure is undefined if TP + FP = 0.

Meta Information

Type: "binary"
Range: $[0, 1]$
Minimize: FALSE
Required prediction: response

References

https://en.wikipedia.org/wiki/Template:DiagnosticTesting_Diagram

Goutte C, Gaussier E (2005). “A Probabilistic Interpretation of Precision, Recall and F-Score, with Implication for Evaluation.” In Lecture Notes in Computer Science, 345–359. doi:10.1007/978-3-540-31865-1_25 .

Examples

set.seed(1)
lvls = c("a", "b")
truth = factor(sample(lvls, 10, replace = TRUE), levels = lvls)
response = factor(sample(lvls, 10, replace = TRUE), levels = lvls)
ppv(truth, response, positive = "a")
#> [1] 0.5