Measure to compare true observed labels with predicted probabilities in multiclass classification tasks.
Arguments
- truth
(
factor())
True (observed) labels. Must have the same levels and length asresponse.- prob
(
matrix())
Matrix of predicted probabilities, each column is a vector of probabilities for a specific class label. Columns must be named with levels oftruth.- ...
(
any)
Additional arguments. Currently ignored.
Details
Brier score for multi-class classification problems with \(k\) labels defined as $$ \frac{1}{n} \sum_{i=1}^n \sum_{j=1}^k (I_{ij} - p_{ij})^2. $$ \(I_{ij}\) is 1 if observation \(x_i\) has true label \(j\), and 0 otherwise. \(p_{ij}\) is the probability that observation \(x_i\) belongs to class \(j\).
Note that there also is the more common definition of the Brier score for binary
classification problems in bbrier().
References
Brier GW (1950). “Verification of forecasts expressed in terms of probability.” Monthly Weather Review, 78(1), 1–3. doi:10.1175/1520-0493(1950)078<0001:vofeit>2.0.co;2 .