Measure to compare true observed labels with predicted probabilities in multiclass classification tasks.
Arguments
- truth
(
factor())
True (observed) labels. Must have the same levels and length asresponse.- prob
(
matrix())
Matrix of predicted probabilities, each column is a vector of probabilities for a specific class label. Columns must be named with levels oftruth.- sample_weights
(
numeric())
Vector of non-negative and finite sample weights. Must have the same length astruth. The vector gets automatically normalized to sum to one. Defaults to equal sample weights.- ...
(
any)
Additional arguments. Currently ignored.
Details
Brier score for multi-class classification problems with \(k\) labels defined as $$ \frac{1}{n} \sum_{i=1}^n \sum_{j=1}^k (I_{ij} - p_{ij})^2. $$ \(I_{ij}\) is 1 if observation \(x_i\) has true label \(j\), and 0 otherwise. \(p_{ij}\) is the probability that observation \(x_i\) belongs to class \(j\).
Note that there also is the more common definition of the Brier score for binary
classification problems in bbrier().
References
Brier GW (1950). “Verification of forecasts expressed in terms of probability.” Monthly Weather Review, 78(1), 1–3. doi:10.1175/1520-0493(1950)078<0001:vofeit>2.0.co;2 .
See also
Other Classification Measures:
acc(),
bacc(),
ce(),
logloss(),
mauc_aunu(),
mcc(),
zero_one()