Transforming classifier scores into accurate multiclass probability estimates

Bianca Zadrozny, Charles Elkan

2002

1 reference

Abstract

Class membership probability estimates are important for many applications of data mining in which classification outputs are combined with other sources of information for decision-making, such as example-dependent misclassification costs, the outputs of other classifiers, or domain knowledge. Previous calibration methods apply only to two-class problems. Here, we show how to obtain accurate probability estimates for multiclass problems by combining calibrated binary probability estimates. We also propose a new method for obtaining calibrated two-class probability estimates that can be applied to any classifier that produces a ranking of examples. Using naive Bayes and support vector machine classifiers, we give experimental results from a variety of two-class and multiclass domains, including direct marketing, text categorization and digit recognition.

View Paper DOI

🤖 Machine Learning

1 repository

1 reference

Code References

▶ scikit-learn/scikit-learn

1 file

▶ sklearn/linear_model/_stochastic_gradient.py

L1346

https://dl.acm.org/doi/pdf/10.1145/775047.775151

Link copied to clipboard!