Transforming classifier scores into accurate multiclass probability estimates

Bianca Zadrozny, Charles Elkan
2002

Abstract

Class membership probability estimates are important for many applications of data mining in which classification outputs are combined with other sources of information for decision-making, such as example-dependent misclassification costs, the outputs of other classifiers, or domain knowledge. Previous calibration methods apply only to two-class problems. Here, we show how to obtain accurate probability estimates for multiclass problems by combining calibrated binary probability estimates. We also propose a new method for obtaining calibrated two-class probability estimates that can be applied to any classifier that produces a ranking of examples. Using naive Bayes and support vector machine classifiers, we give experimental results from a variety of two-class and multiclass domains, including direct marketing, text categorization and digit recognition.
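The abstract does not spell out how the calibrated binary estimates are combined. As a minimal sketch, assuming a one-vs-rest decomposition and the simplest possible combination rule (dividing each per-class estimate by their sum), the step might look like the following. The function name `normalize_one_vs_rest`, the uniform fallback for all-zero rows, and the example numbers are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def normalize_one_vs_rest(binary_probs):
    """Combine per-class binary estimates into multiclass probabilities.

    binary_probs: array of shape (n_examples, n_classes); entry [i, k] is
    assumed to be a calibrated estimate of P(class k vs. rest | example i).
    """
    p = np.asarray(binary_probs, dtype=float)
    totals = p.sum(axis=1, keepdims=True)
    n_classes = p.shape[1]
    # Assumption: fall back to a uniform distribution when every binary
    # estimate for an example is zero, to avoid dividing by zero.
    uniform = np.full_like(p, 1.0 / n_classes)
    safe_totals = np.where(totals > 0, totals, 1.0)
    return np.where(totals > 0, p / safe_totals, uniform)

# Hypothetical calibrated one-vs-rest estimates for two examples, three classes.
scores = np.array([[0.8, 0.3, 0.1],
                   [0.0, 0.0, 0.0]])
multiclass = normalize_one_vs_rest(scores)
```

Each row of `multiclass` sums to one. Normalization is only one way to combine binary estimates, and it is only as good as the calibration of the binary inputs.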

1 repository
1 reference

Code References

scikit-learn/scikit-learn
1 file
sklearn/linear_model/_stochastic_gradient.py
https://dl.acm.org/doi/pdf/10.1145/775047.775151