
Random Forests - Probability Estimates (+scikit-learn specific)

I am interested in understanding how probability estimates are calculated by random forests, both in general and specifically in Python's scikit-learn library (where probability estimates are returned by the predict_proba function).

Thanks, Guy

Guy Adini asked Jan 07 '13 08:01



1 Answer

The probabilities returned by a forest are the mean of the probabilities returned by the individual trees in the ensemble (docs). The probabilities returned by a single tree are the normalized class histogram of the leaf a sample lands in, i.e. the fraction of training samples of each class in that leaf.
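To check this yourself, here is a minimal sketch. The synthetic dataset, the forest size and the variable names are illustrative assumptions, not part of the question. It compares predict_proba on a fitted RandomForestClassifier with the average of predict_proba over its estimators_, and then rebuilds one tree's probabilities from the per-leaf class values stored in tree_.value.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Synthetic 3-class problem (illustrative data only).
X, y = make_classification(
    n_samples=200, n_features=8, n_informative=4, n_classes=3, random_state=0
)
forest = RandomForestClassifier(n_estimators=25, random_state=0).fit(X, y)
X_new = X[:5]

# Forest level: average the per-tree probability estimates.
forest_proba = forest.predict_proba(X_new)
tree_mean_proba = np.mean(
    [tree.predict_proba(X_new) for tree in forest.estimators_], axis=0
)
forest_matches_mean = np.allclose(forest_proba, tree_mean_proba)

# Tree level: look up the leaf each sample lands in and normalize
# the class values stored for that leaf so each row sums to 1.
tree = forest.estimators_[0]
leaf_ids = tree.apply(X_new)                 # index of the leaf per sample
leaf_values = tree.tree_.value[leaf_ids, 0]  # shape (n_samples, n_classes)
leaf_proba = leaf_values / leaf_values.sum(axis=1, keepdims=True)
tree_matches_leaves = np.allclose(tree.predict_proba(X_new), leaf_proba)

print(forest_matches_mean, tree_matches_leaves)
```

Both comparisons should come out True. Note that with fully grown trees (the default) most leaves are pure, so a single tree tends to return probabilities of 0 or 1; the graded values you see from the forest come mostly from the averaging step.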

Andreas Mueller answered Nov 08 '22 22:11
