 

Can you fix the false negative rate in a classifier in scikit-learn?

I am using a Random Forest classifier in scikit-learn with an imbalanced data set of two classes. I am much more worried about false negatives than false positives. Is it possible to fix the false negative rate (to, say, 1%) and ask scikit to optimize the false positive rate somehow?

If this classifier doesn't support it, is there another classifier that does?

asked Sep 17 '15 by graffe

People also ask

How do you reduce false negatives in Python?

To reduce the number of false negatives (FN) or false positives (FP), you can retrain a model on the same data with the targets or sample weights adjusted in light of its previous errors, so that the error type you care about is penalized more heavily, and train until the loss converges.
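
As a hedged illustration of the reweighting idea, the sketch below retrains a random forest with scikit-learn's sample_weight argument so that positive samples count more; the weight of 10 and the synthetic dataset are arbitrary assumptions, not tuned values.

    import numpy as np
    from sklearn.datasets import make_classification
    from sklearn.ensemble import RandomForestClassifier

    # Imbalanced toy data: roughly 10% positives (class 1).
    X, y = make_classification(n_samples=2000, weights=[0.9, 0.1], random_state=0)

    # Make each positive sample count 10x, so the trees pay more
    # for misclassifying positives (i.e., for false negatives).
    weights = np.where(y == 1, 10.0, 1.0)

    clf = RandomForestClassifier(random_state=0)
    clf.fit(X, y, sample_weight=weights)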

How can you reduce false positives in binary classification?

Another common method for reducing false negatives or false positives is moving the decision threshold. The default threshold in binary classification models is 0.5: when the predicted probability of the positive class exceeds 0.5, the prediction is considered positive.
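
Threshold moving is also the closest answer to the original question of fixing the false negative rate at roughly 1%. Below is a rough sketch under assumed synthetic data: it scans candidate thresholds on a validation set and keeps the largest one whose FNR stays at or below the target (FNR can only grow as the threshold rises, so the largest feasible threshold gives the fewest false positives).

    import numpy as np
    from sklearn.datasets import make_classification
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.model_selection import train_test_split

    # Imbalanced toy data: ~10% positives (class 1), the class we must not miss.
    X, y = make_classification(n_samples=5000, weights=[0.9, 0.1], random_state=0)
    X_tr, X_val, y_tr, y_val = train_test_split(X, y, stratify=y, random_state=0)

    clf = RandomForestClassifier(random_state=0).fit(X_tr, y_tr)
    proba = clf.predict_proba(X_val)[:, 1]  # estimated P(class == 1)

    # Keep the largest threshold whose validation FNR is still <= 1%.
    target_fnr = 0.01
    best_t = 0.0
    for t in np.linspace(0.0, 1.0, 101):
        pred = (proba >= t).astype(int)
        fnr = np.mean(pred[y_val == 1] == 0)  # share of true positives missed
        if fnr <= target_fnr:
            best_t = t

    print("threshold meeting the FNR target:", best_t)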

What is false negative in classification?

A false positive is an outcome where the model incorrectly predicts the positive class, and a false negative is an outcome where the model incorrectly predicts the negative class. Together with true positives and true negatives, these four outcomes are the basis for evaluation metrics such as precision and recall.
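
For reference, a minimal sketch of reading those four outcomes off scikit-learn's confusion_matrix (the toy labels are made up):

    from sklearn.metrics import confusion_matrix

    y_true = [1, 0, 1, 1, 0, 0, 1]
    y_pred = [1, 0, 0, 1, 0, 1, 1]

    # For binary labels {0, 1}, ravel() yields tn, fp, fn, tp in that order.
    tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
    print("false positives:", fp, "false negatives:", fn)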


2 Answers

I believe the problem of class imbalance in sklearn can be partially resolved by using the class_weight parameter.

This parameter is either a dictionary mapping each class to a weight, or a string that tells sklearn how to build that dictionary. For instance, setting it to 'balanced' (called 'auto' in older versions) weights each class in inverse proportion to its frequency.

By giving the under-represented class a higher weight, you can end up with 'better' results.

Classifiers like SVM or logistic regression also offer this class_weight parameter.
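
A minimal sketch of class_weight on both a random forest and a logistic regression; the explicit {0: 1, 1: 10} dictionary is only an illustration, not a recommended setting:

    from sklearn.datasets import make_classification
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.linear_model import LogisticRegression

    X, y = make_classification(n_samples=2000, weights=[0.9, 0.1], random_state=0)

    # 'balanced' weights each class by the inverse of its frequency.
    rf = RandomForestClassifier(class_weight="balanced", random_state=0).fit(X, y)

    # The same parameter accepts an explicit dict; here errors on class 1 cost 10x.
    lr = LogisticRegression(class_weight={0: 1, 1: 10}).fit(X, y)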

This Stack Overflow answer gives some other ideas on how to handle class imbalance, such as undersampling and oversampling.
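
As a rough sketch of the oversampling idea using only scikit-learn utilities, the hypothetical helper below (oversample_minority is not a library function) duplicates minority-class rows with sklearn.utils.resample until the classes are balanced:

    import numpy as np
    from sklearn.utils import resample

    def oversample_minority(X, y, minority_label=1, random_state=0):
        # Split rows by class, then draw minority rows with replacement
        # until both classes have the same number of samples.
        X_maj, y_maj = X[y != minority_label], y[y != minority_label]
        X_min = X[y == minority_label]
        X_min_up = resample(X_min, replace=True, n_samples=len(X_maj),
                            random_state=random_state)
        X_bal = np.vstack([X_maj, X_min_up])
        y_bal = np.concatenate([y_maj, np.full(len(X_maj), minority_label)])
        return X_bal, y_bal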

answered Sep 20 '22 by DJanssens


I found this article on the class imbalance problem.

http://www.chioka.in/class-imbalance-problem/

In summary, it discusses the following possible solutions:

  • Cost function based approaches
  • Sampling based approaches
  • SMOTE (Synthetic Minority Over-Sampling Technique; see the sketch after this list)
  • Recent approaches: RUSBoost, SMOTEBagging and UnderBagging
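
A hedged sketch of SMOTE, assuming the third-party imbalanced-learn package (imblearn) is installed; the synthetic dataset is an arbitrary stand-in:

    from imblearn.over_sampling import SMOTE
    from sklearn.datasets import make_classification
    from sklearn.ensemble import RandomForestClassifier

    X, y = make_classification(n_samples=2000, weights=[0.9, 0.1], random_state=0)

    # SMOTE synthesizes new minority samples by interpolating
    # between a minority point and its nearest minority neighbors.
    X_res, y_res = SMOTE(random_state=0).fit_resample(X, y)
    clf = RandomForestClassifier(random_state=0).fit(X_res, y_res)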

Hope it helps.

answered Sep 20 '22 by Pappu Jha