I am trying to build a classifier for a multi-class classification problem (3 classes) using LightGBM in Python. I used the following parameters:
params = {'task': 'train',
          'boosting_type': 'gbdt',
          'objective': 'multiclass',
          'num_class': 3,
          'metric': 'multi_logloss',
          'learning_rate': 0.002296,
          'max_depth': 7,
          'num_leaves': 17,
          'feature_fraction': 0.4,
          'bagging_fraction': 0.6,
          'bagging_freq': 17}
All the categorical features of the dataset are label encoded with LabelEncoder. I trained the model after running cv with early_stopping, as shown below.
import numpy as np
import lightgbm as lgbm

lgb_cv = lgbm.cv(params, d_train, num_boost_round=10000, nfold=3, shuffle=True,
                 stratified=True, verbose_eval=20, early_stopping_rounds=100)
# index of the lowest mean multi_logloss (note: .index() is 0-based, so the best round count is this value + 1)
nround = lgb_cv['multi_logloss-mean'].index(np.min(lgb_cv['multi_logloss-mean']))
print(nround)
model = lgbm.train(params, d_train, num_boost_round=nround)
After training, I made predictions with the model like this:
preds = model.predict(test)
print(preds)
I got a nested array as output, like this:
[[ 7.93856847e-06 9.99989550e-01 2.51164967e-06]
[ 7.26332978e-01 1.65316511e-05 2.73650491e-01]
[ 7.28564308e-01 8.36756769e-06 2.71427325e-01]
...,
[ 7.26892634e-01 1.26915179e-05 2.73094674e-01]
[ 5.93217601e-01 2.07172044e-04 4.06575227e-01]
[ 5.91722491e-05 9.99883828e-01 5.69994435e-05]]
As each row in preds represents the class probabilities, I used np.argmax() to find the predicted classes, like this:
predictions = []
for x in preds:
    predictions.append(np.argmax(x))
While analyzing the predictions I found that they contain only 2 classes - 0 and 1. Class 2 was the second-largest class in the training set, but it was nowhere to be found in the predictions. On evaluating the result, it gave about 78% accuracy.
So, why didn't my model predict class 2 for any of the cases? Is there anything wrong with the parameters I used? Isn't this the proper way to interpret the predictions made by the model? Should I make any changes to the parameters?
To use LightGBM for a multiclass classification problem, you need to set two parameters: objective and num_class - which you have already done correctly ('multiclass' and 3). The most commonly used metrics for multiclass problems are F1 score, average accuracy, and log-loss, so multi_logloss is a reasonable choice here.
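As a quick sanity check (y_train below is a stand-in name for your encoded label array), make sure the labels are exactly 0, 1 and 2 so they line up with num_class = 3:
import numpy as np

# y_train is a placeholder name for your encoded training labels;
# LightGBM expects multiclass labels to be the integers 0 .. num_class - 1.
print(np.unique(y_train))  # should print [0 1 2] for num_class = 3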
Try troubleshooting by swapping classes 0 and 2 in the training labels, and re-running the training and prediction process, as in the sketch below.
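A minimal sketch of that swap, reusing params, nround and test from your question (X_train and y_train are illustrative names for your features and encoded labels; adapt them to your variables):
import numpy as np
import lightgbm as lgbm

# X_train / y_train are hypothetical names for your training features and labels.
y_swapped = y_train.copy()
y_swapped[y_train == 0] = 2  # relabel class 0 as class 2
y_swapped[y_train == 2] = 0  # relabel class 2 as class 0

d_train_swapped = lgbm.Dataset(X_train, label=y_swapped)
model_swapped = lgbm.train(params, d_train_swapped, num_boost_round=nround)
preds_swapped = model_swapped.predict(test)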
If the new predictions only contain classes 1 and 2 (most likely given your provided data): the model favors the same two underlying classes no matter what they are called, so the problem lies in the data or training rather than the code. In your output, class 2's probability often trails class 0's by a stable margin (roughly 0.27 vs 0.73), suggesting the features do not separate those two classes well; check the class balance, the informativeness of your features, and whether the very small learning rate is leaving the model underfit.
If the new predictions do contain all 3 classes: the behavior depends on the label values themselves, which points to a bug in the code or preprocessing; double-check the label encoding and how each class maps to a column of preds.
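Either way, a vectorized argmax plus a class count over your preds array makes it easy to see which classes ever win and how close class 2 gets:
import numpy as np

pred_classes = np.argmax(preds, axis=1)        # winning class per row
print(np.bincount(pred_classes, minlength=3))  # how often each class wins
print(preds[:, 2].max())                       # best probability class 2 ever reaches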
Hope this helps.