How to calculate class weights of a Pandas DataFrame for Keras?

Tags: python, pandas, keras

I'm trying to compute class weights for my labels:

print(Y)
print(Y.shape)

class_weights = compute_class_weight('balanced',
                                     np.unique(Y),
                                     Y)
print(class_weights)

But this gives me an error:

ValueError: classes should include all valid labels that can be in y

My Y looks like:

       0  1  2  3  4
0      0  0  1  0  0
1      1  0  0  0  0
2      0  0  0  1  0
3      0  0  1  0  0
...
14992  0  0  1  0  0
14993  0  0  1  0  0

And my Y.shape looks like: (14993, 5)

In my Keras model, I want to use the class weights because the class distribution is uneven:

model.fit(X, Y, epochs=100, shuffle=True, batch_size=1500, class_weight=class_weights, validation_split=0.05, verbose=1, callbacks=[csvLogger])

asked Feb 23 '19 13:02

Shamoon

2 Answers

compute_class_weight expects a 1-D array of class labels, not a one-hot matrix, so transform the one-hot encoding to categorical labels first:

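import numpy as np
from sklearn.utils import class_weight

# Each row is one-hot, so the column holding the 1 is the class label
y = Y.idxmax(axis=1)

# Recent scikit-learn versions require classes and y as keyword arguments
class_weights = class_weight.compute_class_weight(class_weight='balanced',
                                                  classes=np.unique(y),
                                                  y=y)

# Convert class_weights to a dictionary to pass it to class_weight in model.fit
class_weights = dict(enumerate(class_weights))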
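To see why the original call failed and what the conversion changes, here is a minimal self-contained sketch. The small one-hot DataFrame is made up for illustration (6 samples, 5 classes) and is not the asker's data.

```python
import numpy as np
import pandas as pd
from sklearn.utils.class_weight import compute_class_weight

# Hypothetical one-hot labels: 6 samples, 5 classes (class 2 appears twice)
Y = pd.DataFrame(np.eye(5, dtype=int)[[2, 0, 3, 2, 1, 4]])

# np.unique on the one-hot frame only sees the cell values 0 and 1,
# so it does not return the five class labels
print(np.unique(Y))  # [0 1]

# Recover one integer label per row, then compute the weights
y = Y.idxmax(axis=1)
weights = compute_class_weight(class_weight='balanced',
                               classes=np.unique(y),
                               y=y)

# Map class index -> weight, the form expected by class_weight in model.fit
class_weights = dict(enumerate(weights))
print(class_weights)
```

The resulting dictionary can then be passed as class_weight=class_weights in model.fit. Note that dict(enumerate(...)) assumes the labels are the integers 0..n-1 in sorted order, which holds here because the columns are numbered that way.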

answered Sep 18 '22 10:09

Andreas K.

Create some sample data with at least one example per class

import numpy as np
import pandas as pd
from sklearn.utils.class_weight import compute_class_weight

df = pd.DataFrame({
    '0': [0, 1, 0, 0, 0, 0],
    '1': [0, 0, 0, 0, 1, 0],
    '2': [1, 0, 0, 1, 0, 0],
    '3': [0, 0, 1, 0, 0, 0],
    '4': [0, 0, 0, 0, 0, 1],
})

Stack the columns (convert from wide to long table)

df = df.stack().reset_index()
>>> df.head()

   level_0 level_1  0
0        0       0  0
1        0       1  0
2        0       2  1
3        0       3  0
4        0       4  0

Get the class for each data point

Y = df[df[0] == 1]['level_1']
>>> Y
2     2
5     0
13    3
17    2
21    1
29    4

Compute class weights

class_weights = compute_class_weight(
    class_weight='balanced', classes=np.unique(Y), y=Y
)
>>> print(class_weights)
[1.2 1.2 0.6 1.2 1.2]
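As a sanity check, the 'balanced' weights can be reproduced by hand. scikit-learn documents the heuristic as n_samples / (n_classes * np.bincount(y)); the sketch below applies it to the six sample labels above, written out as a plain integer array.

```python
import numpy as np

# Integer class labels of the six sample rows above
y = np.array([2, 0, 3, 2, 1, 4])

counts = np.bincount(y)    # samples per class: [1 1 2 1 1]
n_samples = y.size         # 6
n_classes = counts.size    # 5

# Balanced heuristic: rarer classes get proportionally larger weights
weights = n_samples / (n_classes * counts)
print(weights)  # [1.2 1.2 0.6 1.2 1.2]
```

Class 2 occurs twice, so it gets 6 / (5 * 2) = 0.6, while every other class gets 6 / (5 * 1) = 1.2.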

answered Sep 21 '22 10:09

ulmefors
