python 3 get the column name depending of a condition [duplicate]

Question

So i have a pandas df (python 3.6) like this

index   A   B   C  ... 
  A     1   5   0
  B     0   0   1 
  C     1   2   4
 ...

As you can see, the index values are the same as the columns names.

What i'm trying to do is to get a new column in the dataframe that has the name of the columns where the value is > than 0

index   A   B   C  ... NewColumn
  A     1   5   0       [A,B]
  B     0   0   1       [C]
  C     1   2   4       [A,B,C]
 ...

i've been trying with iterrows with no success

also i know i can melt and pivot but i think there should be a way with apply lamnda maybe?

Thanks in advance

jezrael · Accepted Answer

If new column should be string compare by DataFrame.gt with dot product with columns, last remove trailing separator:

df['NewColumn'] = df.gt(0).dot(df.columns + ', ').str.rstrip(', ')
print (df)
   A  B  C NewColumn
A  1  5  0      A, B
B  0  0  1         C
C  1  2  4   A, B, C

And for lists use apply with lambda function:

df['NewColumn'] = df.gt(0).apply(lambda x: x.index[x].tolist(), axis=1)
print (df)
   A  B  C  NewColumn
A  1  5  0     [A, B]
B  0  0  1        [C]
C  1  2  4  [A, B, C]

python 3 get the column name depending of a condition [duplicate]

Tags:

python

python-3.x

pandas

Gera Sanz

1 Answers

jezrael

Recent Activity

Donate For Us

python 3 get the column name depending of a condition [duplicate]

Tags:

python

python-3.x

pandas

Gera Sanz

1 Answers

jezrael

Related questions

Recent Activity

Donate For Us