Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Getting (index, column) pairs for True elements of a boolean DataFrame in Pandas

Tags:

python

pandas

Say I have a Pandas DataFrame and I want to obtain a list of tuples of the form [(index1, column1), (index2, column2) ...] describing the locations of all elements of the DataFrame where some condition is true. For example:

x = pd.DataFrame(np.random.normal(0, 1, (4,4)), index=['a', 'b', 'c', 'd'],
                 columns=['e', 'f', 'g', 'h'])
x


     e           f           g           h
a   -1.342571   -0.274879   -0.903354   -1.458702
b   -1.521502   -1.135800   -1.147913   1.829485
c   -1.199857   0.458135    -1.993701   -0.878301
d   0.485599    0.286608    -0.436289   -0.390755

y = x > 0

Is there any way to obtain:

x.loc[y]

To return:

[(b, h), (c,f), (d, e), (d,f)]

Or some equivalent? Evidently, I can do:

postup = []
for i in x.index:
    for j in x.columns:
        if x.loc[i, j] > 0:
            postup.append((i, j))

But I imagine something better may be possible / already implemented. In matlab the function find combined with sub2ind do the job.

like image 879
dylkot Avatar asked Nov 10 '14 22:11

dylkot


1 Answers

x[x > 0].stack().index.tolist()
like image 62
A. Coady Avatar answered Sep 22 '22 18:09

A. Coady