Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Selecting rows of pandas DataFrame where column is not empty list

I have a pandas DataFrame where one column contains lists and would like to select the rows where the lists are not empty.

Example data:

df = pd.DataFrame({'letter': ["a", "b", "c", "d", "e"], 
               'my_list':[[0,1,2],[1,2],[],[],[0,1]]})

df
    letter   my_list
0   a   [0, 1, 2]
1   b   [1, 2]
2   c   []
3   d   []
4   e   [0, 1]

What I'd like:

df
    letter  my_list
0   a   [0, 1, 2]
1   b   [1, 2]
4   e   [0, 1]

What I'm trying:

df[df.my_list.map(lambda x: if len(x) !=0)]

... which returns an invalid syntax error. Any suggestions?

like image 707
Chinntimes Avatar asked Apr 06 '18 20:04

Chinntimes


2 Answers

Or you can follow your own logic using length of the list to determine keep or not .

df[df.my_list.str.len().gt(0)]
Out[1707]: 
  letter    my_list
0      a  [0, 1, 2]
1      b     [1, 2]
4      e     [0, 1]
like image 198
BENY Avatar answered Sep 21 '22 16:09

BENY


Empty lists evaluate to False in a boolean context

df[df.my_list.astype(bool)]

  letter    my_list
0      a  [0, 1, 2]
1      b     [1, 2]
4      e     [0, 1]
like image 39
piRSquared Avatar answered Sep 21 '22 16:09

piRSquared