Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

np.isreal behavior different in pandas.DataFrame and numpy.array

I have a array like below

np.array(["hello","world",{"a":5,"b":6,"c":8},"usa","india",{"d":9,"e":10,"f":11}])

and a pandas DataFrame like below

df = pd.DataFrame({'A': ["hello","world",{"a":5,"b":6,"c":8},"usa","india",{"d":9,"e":10,"f":11}]})

When I apply np.isreal to DataFrame

df.applymap(np.isreal)
Out[811]: 
       A
0  False
1  False
2   True
3  False
4  False
5   True

When I do np.isreal for the numpy array.

np.isreal( np.array(["hello","world",{"a":5,"b":6,"c":8},"usa","india",{"d":9,"e":10,"f":11}]))
Out[813]: array([ True,  True,  True,  True,  True,  True], dtype=bool)

I must using the np.isreal in the wrong use case, But can you help me about why the result is different ?

like image 231
BENY Avatar asked Dec 14 '22 20:12

BENY


1 Answers

A partial answer is that isreal is only intended to be used on array-like as the first argument.

You want to use isrealobj on each element to get the bahavior you see here:

In [11]: a = np.array(["hello","world",{"a":5,"b":6,"c":8},"usa","india",{"d":9,"e":10,"f":11}])

In [12]: a
Out[12]:
array(['hello', 'world', {'a': 5, 'b': 6, 'c': 8}, 'usa', 'india',
       {'d': 9, 'e': 10, 'f': 11}], dtype=object)

In [13]: [np.isrealobj(aa) for aa in a]
Out[13]: [True, True, True, True, True, True]

In [14]: np.isreal(a)
Out[14]: array([ True,  True,  True,  True,  True,  True], dtype=bool)

That does leave the question, what does np.isreal do on something that isn't array-like e.g.

In [21]: np.isrealobj("")
Out[21]: True

In [22]: np.isreal("")
Out[22]: False

In [23]: np.isrealobj({})
Out[23]: True

In [24]: np.isreal({})
Out[24]: True

It turns out this stems from .imag since the test that isreal does is:

return imag(x) == 0   # note imag == np.imag

and that's it.

In [31]: np.imag(a)
Out[31]: array([0, 0, 0, 0, 0, 0], dtype=object)

In [32]: np.imag("")
Out[32]:
array('',
      dtype='<U1')

In [33]: np.imag({})
Out[33]: array(0, dtype=object)

This looks up the .imag attribute on the array.

In [34]: np.asanyarray("").imag
Out[34]:
array('',
      dtype='<U1')

In [35]: np.asanyarray({}).imag
Out[35]: array(0, dtype=object)

I'm not sure why this isn't set in the string case yet...

like image 141
Andy Hayden Avatar answered Mar 18 '23 08:03

Andy Hayden