pandas: selecting array of index labels with .loc

Tags:

python

pandas

Consider this DataFrame:

import pandas as pd

df = pd.DataFrame({u'A': {2.0: 2.2, 7.0: 1.4, 8.0: 1.4, 9.0: 2.2},
                   u'B': {2.0: 7.2, 7.0: 6.3, 8.0: 4.4, 9.0: 5.0}})

Which looks like this:

      A       B
2    2.2     7.2
7    1.4     6.3
8    1.4     4.4
9    2.2     5.0

I'd like to select the rows with index labels 2 and 7 (numbers, not strings), but

df.loc[[2, 7]]

gives an error!

IndexError: indices are out-of-bounds

However, df.loc[7] and df.loc[2] each work fine and return the expected row. Also, if I define the DataFrame index with strings instead of numbers:

df2 = pd.DataFrame({u'A': {'2': 2.2, '7': 1.4, '8': 1.4, '9': 2.2},
                    u'B': {'2': 7.2, '7': 6.3, '8': 4.4, '9': 5.0}})

df2.loc[['2', '8']]

it works fine.

This is not the behavior I expected from df.loc (is it a bug or just a gotcha?) Can I pass an array of numbers as label indices and not just positions?

I can convert all indices to strings and then operate with .loc but it would be very inconvenient for the rest of my code.

Thanks for your time!

asked Nov 07 '13 19:11

cd98

1 Answer

This is a bug in pandas 0.12. Version 0.13 fixes it (in other words, label selection with a list should work whether the labels are numbers or strings).

Until you can upgrade, you could work around it like this (it uses an internal method, though):

In [10]: df.iloc[df.index.get_indexer([2,7])]
Out[10]: 
     A    B
2  2.2  7.2
7  1.4  6.3
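
For anyone who wants to try this as a script rather than in an IPython session, here is a minimal sketch of the same workaround next to the plain label lookup. It assumes a pandas release newer than 0.12, where df.loc with a list of numeric labels is expected to work; the names labels, positions, by_position and by_label are only illustrative, and the check for -1 is an addition that is not part of the original answer.

```python
import pandas as pd

# Same frame as in the question: a float-valued index with labels 2, 7, 8, 9.
df = pd.DataFrame({"A": {2.0: 2.2, 7.0: 1.4, 8.0: 1.4, 9.0: 2.2},
                   "B": {2.0: 7.2, 7.0: 6.3, 8.0: 4.4, 9.0: 5.0}})

labels = [2, 7]

# Workaround from the answer: translate labels into integer positions,
# then select by position with .iloc.
positions = df.index.get_indexer(labels)

# get_indexer marks labels it cannot find with -1; with .iloc that would
# silently pick the last row, so fail loudly instead (added safeguard).
if (positions == -1).any():
    raise KeyError("some labels are not in the index: %r" % (labels,))

by_position = df.iloc[positions]

# On versions with the fix, plain label selection with a list should
# return the same rows.
by_label = df.loc[labels]

print(by_position)
print(by_label)
```

Note that get_indexer needs a unique index, so this sketch would not carry over as-is to a frame with duplicate labels.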

answered Nov 03 '22 09:11

Jeff
