Select data at a particular level from a MultiIndex

Tags:

python

pandas

I have the following pandas DataFrame with a MultiIndex (Z, A):

             H1       H2  
   Z    A 
0  100  200  0.3112   -0.4197   
1  100  201  0.2967   0.4893    
2  100  202  0.3084   -0.4873   
3  100  203  0.3069   NaN        
4  101  203  -0.4956  NaN

Question: How can I select all items with A=203? I tried df[:,'A'] but it doesn't work. Then I found this in the online documentation, so I tried:
df.xs(203,level='A')
but I get:
"TypeError: xs() got an unexpected keyword argument 'level'"
Also, I don't see this parameter in the installed docs (df.xs?):

Parameters
----------
key : object
    Some label contained in the index, or partially in a MultiIndex
axis : int, default 0
    Axis to retrieve cross-section on
copy : boolean, default True
    Whether to make a copy of the data

Note: I have the development version.

Edit: I found this thread. They recommend something like:

df.select(lambda x: x[1]==200, axis=0)

I would still like to know what happened to the level parameter of df.xs, or what the recommended way is in the current version.

703

asked Apr 16 '12 13:04

elyase

2 Answers

The problem was my (incorrect) assumption that I was running the dev version, when in reality I had 1.6.1. You can check the installed version with:

import pandas
print(pandas.__version__)

In the current version, df.xs() with the level parameter works fine.
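
For completeness, here is a minimal sketch of the df.xs call on a frame shaped like the one in the question. The construction of df is an assumption: the question only shows the printed output, so the frame is rebuilt here with Z and A as the two index levels. The drop_level argument is available in reasonably recent pandas releases, not in the old version the question was asked against.

```python
import numpy as np
import pandas as pd

# Rebuild the frame from the question (assumed layout: Z and A form the MultiIndex).
index = pd.MultiIndex.from_tuples(
    [(100, 200), (100, 201), (100, 202), (100, 203), (101, 203)],
    names=["Z", "A"],
)
df = pd.DataFrame(
    {
        "H1": [0.3112, 0.2967, 0.3084, 0.3069, -0.4956],
        "H2": [-0.4197, 0.4893, -0.4873, np.nan, np.nan],
    },
    index=index,
)

# Cross-section on level A; by default the selected level is dropped from the result.
selected = df.xs(203, level="A")

# drop_level=False keeps both index levels in the result.
selected_full = df.xs(203, level="A", drop_level=False)

print(selected)
print(selected_full)
```

With drop_level left at its default, the result is indexed by Z only, which is usually what you want when A is fixed to a single value.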

158

answered Sep 18 '22 09:09

elyase

Not a direct answer to the question, but if you want to select more than one value you can use the "slice()" notation:

import numpy
from pandas import  MultiIndex, Series

arrays = [['bar', 'bar', 'baz', 'baz', 'foo', 'foo', 'qux', 'qux'],
          ['one', 'two', 'one', 'two', 'one', 'two', 'one', 'two']]
tuples = list(zip(*arrays))
index = MultiIndex.from_tuples(tuples, names=['first', 'second'])
s = Series(numpy.random.randn(8), index=index)

In [10]: s
Out[10]:
first  second
bar    one       0.181621
       two       1.016225
baz    one       0.716589
       two      -0.353731
foo    one      -0.326301
       two       1.009143
qux    one       0.098225
       two      -1.087523
dtype: float64

In [11]: s.loc[slice(None)]
Out[11]:
first  second
bar    one       0.181621
       two       1.016225
baz    one       0.716589
       two      -0.353731
foo    one      -0.326301
       two       1.009143
qux    one       0.098225
       two      -1.087523
dtype: float64

In [12]: s.loc[slice(None), "one"]
Out[12]:
first
bar      0.181621
baz      0.716589
foo     -0.326301
qux      0.098225
dtype: float64

In [13]: s.loc["bar", slice(None)]
Out[13]:
first  second
bar    one       0.181621
       two       1.016225
dtype: float64
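
The same idea carries over to a DataFrame like the one in the question. The sketch below is illustrative only: df is rebuilt by hand from the printed output (an assumption), and the values 202 and 203 are picked just to show a multi-value selection. On a DataFrame the column indexer is passed explicitly so that the tuple is read as row levels, and the index is sorted first because slicing across MultiIndex levels can require a sorted index. The boolean mask at the end plays the same role as the df.select call quoted in the question, which, as far as I know, was removed in pandas 1.0.

```python
import pandas as pd

# Assumed reconstruction of the question's frame, with Z and A as index levels.
index = pd.MultiIndex.from_tuples(
    [(100, 200), (100, 201), (100, 202), (100, 203), (101, 203)],
    names=["Z", "A"],
)
df = pd.DataFrame(
    {"H1": [0.3112, 0.2967, 0.3084, 0.3069, -0.4956]}, index=index
).sort_index()

# slice(None) means "everything on level Z"; the list picks values on level A.
by_loc = df.loc[(slice(None), [202, 203]), :]

# pd.IndexSlice lets you write the same selection with ordinary slice syntax.
idx = pd.IndexSlice
by_indexslice = df.loc[idx[:, [202, 203]], :]

# Boolean mask built from the values of level A.
mask = df.index.get_level_values("A").isin([202, 203])
by_mask = df[mask]

print(by_loc)
```

All three selections return the same rows; which one to use is mostly a matter of taste, though the mask form works regardless of whether the index is sorted.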

answered Sep 18 '22 09:09

rogueleaderr
