How can I create a DataFrame slice object piece by piece?

Tags: python, pandas

I want to be able to achieve the same result with something like the following bit of code, where I specify each criterion one by one. It's also important that I can use a slice_list to allow dynamic behaviour, i.e. the syntax should work whether there are two, three or ten different criteria in the slice_list.

slice_1 = 'foo'
slice_2 = ':'
slice_list = [slice_1, slice_2]

column_slice = "'A':'B'"
print df.loc[idx[slice_list], idx[column_slice]]

asked Mar 23 '17 16:03

bluprince13

2 Answers

You can achieve this using the built-in slice function. You can't build slices from strings, because inside a string ':' is a literal character and not a syntactic one.

import pandas as pd

idx = pd.IndexSlice

slice_1 = 'foo'
slice_2 = slice(None)           # equivalent to ':'
column_slice = slice('A', 'B')  # equivalent to 'A':'B'
df.loc[idx[slice_1, slice_2], idx[column_slice]]
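
To cover the dynamic slice_list part of the question, one possible approach is to keep the slice objects in an ordinary list and convert it to a tuple when indexing, since idx[x, y] just returns the tuple (x, y). The sketch below is a minimal example under assumptions: the sample df, its labels and its values are made up for illustration, and the row index is sorted, which label slicing on a MultiIndex needs.

```python
import numpy as np
import pandas as pd

idx = pd.IndexSlice

# Hypothetical sample data: a sorted two-level row index and columns A-D
index = pd.MultiIndex.from_product([["bar", "foo", "qux"], ["a", "b"]])
df = pd.DataFrame(np.arange(24).reshape(6, 4), index=index, columns=list("ABCD"))

# One entry per index level; slice(None) plays the role of ':'
slice_list = ["foo", slice(None)]
column_slice = slice("A", "B")

# idx[x, y] simply returns the tuple (x, y), so a list converts directly
result = df.loc[tuple(slice_list), column_slice]
print(result)
```

Because slice_list is a plain list, it can be built or extended at runtime with one entry per index level before it is converted to a tuple.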

answered Oct 02 '22 10:10

Ted Petrou

You might have to build your "slice lists" a little differently than you intended, but here's a relatively compact method using df.merge() and df.loc[]:

# Build a "query" dataframe
slice_df = pd.DataFrame(index=[['foo','qux','qux'],['a','a','b']])
# Explicitly name columns
column_slice = ['A','B']

slice_df.merge(df, left_index=True, right_index=True, how='inner').loc[:, column_slice]

Out[]: 
              A         B
foo a  0.442302 -0.949298
qux a  0.425645 -0.233174
    b -0.041416  0.229281

This method also requires you to be explicit about your second index and columns, unfortunately. But computers are great at making long tedious lists for you if you ask nicely.
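
For anyone who wants to try the merge approach end to end, here is a small self-contained sketch. The sample df (integer values, unnamed index levels) is an assumption made for illustration, so its numbers differ from the output shown above. Both indexes are left unnamed on purpose so that their level names match when they are joined.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data with an unnamed two-level row index
index = pd.MultiIndex.from_product([["bar", "foo", "qux"], ["a", "b"]])
df = pd.DataFrame(np.arange(24).reshape(6, 4), index=index, columns=list("ABCD"))

# "Query" dataframe: no columns, only the index rows we want to keep
slice_list = [["foo", "qux", "qux"], ["a", "a", "b"]]
slice_df = pd.DataFrame(index=pd.MultiIndex.from_arrays(slice_list))
column_slice = ["A", "B"]

# The inner merge on the indexes keeps only rows present in both frames
result = slice_df.merge(
    df, left_index=True, right_index=True, how="inner"
).loc[:, column_slice]
print(result)
```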

EDIT - Example of a method to dynamically build a slice list that could be used as above.

Here's a function that takes a dataframe and returns a list that can then be used to create a "query" dataframe to slice the original by. It only works with dataframes whose index has 1 or 2 levels. Let me know if that's an issue.

import ast

def make_df_slice_list(df):
    if df.index.nlevels == 1:
        # Only one level of index: ask about each label in turn
        slice_list = []
        for dex in df.index.unique():
            if input("DF index: " + str(dex) + " - Include? Y/N: ") == "Y":
                # Add to slice list
                slice_list.append(dex)
    else:
        # Multi level: one list per index level (only the first two are filled)
        slice_list = [[] for _ in range(df.index.nlevels)]
        for i in df.index.levels[0]:
            subindexes = list(df.loc[i].index)
            print("DF index:", i, "has subindexes:", subindexes)
            # input() returns a string in Python 3, so parse it as a list literal
            raw = input("Enter a the indexes you'd like as a list: ")
            sublist = ast.literal_eval(raw) if raw.strip() else []
            # If no response, use the first entry
            if len(sublist) == 0:
                sublist = [subindexes[0]]
            # Repeat the first-level label once for each sub item passed
            slice_list[0].extend([i] * len(sublist))
            # Add each of the second index list items
            slice_list[1].extend(sublist)
    return slice_list

I'm not advising this as a way to communicate with your user, just an example. At the Y/N prompts, type Y or N; at the list prompts, type a list of strings (["a","b"]) or an empty list []. Example:

In [115]: slice_list = make_df_slice_list(df)

DF index: foo has subindexes: ['a', 'b']
Enter a the indexes you'd like as a list: []
DF index: qux has subindexes: ['a', 'b']
Enter a the indexes you'd like as a list: ['a','b']

In [116]: slice_list
Out[116]: [['foo', 'qux', 'qux'], ['a', 'a', 'b']]

# Back to my original solution, but now passing the list:
slice_df = pd.DataFrame(index=slice_list)
column_slice = ['A','B']

slice_df.merge(df, left_index=True, right_index=True, how='inner').loc[:, column_slice]
Out[117]: 
              A         B
foo a -0.249547  0.056414
qux a  0.938710 -0.202213
    b  0.329136 -0.465999
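
If the selection is already known in code, the same two parallel lists can be built without any prompts. The helper below is a hypothetical sketch: build_slice_list and the selection mapping are names invented here, not part of the function above. It takes a dict that maps each first-level label to the second-level labels to keep.

```python
def build_slice_list(selection):
    """Expand {first_level: [second_level, ...]} into two parallel lists."""
    first_level, second_level = [], []
    for label, sublabels in selection.items():
        # Repeat the first-level label once per selected second-level label
        first_level.extend([label] * len(sublabels))
        second_level.extend(sublabels)
    return [first_level, second_level]


# Hypothetical selection: keep ('foo', 'a'), ('qux', 'a') and ('qux', 'b')
selection = {"foo": ["a"], "qux": ["a", "b"]}
slice_list = build_slice_list(selection)
print(slice_list)  # [['foo', 'qux', 'qux'], ['a', 'a', 'b']]
```

The resulting list has the same shape as the one returned by make_df_slice_list, so it can be passed to pd.DataFrame(index=slice_list) in the same way.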

answered Oct 02 '22 10:10

Jammeth_Q
