Pandas DataFrame: complete spec for getitem()? [closed]

Short version

For pandas Dataframe.__getitem__(), what are the allowed inputs (input types really), and what results does the function produce as a result?

Details

Description of problem

I would like to write code that makes full use of DataFrame[], essentially Dataframe.__getitem__(). To that end, I would like information on inputs/return results, at the level of detail found on the API page, though not available there for this method.

What has been done so far to solve it

I looked for a complete spec for that function at the Pandas API page. Though many other methods are documented, Dataframe.__getitem__() is not.

I also looked at the tutorial, but I don't believe that's attempting to be exhaustive.

I did look at the source code for Dataframe.__getitem__() (first pass at this described in my own answer below). It's evident that a variety of quite different types can be accepted as input, but reverse engineering the code to determine what happens when each of those types is passed seems like it can't be the intended way to master this method.

Additional background

Pandas is one of the most important libraries in Python's role in science and statistics, DataFrame is arguably the most central object in Pandas, and the [] operator is arguably the most central method on DataFrame. Hence, actually answering the question I have posted here has a very high pedagogical value, not just some utility for me.

928

asked Jan 18 '15 12:01

gwideman

1 Answers

I'm suspecting part of the lack of doc for this function is due to lack of doc comments in the source, now that I look at it. In case nobody comes up with anything more user-friendly, here's the actual DataFrame.__getitem__() method:

def __getitem__(self, key):

    # shortcut if we are an actual column
    is_mi_columns = isinstance(self.columns, MultiIndex)
    try:
        if key in self.columns and not is_mi_columns:
            return self._getitem_column(key)
    except:
        pass

    # see if we can slice the rows
    indexer = _convert_to_index_sliceable(self, key)
    if indexer is not None:
        return self._getitem_slice(indexer)

    if isinstance(key, (Series, np.ndarray, list)):
        # either boolean or fancy integer index
        return self._getitem_array(key)
    elif isinstance(key, DataFrame):
        return self._getitem_frame(key)
    elif is_mi_columns:
        return self._getitem_multilevel(key)
    else:
        return self._getitem_column(key)

... which at least gives a top-level breakdown of the kinds of key (index) that DataFrame[] accepts.

answered Sep 19 '22 08:09

gwideman

Related questions
                            
                                Capturing Mac OS X System Audio output with Python
                            
                                append versus resize for numpy array
                            
                                Click on a javascript link within python?
                            
                                Gracefully Terminating Python Threads
                            
                                Python equivalent of find2perl
                            
                                How to portably parse the (Unicode) degree symbol with regular expressions?
                            
                                How to determine CPU and memory cost of a function?
                            
                                Deciding and implementing a trending algorithm in Django
                            
                                Dynamically create plots in Chaco
                            
                                PyCharm, Django: zero code coverage
                            
                                finding the area of a closed 2d uniform cubic B-spline
                            
                                SQLAlchemy and explicit locking
                            
                                Suppressing printout of "Exception ... ignored" message in Python 3
                            
                                Python requests "certificate verify failed"
                            
                                Why is Python's list comprehension loop order backwards? [duplicate]
                            
                                Testing authentication in Django Rest Framework Views -- Cannot authenticate when testing
                            
                                Is Twisted's Deferred the same as a Promise in JavaScript?
                            
                                django - inline - Search for existing record instead of adding a new one
                            
                                Models inside tests - Django 1.7 issue
                            
                                How to return str from MySQL using mysql.connector?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas DataFrame: complete spec for getitem()? [closed]

Tags:

python

indexing

pandas

dataframe

Short version

Details

Description of problem

What has been done so far to solve it

Additional background

gwideman

People also ask

1 Answers

gwideman

Recent Activity

Donate For Us

Pandas DataFrame: complete spec for __getitem__()? [closed]

Tags:

python

indexing

pandas

dataframe

Short version

Details

Description of problem

What has been done so far to solve it

Additional background

gwideman

People also ask

1 Answers

gwideman

Related questions

Recent Activity

Donate For Us

Pandas DataFrame: complete spec for getitem()? [closed]