Could pandas use column as index?

People also ask

How do you make a column an index in pandas?

To create an index, from a column, in Pandas dataframe you use the set_index() method. For example, if you want the column “Year” to be index you type <code>df. set_index(“Year”)</code>. Now, the set_index() method will return the modified dataframe as a result.

Is the index a column pandas?

Rows and columns both have indexes, rows indices are called as index and for columns its general column names. Pandas have three data structures dataframe, series & panel. We mostly use dataframe and series and they both use indexes, which make them very convenient to analyse.

How do you set an index for a column?

One simple way to set an index to a column is by assigning an index as a new column to DataFrame. DataFrame. index property returns a Series object of an index, assign this to DataFrame as shown below which creates a new column with index value on DataFrame.

How do I reference a column by index in pandas?

Use DataFrame. loc[] and DataFrame. iloc[] to select a single column or multiple columns from pandas DataFrame by column names/label or index position respectively. where loc[] is used with column labels/names and iloc[] is used with column index/position.

Yes, with set_index you can make Locality your row index.

data.set_index('Locality', inplace=True)

If inplace=True is not provided, set_index returns the modified dataframe as a result.

Example:

> import pandas as pd
> df = pd.DataFrame([['ABBOTSFORD', 427000, 448000],
                     ['ABERFELDIE', 534000, 600000]],
                    columns=['Locality', 2005, 2006])

> df
     Locality    2005    2006
0  ABBOTSFORD  427000  448000
1  ABERFELDIE  534000  600000

> df.set_index('Locality', inplace=True)
> df
              2005    2006
Locality                  
ABBOTSFORD  427000  448000
ABERFELDIE  534000  600000

> df.loc['ABBOTSFORD']
2005    427000
2006    448000
Name: ABBOTSFORD, dtype: int64

> df.loc['ABBOTSFORD'][2005]
427000

> df.loc['ABBOTSFORD'].values
array([427000, 448000])

> df.loc['ABBOTSFORD'].tolist()
[427000, 448000]

You can change the index as explained already using set_index. You don't need to manually swap rows with columns, there is a transpose (data.T) method in pandas that does it for you:

> df = pd.DataFrame([['ABBOTSFORD', 427000, 448000],
                    ['ABERFELDIE', 534000, 600000]],
                    columns=['Locality', 2005, 2006])

> newdf = df.set_index('Locality').T
> newdf

Locality    ABBOTSFORD  ABERFELDIE
2005        427000      534000
2006        448000      600000

then you can fetch the dataframe column values and transform them to a list:

> newdf['ABBOTSFORD'].values.tolist()

[427000, 448000]

You can set the column index using index_col parameter available while reading from spreadsheet in Pandas.

Here is my solution:

Firstly, import pandas as pd: import pandas as pd
Read in filename using pd.read_excel() (if you have your data in a spreadsheet) and set the index to 'Locality' by specifying the index_col parameter.

df = pd.read_excel('testexcel.xlsx', index_col=0)

At this stage if you get a 'no module named xlrd' error, install it using pip install xlrd.
For visual inspection, read the dataframe using df.head() which will print the following output
Now you can fetch the values of the desired columns of the dataframe and print it

Another simple approach is to assign the column to the data frame index

data = {
  'Locality': ['ABBOTSFORD', 'ABERFELDIE', 'AIREYS INLET'],
  '2005': [427000, 534000, 459000 ],
  '2006': [448000, 448000, 448000],
  '2007': [602500, 602500, 602500],
  '2008': [600000, 710000, 517500],
  '2009': [638500, 775000, 512500]
}

df = pd.DataFrame(data)

# set the locality column as the index
df.index = df['Locality']

And if you no longer want the Locality column as a column, you can just drop it

df.drop('Locality', axis=1)

You'll end up with


              | 2005     | 2006   | 2007   | 2008   | 2009
Locality      |-------------------------------------------              
ABBOTSFORD    | 427000   | 448000 | 602500 | 600000 | 638500
ABERFELDIE    | 534000   | 448000 | 602500 | 710000 | 775000
AIREYS INLET  | 459000   | 448000 | 602500 | 517500 | 512500

Related questions
                            
                                Django-DB-Migrations: cannot ALTER TABLE because it has pending trigger events
                            
                                How to display the value of the bar on each bar with pyplot.barh()
                            
                                how to return index of a sorted list? [duplicate]
                            
                                How do I combine two dataframes?
                            
                                How to duplicate sys.stdout to a log file?
                            
                                Why does @foo.setter in Python not work for me?
                            
                                Python + Django page redirect
                            
                                Installing SciPy and NumPy using pip
                            
                                Evenly distributing n points on a sphere
                            
                                Drop columns whose name contains a specific string from pandas DataFrame
                            
                                Google Colab: how to read data from my google drive?
                            
                                Python, Unicode, and the Windows console
                            
                                What is python's site-packages directory?
                            
                                RuntimeWarning: numpy.dtype size changed, may indicate binary incompatibility
                            
                                Why do two identical lists have a different memory footprint?
                            
                                How to find a Python package's dependencies
                            
                                Send file using POST from a Python script
                            
                                How to get the PATH environment-variable separator in Python?
                            
                                Importing class from another file [duplicate]
                            
                                Python Pandas merge only certain columns

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Could pandas use column as index?

Tags:

python

pandas

excel

People also ask

Recent Activity

Donate For Us