Reading Excel into a Python DataFrame starting from row 5 and including headers

How do I import Excel data into a DataFrame in Python?

The Excel workbook runs some VBA on opening, which refreshes a pivot table and performs a few other steps.

I then want to import the results of the pivot table refresh into a DataFrame in Python for further analysis.

    import xlrd

    # Raw string so the backslashes in the Windows path are not treated as escapes
    wb = xlrd.open_workbook(r'C:\Users\cb\Machine_Learning\cMap_Joins.xlsm')

    # sheet names
    print(wb.sheet_names())

    # number of sheets
    print(wb.nsheets)

Refreshing and opening the file works fine. But how do I select the data from the first sheet, starting at row 5 (the header row) and going down to the last record n?

asked Jul 09 '13 12:07

IcemanBerlin

1 Answer

You can use pandas' ExcelFile parse method to read Excel sheets, see io docs:

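    import pandas as pd

    # Raw string so the backslashes in the Windows path are not treated as escapes
    xls = pd.ExcelFile(r'C:\Users\cb\Machine_Learning\cMap_Joins.xlsm')
    df = xls.parse('Sheet1', skiprows=4, index_col=None, na_values=['NA'])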

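skiprows=4 ignores the first 4 rows, so reading starts at the fifth row (zero-based row index 4), which is then used as the header. parse also accepts several other options, such as index_col and na_values.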
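The same idea can be sketched with pd.read_excel, which takes the same skiprows argument. The helper name read_sheet_from_row and its header_row parameter are made up for illustration, and the sketch assumes the openpyxl package is installed, since it is the engine pandas uses for .xlsx and .xlsm files.

```python
import pandas as pd


def read_sheet_from_row(path, sheet_name=0, header_row=5):
    """Read one sheet whose header sits on a given 1-based Excel row.

    Hypothetical helper: rows above header_row are skipped, and
    header_row itself supplies the column names.
    """
    return pd.read_excel(
        path,
        sheet_name=sheet_name,    # 0 means the first sheet; a name also works
        skiprows=header_row - 1,  # Excel row 5 -> skip the first 4 rows
        engine="openpyxl",        # assumption: openpyxl is installed
    )
```

With the workbook from the question, a call might look like read_sheet_from_row(r'C:\Users\cb\Machine_Learning\cMap_Joins.xlsm', sheet_name='Sheet1'). Note that pandas does not run the VBA; it only reads the values stored in the file, so the refreshed pivot table has to be saved before loading.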

answered Sep 21 '22 17:09

Andy Hayden

Tags:

python

import

pandas

excel
