I am using pandas to read an excel file. It doesn't have column name but it continues to read the first row as the column name. Following is the excel file that is being read. <pre class="prettyprint"><code>data1 0.994676 data2 0.994588 data3 0.99488 data4 0.994483 data5 0.994312 data6 0.993823 data7 0.993575 data8 0.994231 data9 0.993838 data10 0.994007 data11 0.994328 data12 0.993503 data13 0.99342 data14 0.992729 data15 0.993013 data16 0.993049 data17 0.993133 data18 0.99262 </code></pre> I'm reading the 2nd column using the following code. import pandas as pd <pre class="prettyprint"><code>df=pd.ExcelFile('C:/Users/JohnDoe/Desktop/080718_output.xlsx', header=None, index_col=False).parse('Data_sheet') y=df.iloc[0:17,1] </code></pre> The following is the y. <pre class="prettyprint"><code>In[38]:y Out[38]: 0 0.994588 1 0.994880 2 0.994483 3 0.994312 4 0.993823 5 0.993575 6 0.994231 7 0.993838 8 0.994007 9 0.994328 10 0.993503 11 0.993420 12 0.992729 13 0.993013 14 0.993049 15 0.993133 16 0.992620 Name: 0.994676, dtype: float64 </code></pre> It skips the first data because the first row is being used as a column name.. Any idea on how I can improve this? Edit: 'header=False' to 'header=None'. Both cases give the same outcome.

You can use <code>read_excel</code> with <code>header=None</code> for default columns with <code>rangeIndex</code>: <pre class="prettyprint"><code>df = pd.read_excel('file.xlsx', sheet_name ='Data_sheet', header=None, index_col=False) </code></pre>

Create a column header variable and call that in your excel read in statement as well as stating header=None <pre class="prettyprint"><code>names=['Column1','Column2'] df=pd.read_excel(r"/Users/JohnDoe/Desktop/080718_output.xlsx",header=None,names=names) </code></pre>

Pandas: Reading excel files when the first row is NOT the column name Excel Files

Tags:

python-3.x

pandas

I am using pandas to read an excel file. It doesn't have column name but it continues to read the first row as the column name.

Following is the excel file that is being read.

data1   0.994676
data2   0.994588
data3   0.99488
data4   0.994483
data5   0.994312
data6   0.993823
data7   0.993575
data8   0.994231
data9   0.993838
data10  0.994007
data11  0.994328
data12  0.993503
data13  0.99342
data14  0.992729
data15  0.993013
data16  0.993049
data17  0.993133
data18  0.99262

I'm reading the 2nd column using the following code. import pandas as pd

df=pd.ExcelFile('C:/Users/JohnDoe/Desktop/080718_output.xlsx', header=None, index_col=False).parse('Data_sheet')
y=df.iloc[0:17,1]

The following is the y.

In[38]:y
Out[38]: 
0     0.994588
1     0.994880
2     0.994483
3     0.994312
4     0.993823
5     0.993575
6     0.994231
7     0.993838
8     0.994007
9     0.994328
10    0.993503
11    0.993420
12    0.992729
13    0.993013
14    0.993049
15    0.993133
16    0.992620
Name: 0.994676, dtype: float64

It skips the first data because the first row is being used as a column name.. Any idea on how I can improve this?

Edit: 'header=False' to 'header=None'. Both cases give the same outcome.

599

asked Aug 07 '18 18:08

user7852656

2 Answers

You can use read_excel with header=None for default columns with rangeIndex:

df = pd.read_excel('file.xlsx', 
                   sheet_name ='Data_sheet', 
                   header=None, 
                   index_col=False)

140

answered Oct 12 '22 11:10

jezrael

Create a column header variable and call that in your excel read in statement as well as stating header=None

names=['Column1','Column2']
df=pd.read_excel(r"/Users/JohnDoe/Desktop/080718_output.xlsx",header=None,names=names)

answered Oct 12 '22 13:10

Bram van Hout

Related questions
                            
                                Python joining list elements in a tricky way
                            
                                How do I initialize a Counter from a list of key/initial counts pairs?
                            
                                Increasing the font size for just certain cells/one notebook in Jupyter notebook
                            
                                Creating a custom widget in PyQT5
                            
                                error in loading pickle
                            
                                Plot different columns of different DataFrame in the same plot with Pandas
                            
                                How to read bytes object from csv?
                            
                                Make tkinter buttons the same size
                            
                                Spyder3 Python IDE does not start: "This Windows version does not support the required Bluetooth API"
                            
                                Binance API call with SHA56 and Python requests
                            
                                Pandas split dataframe into multiple when condition is true
                            
                                UserWarning: Implicit dimension choice for log_softmax has been deprecated
                            
                                OpenCV live stream video over socket in Python 3
                            
                                Tensorflow error : unsupported callable
                            
                                Finding items between 2 dates using boto3 and dynamodb scan
                            
                                Using next() on generator function
                            
                                `shutil.rmtree` does not work on `tempfile.TemporaryDirectory()`
                            
                                Message object has no attribute 'server'
                            
                                Change seaborn pair plot figure size
                            
                                Python get decimal number from float64 in a dataframe

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With