My excel sheet: <pre class="prettyprint"><code> A B 1 first second 2 3 4 x y 5 z j </code></pre> Python code: <pre class="prettyprint"><code>df = pd.read_excel (filename, parse_cols=1) </code></pre> return a correct output: <pre class="prettyprint"><code> first second 0 NaN NaN 1 NaN NaN 2 x y 3 z j </code></pre> If i want work only with second column <pre class="prettyprint"><code>df = pd.read_excel (filename, parse_cols=[1]) </code></pre> return: <pre class="prettyprint"><code> second 0 y 1 j </code></pre> I'd have information about empty excel rows (NaN in my df) even if I work only with a specific column. If output loose NaN information it's not ok, for example, for skiprows paramater, etc Thanks

For me works parameter <code>skip_blank_lines=False</code>: <pre class="prettyprint"><code>df = pd.read_excel ('test.xlsx', parse_cols=1, skip_blank_lines=False) print (df) A B 0 first second 1 NaN NaN 2 NaN NaN 3 x y 4 z j </code></pre> Or if need omit first row: <pre class="prettyprint"><code>df = pd.read_excel ('test.xlsx', parse_cols=1, skiprows=1, skip_blank_lines=False) print (df) first second 0 NaN NaN 1 NaN NaN 2 x y 3 z j </code></pre>

Python Pandas read_excel doesn't recognize null cell

Tags:

python

pandas

excel

nan

My excel sheet:

   A   B  
1 first second
2
3 
4  x   y  
5  z   j

Python code:

df = pd.read_excel (filename, parse_cols=1)

return a correct output:

  first second
0 NaN   NaN
1 NaN   NaN
2 x     y
3 z     j

If i want work only with second column

df = pd.read_excel (filename, parse_cols=[1])

return:

 second
0  y
1  j

I'd have information about empty excel rows (NaN in my df) even if I work only with a specific column. If output loose NaN information it's not ok, for example, for skiprows paramater, etc

Thanks

672

asked Sep 05 '16 16:09

franco_b

1 Answers

For me works parameter skip_blank_lines=False:

df = pd.read_excel ('test.xlsx', 
                     parse_cols=1, 
                     skip_blank_lines=False)
print (df)

       A       B
0  first  second
1    NaN     NaN
2    NaN     NaN
3      x       y
4      z       j

Or if need omit first row:

df = pd.read_excel ('test.xlsx', 
                     parse_cols=1, 
                     skiprows=1,
                     skip_blank_lines=False)
print (df)

  first second
0   NaN    NaN
1   NaN    NaN
2     x      y
3     z      j

100

answered Sep 19 '22 22:09

jezrael

Related questions
                            
                                Test isolation broken with multiple databases in Django. How to fix it?
                            
                                Splitting duplicates into separate table - Pandas
                            
                                default() method in Python
                            
                                Getting all attributes to appear on python's `__dict__` method
                            
                                how to find the index for a quantile
                            
                                How to center text horizontally in a Kivy text input?
                            
                                Image to text python
                            
                                Is `if x:` completely equivalent to `if bool(x) is True:`?
                            
                                Named string format arguments in Python
                            
                                How to filter data from a data frame when the number of columns are dynamic?
                            
                                How can I capture a key press (key logging) in Linux?
                            
                                what are the differences between import and extends in Flask?
                            
                                Execute flask-SQLAlchemy subquery
                            
                                How to put a JSON file's content in a response
                            
                                List comprehension works but not for loop––why?
                            
                                Finding the area of intersection of multiple overlapping rectangles in Python
                            
                                Opening a gzip file in python Apache Beam
                            
                                Do locally set Cython compiler directives affect one or all functions?
                            
                                additional column when saving pandas data frame to csv file
                            
                                Pandas Dataframe Line Plot: Show Random Markers

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With