I want to print all data (all rows) of a specific column in python using <code>openpyxl</code> I am working in this way; <pre class="prettyprint"><code>from openpyxl import load_workbook workbook = load_workbook('----------/dataset.xlsx') sheet = workbook.active for i in sheet: print(sheet.cell(row=i, column=2).value) </code></pre> But it gives <blockquote> if row < 1 or column < 1: TypeError: unorderable types: tuple() < int() </blockquote> Because i am iterating in <code>row=i</code>. If I use <code>sheet.cell(row=4, column=2).value</code> it print the value of cell. But how can I iterate over all document? Edit 1 On some research, it is found that data can be get using Sheet Name. The <code>Sheet 1</code> exists in the <code>.xlsx</code> file but its data is not printing. Any problem in this code? <pre class="prettyprint"><code>workbook = load_workbook('---------------/dataset.xlsx') print(workbook.get_sheet_names()) worksheet =workbook.get_sheet_by_name('Sheet1') c=2 for i in worksheet: d = worksheet.cell(row=c, column=2) if(d.value is None): return else: print(d.value) c=c+1 </code></pre>

Read the OpenPyXL Documentation Iteration over all <code>worksheets</code> in a <code>workbook</code>, for instance: <pre class="prettyprint"><code>for n, sheet in enumerate(wb.worksheets): print('Sheet Index:[{}], Title:{}'.format(n, sheet.title)) </code></pre> <blockquote> Output: <pre class="prettyprint"><code>Sheet Index:[0], Title: Sheet Sheet Index:[1], Title: Sheet1 Sheet Index:[2], Title: Sheet2 </code></pre> </blockquote> <hr> Iteration over all <code>rows</code> and <code>columns</code> in one Worksheet: <pre class="prettyprint"><code>worksheet = workbook.get_sheet_by_name('Sheet') for row_cells in worksheet.iter_rows(): for cell in row_cells: print('%s: cell.value=%s' % (cell, cell.value) ) </code></pre> Output: <pre class="prettyprint"><code><Cell Sheet.A1>: cell.value=²234 <Cell Sheet.B1>: cell.value=12.5 <Cell Sheet.C1>: cell.value=C1 <Cell Sheet.D1>: cell.value=D1 <Cell Sheet.A2>: cell.value=1234 <Cell Sheet.B2>: cell.value=8.2 <Cell Sheet.C2>: cell.value=C2 <Cell Sheet.D2>: cell.value=D2 </code></pre> <hr> Iteration over all <code>columns</code> of one <code>row</code>, for instance <code>row==2</code>: <pre class="prettyprint"><code>for row_cells in worksheet.iter_rows(min_row=2, max_row=2): for cell in row_cells: print('%s: cell.value=%s' % (cell, cell.value) ) </code></pre> Output: <pre class="prettyprint"><code><Cell Sheet.A2>: cell.value=1234 <Cell Sheet.B2>: cell.value=8.2 <Cell Sheet.C2>: cell.value=C2 <Cell Sheet.D2>: cell.value=D2 </code></pre> <hr> Iteration over all <code>rows</code>, only <code>column</code> 2: <pre class="prettyprint"><code>for col_cells in worksheet.iter_cols(min_col=2, max_col=2): for cell in col_cells: print('%s: cell.value=%s' % (cell, cell.value)) </code></pre> Output: <pre class="prettyprint"><code><Cell Sheet.B1>: cell.value=12.5 <Cell Sheet.B2>: cell.value=8.2 <Cell Sheet.B3>: cell.value=9.8 <Cell Sheet.B4>: cell.value=10.1 <Cell Sheet.B5>: cell.value=7.7 </code></pre> Tested with Python:3.4.2 - openpyxl:2.4.1 - LibreOffice: 4.3.3.2

Iterate over Worksheets, Rows, Columns

Q: How do I iterate through a python function in Excel?

The openpyxl module allows a Python program to read and modify Excel files. We will be using this excel worksheet in the below examples: Approach #1: We will create an object of openpyxl, and then we'll iterate through all rows from top to bottom.

Tags:

python

openpyxl

I want to print all data (all rows) of a specific column in python using openpyxl I am working in this way;

from openpyxl import load_workbook
workbook = load_workbook('----------/dataset.xlsx')
sheet = workbook.active  
for i in sheet:
   print(sheet.cell(row=i, column=2).value)

But it gives

if row < 1 or column < 1: TypeError: unorderable types: tuple() < int()

Because i am iterating in row=i. If I use sheet.cell(row=4, column=2).value it print the value of cell. But how can I iterate over all document?

Edit 1

On some research, it is found that data can be get using Sheet Name. The Sheet 1 exists in the .xlsx file but its data is not printing. Any problem in this code?

workbook = load_workbook('---------------/dataset.xlsx')
print(workbook.get_sheet_names())
worksheet =workbook.get_sheet_by_name('Sheet1')
c=2
for i in worksheet: 
    d = worksheet.cell(row=c, column=2)
    if(d.value is None):
        return
    else:
        print(d.value)
    c=c+1

392

asked Mar 23 '17 11:03

Humty

2 Answers

Read the OpenPyXL Documentation

Iteration over all worksheets in a workbook, for instance:

for n, sheet in enumerate(wb.worksheets):
    print('Sheet Index:[{}], Title:{}'.format(n, sheet.title))

Output:

Sheet Index:[0], Title: Sheet    
Sheet Index:[1], Title: Sheet1    
Sheet Index:[2], Title: Sheet2

Iteration over all rows and columns in one Worksheet:

worksheet = workbook.get_sheet_by_name('Sheet')

for row_cells in worksheet.iter_rows():
    for cell in row_cells:
       print('%s: cell.value=%s' % (cell, cell.value) )

Output:

<Cell Sheet.A1>: cell.value=²234
<Cell Sheet.B1>: cell.value=12.5
<Cell Sheet.C1>: cell.value=C1
<Cell Sheet.D1>: cell.value=D1
<Cell Sheet.A2>: cell.value=1234
<Cell Sheet.B2>: cell.value=8.2
<Cell Sheet.C2>: cell.value=C2
<Cell Sheet.D2>: cell.value=D2

Iteration over all columns of one row, for instance row==2:

for row_cells in worksheet.iter_rows(min_row=2, max_row=2):
    for cell in row_cells:
        print('%s: cell.value=%s' % (cell, cell.value) )

Output:

<Cell Sheet.A2>: cell.value=1234  
<Cell Sheet.B2>: cell.value=8.2  
<Cell Sheet.C2>: cell.value=C2  
<Cell Sheet.D2>: cell.value=D2

Iteration over all rows, only column 2:

for col_cells in worksheet.iter_cols(min_col=2, max_col=2):
    for cell in col_cells:
        print('%s: cell.value=%s' % (cell, cell.value))

Output:

<Cell Sheet.B1>: cell.value=12.5
<Cell Sheet.B2>: cell.value=8.2
<Cell Sheet.B3>: cell.value=9.8
<Cell Sheet.B4>: cell.value=10.1
<Cell Sheet.B5>: cell.value=7.7

Tested with Python:3.4.2 - openpyxl:2.4.1 - LibreOffice: 4.3.3.2

107

answered Oct 14 '22 08:10

stovfl

Try this,

from openpyxl import load_workbook
workbook = load_workbook('----------/dataset.xlsx')
sheet = workbook.active
row_count = sheet.max_row
for i in range(row_count):
   print(sheet.cell(row=i, column=2).value)

answered Oct 14 '22 09:10

Chanda Korat

Related questions
                            
                                Python filter function - single result [duplicate]
                            
                                How to show minor tick labels on log-scale with Matplotlib
                            
                                Python regex AttributeError: 'NoneType' object has no attribute 'group'
                            
                                faster alternative to numpy.where?
                            
                                Pandas usecols all except last
                            
                                ImageFont IO error: cannot open resource
                            
                                Pandas df.describe() , is it possible to do it by row without transposing?
                            
                                Create a post activate script in Conda [duplicate]
                            
                                Tensorflow: How to get all variables from rnn_cell.BasicLSTM & rnn_cell.MultiRNNCell
                            
                                How do I pass command line arguments to Python from VS in Debug mode?
                            
                                Check if user is logged in with Flask-Login in template
                            
                                Change working directory of console in PyCharm
                            
                                Openpyxl auto-height row
                            
                                Trying to migrate in Django 1.9 -- strange SQL error "django.db.utils.OperationalError: near ")": syntax error"
                            
                                "Almost Equal" in Jasmine
                            
                                Items vs item loaders in scrapy
                            
                                Cannot import urllib in Python
                            
                                Pandas dataframe.to_html() - add background color to header
                            
                                Getting whole user timeline of a Twitter user
                            
                                How to remove a row from pandas dataframe based on the length of the column values?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Iterate over Worksheets, Rows, Columns

Tags:

python

openpyxl

Humty

People also ask

2 Answers

stovfl

Chanda Korat

Recent Activity

Donate For Us