How do I find the formatting for a subset of text in an Excel document cell?

1 Answer

Thanks to @Vyassa for all of the right pointers, I've been able to write the following code, which iterates over the rows of an XLS file and prints style information for the cells in one column. It handles cells with a "single" style (e.g., the whole cell is italic) as well as cells with style "segments" (e.g., part of the cell is italic and part of it is not).

import xlrd

# Column 'C' in this example (column indexes are zero-based)
COL_IDX = 2

# formatting_info=True is required to load fonts and rich-text runs (.xls only)
book = xlrd.open_workbook('your-file.xls', formatting_info=True)
first_sheet = book.sheet_by_index(0)

for row_idx in range(first_sheet.nrows):
    text_cell = first_sheet.cell_value(row_idx, COL_IDX)
    text_cell_xf = book.xf_list[first_sheet.cell_xf_index(row_idx, COL_IDX)]

    # skip rows where the cell is empty
    if not text_cell:
        continue
    print(text_cell, end=' ')

    # list of (offset, font_index) pairs, or None if the cell has a single style
    text_cell_runlist = first_sheet.rich_text_runlist_map.get((row_idx, COL_IDX))
    if text_cell_runlist:
        print('(cell multi style) SEGMENTS:')
        segments = []
        for segment_idx, (start, font_idx) in enumerate(text_cell_runlist):
            # the last segment runs from 'start' to the end of the string
            end = None
            if segment_idx != len(text_cell_runlist) - 1:
                end = text_cell_runlist[segment_idx + 1][0]
            segments.append({
                'text': text_cell[start:end],
                'font': book.font_list[font_idx],
            })
        # runs did not start at offset 0: assume the leading text is styled as the cell
        if text_cell_runlist[0][0] != 0:
            segments.insert(0, {
                'text': text_cell[:text_cell_runlist[0][0]],
                'font': book.font_list[text_cell_xf.font_index],
            })

        for segment in segments:
            print(segment['text'], end=' ')
            print('italic:', segment['font'].italic, end=' ')
            print('bold:', segment['font'].bold)

    else:
        cell_font = book.font_list[text_cell_xf.font_index]
        print('(cell single style)', end=' ')
        print('italic:', cell_font.italic, end=' ')
        print('bold:', cell_font.bold)
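
The segment-splitting logic above can also be pulled out into a small function that does not touch the workbook, which makes it easier to check on its own. The following is only a sketch: `split_runs` is a hypothetical helper name, and the offsets and font indexes in the example call are made up for illustration. It assumes the run list has the same shape as the values in `rich_text_runlist_map`, i.e. a list of `(offset, font_index)` pairs.

```python
def split_runs(text, runlist, cell_font_index):
    """Split text into (substring, font_index) pairs.

    runlist: list of (offset, font_index) pairs, in the shape stored in
    sheet.rich_text_runlist_map, or None/empty for a single-style cell.
    cell_font_index: font index of the cell's own XF record, assumed to
    apply to any text that precedes the first run.
    """
    if not runlist:
        # no rich-text runs: the whole cell uses the cell's font
        return [(text, cell_font_index)]

    segments = []
    first_offset = runlist[0][0]
    if first_offset > 0:
        # text before the first run is styled as the cell itself
        segments.append((text[:first_offset], cell_font_index))

    for idx, (start, font_index) in enumerate(runlist):
        # each run ends where the next one starts; the last run ends the string
        end = runlist[idx + 1][0] if idx + 1 < len(runlist) else None
        segments.append((text[start:end], font_index))
    return segments


# hypothetical data: font indexes 4 and 5 are placeholders, not real fonts
print(split_runs("plain italic bold", [(6, 4), (13, 5)], 0))
```

Each returned font index can then be looked up in `book.font_list`, as in the loop above, to read attributes such as `italic` and `bold`.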


answered Sep 21 '22 02:09

Greg Sadetsky


Tags:

python

xlrd

westmark
