I'm trying to write data to an excel file that includes Japanese characters. I'm using codec.open() to get the data, and that seems to work fine, but I run into this error when I try to write the data: <pre class="prettyprint"><code>UnicodeEncodeError: 'ascii' codec can't encode characters in position 16-17: ordinal not in range(128) </code></pre> I don't understand why the program would be insisting on using ascii here. When I created a new workbook object, I did so using <pre class="prettyprint"><code>wb = xlwt.Workbook(encoding='utf-8') </code></pre> and both the program file itself and the file it's reading in are saved as UTF-8. Anybody have any ideas? EDIT: Here's a link to the xlwt package. http://pypi.python.org/pypi/xlwt

As suggested by this question, setting the encoding on the WorkBook <pre class="prettyprint"><code>wb = xlwt.Workbook(encoding='latin-1') </code></pre> should also resolve the issue (it worked for me).

Has anyone been able to write out UTF-8 characters using python's xlwt?

Tags:

python

utf-8

multibyte

xlwt

I'm trying to write data to an excel file that includes Japanese characters. I'm using codec.open() to get the data, and that seems to work fine, but I run into this error when I try to write the data:

UnicodeEncodeError: 'ascii' codec can't encode characters in position 16-17: ordinal not in range(128)

I don't understand why the program would be insisting on using ascii here. When I created a new workbook object, I did so using

wb = xlwt.Workbook(encoding='utf-8')

and both the program file itself and the file it's reading in are saved as UTF-8.

Anybody have any ideas?

EDIT: Here's a link to the xlwt package. http://pypi.python.org/pypi/xlwt

664

asked Aug 25 '11 02:08

StormShadow

2 Answers

In an Excel 97-2003 XLS file, each piece of text is encoded in latin1 if that is possible, otherwise UTF-16LE, with a flag to show which. To do that, xlwt nees a unicode object. If the caller supplies a str object, xlwt will attempt to decode it using the encoding specified in the Workbook() call (default is ascii).

This works; try running the following short script and open the resultant file with Excel.

import xlwt
wb = xlwt.Workbook(encoding="UTF-8")
uc = u"".join(unichr(0x0410 + i) for i in xrange(32)) # some Cyrillic characters
u8 = uc.encode("UTF-8")
ws = wb.add_sheet("demo")
ws.write(0, 0, uc)
ws.write(1, 0, u8)
ws.write(2, 0, xlwt.Formula("A1=A2"))
ws.write(3, 0, "ASCII is a subset of UTF-8")
wb.save("xlwt_write_utf8.xls")

The fact that you are getting an encode error, not a decode error, indicates a possible problem in the file input part of your script. Please supply the shortest possible script that causes the error that you are getting. The script should contain something like print repr(your_utf8_text) immediately prior to the failing statement, so that we can see exactly what the text data is. Please include the full error message and the full traceback, and the contents (print repr(contents)) of your very short input file.

124

answered Oct 19 '22 01:10

John Machin

As suggested by this question, setting the encoding on the WorkBook

wb = xlwt.Workbook(encoding='latin-1')

should also resolve the issue (it worked for me).

answered Oct 19 '22 00:10

Troy

Related questions
                            
                                Python - print "x" "y" times when passed in a list of [x,y]
                            
                                Python: Using a dictionary as switch not working
                            
                                How to add a Delete button to django.forms generated form?
                            
                                python: sys.argv[0] meaning in official documentation
                            
                                Django: Convert a POST request parameters to query string
                            
                                Overloading + to support tuples
                            
                                print values in pdb
                            
                                How do I remove south from a django project
                            
                                Python failing to encode bad unicode to ascii
                            
                                ImportError: No module named mysql.base, in django project on Ubuntu 11.04 server
                            
                                Why does PIL thumbnail not resizing correctly?
                            
                                Windows progress bar in python's Tkinter
                            
                                Django forms want to auto-save user, client and datetime
                            
                                Python pickle crash when trying to return default value in __getattr__
                            
                                Can't dump or write an ElementTree element
                            
                                convert ahk to python
                            
                                All but the last N elements of iterator in Python
                            
                                Check string for numbers in Python
                            
                                Using a global flag for python RegExp compile
                            
                                python regex - what does - (dash) mean

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With