Python: How do I force iso-8859-1 file output?

Tags:

character-encoding

How do I force Latin-1 (which I guess means iso-8859-1?) file output in Python?

Here's my code at the moment. It works, but trying to import the resulting output file into a Latin-1 MySQL table produces weird encoding errors.

outputFile = file( "textbase.tab", "w" )
for k, v in textData.iteritems():
    complete_line = k + '~~~~~' + v + '~~~~~' + " ENDOFTHELINE"
    outputFile.write(complete_line)
    outputFile.write( "\n" )
outputFile.close()

The resulting output file seems to be saved in "Western (Mac OS Roman)", but if I then save it in Latin-1, I still get strange encoding problems. How can I make sure that the strings used, and the file itself, are all encoded in Latin-1 as soon as they are generated?

The original strings (in the textData dictionary) have been parsed in from an RTF file - I don't know if that makes a difference.

I'm a bit new to Python and to encoding generally, so apologies if this is a dumb question. I have tried looking at the docs but haven't got very far.

I'm using Python 2.6.1.


asked Feb 03 '10 12:02

AP257

2 Answers

Simply use the codecs module for writing the file:

import codecs
outputFile = codecs.open("textbase.tab", "w", "ISO-8859-1")

Of course, the strings you write have to be Unicode strings (type unicode); plain str objects, which are basically just arrays of bytes, will not be converted. I guess you are reading the RTF file with the normal Python file object as well, so you might have to switch that to codecs.open too.
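A minimal sketch of the point above, applied to the loop from the question. It assumes the parsed RTF strings may arrive as byte strings in Mac OS Roman; that is only a guess based on the question, so adjust SOURCE_ENCODING to whatever the input really uses. The helper names to_text and write_latin1 are hypothetical.

```python
import codecs

# Assumption: byte strings from the RTF parser are Mac OS Roman.
SOURCE_ENCODING = "mac_roman"


def to_text(value, encoding=SOURCE_ENCODING):
    """Return value as a Unicode string, decoding it first if it is bytes."""
    if isinstance(value, bytes):
        return value.decode(encoding)
    return value


def write_latin1(path, text_data):
    """Write each key/value pair as one ISO-8859-1 encoded line."""
    with codecs.open(path, "w", "ISO-8859-1") as output_file:
        for key, value in text_data.items():
            line = to_text(key) + u"~~~~~" + to_text(value) + u"~~~~~ ENDOFTHELINE\n"
            output_file.write(line)
```

Decoding once at the boundary means everything written to the file is Unicode, so the codec can encode it to Latin-1 instead of passing raw bytes through.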


answered Oct 11 '22 17:10

Torsten Marek

For me, io.open works a bit faster on Python 2.7 for writes, and an order of magnitude faster for reads:

import io
with io.open("textbase.tab", "w", encoding="ISO-8859-1") as outputFile:
    ...

In Python 3, you can just pass the encoding keyword argument to the built-in open.
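As a rough illustration of the Python 3 form, the sketch below writes lines as ISO-8859-1 with the built-in open. The function name save_latin1 and its errors parameter are made up for this example. With the default errors="strict", a character that Latin-1 cannot represent (such as the euro sign) raises UnicodeEncodeError; errors="replace" substitutes "?" instead.

```python
def save_latin1(path, lines, errors="strict"):
    """Write an iterable of str lines to path, encoded as ISO-8859-1."""
    # newline="\n" keeps line endings identical on every platform.
    with open(path, "w", encoding="iso-8859-1", errors=errors, newline="\n") as output_file:
        for line in lines:
            output_file.write(line + "\n")
```

Checking the file with open(path, "rb") afterwards is a quick way to confirm the bytes on disk are what the database import expects.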

answered Oct 11 '22 17:10

beardc
