Suppose I have a mysterious unicode string in Python (2.7) that I want to feed to a command line program such as imagemagick (or really just get it out of Python in any way). The strings might be: <ul> <li>Adolfo López Mateos</li> <li>Stanisława Walasiewicz</li> <li>Jörgen Jönsson</li> </ul> So in Python I might make a little command like this: <pre class="prettyprint"><code>cmd = u'convert -pointsize 24 label:"%s" "%s.png"' % (name, name) </code></pre> If I just print <code>cmd</code> and get <code>convert -pointsize 24 label:"Jörgen Jönsson" "Jörgen Jönsson.png"</code> and then run it myself, everything is fine. <ul> <li>Adolfo López Mateos.png </li> <li>example 1 http://4u.jeffcrouse.info/stackoverflow/A-01.png</li> <li>Stanisława Walasiewicz.png </li> <li>example 2 http://4u.jeffcrouse.info/stackoverflow/A-02.png</li> </ul> But if I do <code>os.system( cmd )</code>, I get this: <ul> <li>Adolfo L√≥pez Mateos.png</li> <li>example 4 http://4u.jeffcrouse.info/stackoverflow/B-01.png</li> <li>Stanis≈Çawa Walasiewicz.png</li> <li>example 5 http://4u.jeffcrouse.info/stackoverflow/B-02.png</li> </ul> I know it's not an imagemagick problem because the filenames are messed up too. I know that Python is converting the command to ascii when it passes it off to os.system, but why is it getting the encoding so wrong? Why is it interpreting each non-ASCII character as 2 characters? According to a few articles that I've read, it might be because it's encoded as latin-1 but it's being read as utf-8, but I've tried encoding it back and forth between them and it's not helping. I get Unicode exceptions when I try to just encode it manually as ascii without a replacement argument, but if I do name.encode('ascii','xmlcharrefreplace'), I get the following: <ul> <li>example 4 http://4u.jeffcrouse.info/stackoverflow/C-01.png</li> <li>example 5 http://4u.jeffcrouse.info/stackoverflow/C-02.png</li> </ul> I'm hoping that someone recognizes this particular kind of encoding problem and can offer some advice, because I'm about out of ideas. Thanks!

Use subprocess.call instead: <pre class="prettyprint"><code>>>> s = u'Jörgen Jönsson' >>> import subprocess >>> subprocess.call(['echo', s]) Jörgen Jönsson 0 </code></pre>

Python: unicode in system commands

Tags:

unicode

Suppose I have a mysterious unicode string in Python (2.7) that I want to feed to a command line program such as imagemagick (or really just get it out of Python in any way). The strings might be:

Adolfo López Mateos
Stanisława Walasiewicz
Jörgen Jönsson

So in Python I might make a little command like this:

Click to copy

cmd = u'convert -pointsize 24 label:"%s" "%s.png"' % (name, name)

If I just print cmd and get convert -pointsize 24 label:"Jörgen Jönsson" "Jörgen Jönsson.png" and then run it myself, everything is fine.

Adolfo López Mateos.png
example 1 http://4u.jeffcrouse.info/stackoverflow/A-01.png
Stanisława Walasiewicz.png
example 2 http://4u.jeffcrouse.info/stackoverflow/A-02.png

But if I do os.system( cmd ), I get this:

Adolfo L√≥pez Mateos.png
example 4 http://4u.jeffcrouse.info/stackoverflow/B-01.png
Stanis≈Çawa Walasiewicz.png
example 5 http://4u.jeffcrouse.info/stackoverflow/B-02.png

I know it's not an imagemagick problem because the filenames are messed up too. I know that Python is converting the command to ascii when it passes it off to os.system, but why is it getting the encoding so wrong? Why is it interpreting each non-ASCII character as 2 characters? According to a few articles that I've read, it might be because it's encoded as latin-1 but it's being read as utf-8, but I've tried encoding it back and forth between them and it's not helping.

I get Unicode exceptions when I try to just encode it manually as ascii without a replacement argument, but if I do name.encode('ascii','xmlcharrefreplace'), I get the following:

example 4 http://4u.jeffcrouse.info/stackoverflow/C-01.png
example 5 http://4u.jeffcrouse.info/stackoverflow/C-02.png

I'm hoping that someone recognizes this particular kind of encoding problem and can offer some advice, because I'm about out of ideas.

Thanks!

396

asked Jan 11 '13 23:01

jefftimesten

1 Answers

Use subprocess.call instead:

Click to copy

>>> s = u'Jörgen Jönsson'
>>> import subprocess
>>> subprocess.call(['echo', s])
Jörgen Jönsson
0

129

answered Oct 24 '22 08:10

jterrace

Related questions
                            
                                The Requests streaming example does not work in my environment
                            
                                Python requests - saving cookie for later url usage
                            
                                Handle multiple socket connections
                            
                                Python 3 - reading text from a file
                            
                                tkinter/py2app created application doesn't show window on initial launch
                            
                                What is an elegant way to select all non-None elements from parameters and place them in a python dictionary?
                            
                                Recompile MacPort's version of MacVim with Python, Ruby & Perl [closed]
                            
                                ignoring directories in os.walk()?
                            
                                Read FORTRAN formatted numbers with Python
                            
                                Password protect a whole django app
                            
                                Calculate daily sums using python pandas
                            
                                Python regular expression for Beautiful Soup
                            
                                What is the proper way to comment code in Python?
                            
                                what does the comma mean in python's unpack?
                            
                                Solving linear system over integers with numpy
                            
                                copy netcdf file using python
                            
                                Reading DBF files with pyodbc
                            
                                How to pass the fields parameter into a google drive python API call
                            
                                Python - Sorting elements in a list of lists
                            
                                lxml truncates text that contains 'less than' character

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python: unicode in system commands

Tags:

python

encoding

unicode

jefftimesten

People also ask

1 Answers

jterrace

Recent Activity

Donate For Us