Replace newlines in a Unicode string

Tags:

python

unicode

google-app-engine

I am trying to replace newline characters in a unicode string and seem to be missing some magic codes.

My particular case: I am working on App Engine and trying to store titles from HTML pages in a db.StringProperty() in my model.

So I do something like:

link.title = unicode(page_title, "utf-8").replace('\n', '').replace('\r', '')

and I get:

Property title is not multi-line

Are there other codes I should be using for the replace?

asked Feb 04 '10 17:02

Jackson Miller

2 Answers

Try ''.join(unicode(page_title, 'utf-8').splitlines()). splitlines() lets the standard library handle all of the possible Unicode line breaks; joining the resulting pieces with the empty string then gives you a single-line version.
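As a rough illustration of the splitlines() approach, here is a minimal sketch written for Python 3, where str is already Unicode and bytes.decode() takes the place of unicode(page_title, 'utf-8'). The function name to_single_line and the sample title are made up for this example; they are not part of App Engine or the original code.

```python
def to_single_line(raw_title, encoding="utf-8"):
    """Return raw_title as text with every line break removed.

    to_single_line is a hypothetical helper name used only for this sketch.
    """
    # Decode bytes to text first; text input is used as-is.
    if isinstance(raw_title, bytes):
        text = raw_title.decode(encoding)
    else:
        text = raw_title
    # splitlines() cuts at every line boundary Python recognizes;
    # joining with the empty string glues the pieces back together.
    return "".join(text.splitlines())


# Made-up sample containing CRLF, U+2028 and U+0085 line breaks.
page_title = "My\r\nPage\u2028Title\x85".encode("utf-8")
single_line_title = to_single_line(page_title)
print(single_line_title)
```

If the words in the title should stay separated, joining with " " instead of "" may be the better choice.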

answered Oct 14 '22 01:10

Hank Gay

Python uses these characters for splitting in unicode.splitlines():

U+000A LINE FEED (\n)
U+000D CARRIAGE RETURN (\r)
U+001C FILE SEPARATOR
U+001D GROUP SEPARATOR
U+001E RECORD SEPARATOR
U+0085 NEXT LINE
U+2028 LINE SEPARATOR
U+2029 PARAGRAPH SEPARATOR

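As Hank says, using splitlines() will let Python take care of all of the details for you, but if you need to do it manually, this list is the place to start. Note that newer Python versions (3.2 and later) also treat U+000B LINE TABULATION (\v) and U+000C FORM FEED (\f) as line boundaries, so check the documentation for the version you are running.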
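If you do want the manual route, one possible sketch (Python 3 syntax) is to build a translation table from the characters listed above and delete them in a single pass. The names LINE_BREAKS, REMOVE_TABLE and remove_line_breaks are invented for this example.

```python
# The eight characters from the list above (hypothetical constant name).
# Python 3.2+ also splits on "\x0b" and "\x0c"; add them here if needed.
LINE_BREAKS = [
    "\n",      # U+000A LINE FEED
    "\r",      # U+000D CARRIAGE RETURN
    "\x1c",    # U+001C FILE SEPARATOR
    "\x1d",    # U+001D GROUP SEPARATOR
    "\x1e",    # U+001E RECORD SEPARATOR
    "\x85",    # U+0085 NEXT LINE
    "\u2028",  # U+2028 LINE SEPARATOR
    "\u2029",  # U+2029 PARAGRAPH SEPARATOR
]

# Mapping a code point to None tells str.translate() to delete it.
REMOVE_TABLE = {ord(ch): None for ch in LINE_BREAKS}


def remove_line_breaks(text):
    """Delete every character in LINE_BREAKS from text."""
    return text.translate(REMOVE_TABLE)


print(remove_line_breaks("a\nb\r\nc\x1cd\u2029e"))
```

Compared with chaining several .replace() calls, the table keeps the full set of characters in one place and walks the string only once.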

answered Oct 14 '22 03:10

Ian Clelland
