How do I remove the last character of an R-T-L string in python?

Tags: python, string, unicode, right-to-left

I am trying to remove the last character of a string in a right-to-left language. When I do, however, the character that is now last appears to wrap around to the beginning of the string, e.g. ותֵיהֶם]׃ becomes ותֵיהֶם]

I know that this is a fundamental issue with how I'm handling the R-T-L paradigm, but if someone could help me think through it, I'd very much appreciate it.

CODE

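import io

with io.open(r"file.txt", "r", encoding="utf-8") as f:
    for line in f:
        # The second tab-separated field holds the Hebrew text
        the_text = line.split('\t')[1]
        # Strings are immutable: replace() returns a new string, so keep the result
        the_text = the_text.replace(u'\u05C3', '')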


asked Oct 25 '12 22:10

swasheck

1 Answer

Some characters in Unicode are always LTR, some are always RTL, and some (neutral characters, which include most punctuation and brackets) can be either, depending on their surrounding context. In addition, the display context for bidirectional text has a "predominant" directionality. For example, a text editor configured for mainly-English text would be predominantly LTR and have a ragged right margin, while one configured for mainly-Hebrew text would be predominantly RTL with a ragged left margin.
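To see which category each character falls into, you can ask Python's standard `unicodedata` module for its bidirectional class. This is only a small sketch: the helper name `bidi_classes` and the sample string are made up for illustration, and the string is written with escapes so that the source code itself is not reordered by your editor.

```python
import unicodedata

def bidi_classes(text):
    """Return (code point, bidi class) pairs in logical (storage) order."""
    return [("U+%04X" % ord(ch), unicodedata.bidirectional(ch)) for ch in text]

# Hypothetical sample: Latin "a", Hebrew alef, "]", Hebrew sof pasuq
sample = u"a\u05D0]\u05C3"
classes = bidi_classes(sample)

for code_point, bidi in classes:
    # "L" and "R" are strong classes; "ON" (other neutral) takes its
    # direction from the surrounding text
    print("%s %s" % (code_point, bidi))
```

The bracket reports `ON`, which is why its placement on screen depends on its neighbours, while the sof pasuq reports `R`.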

It looks like what has happened here is that when a closing square bracket appears between two RTL characters, it is rendered in its RTL form (your first example). When it appears between an RTL and an LTR character, or at the end of the string (basically, anywhere it doesn't have characters of the same directionality on both sides), it is treated as part of whichever run of text matches the predominant direction. If you drag your mouse over the string to select the characters, you'll see that logically the closing ] still follows the ֶם even if visually it appears to have moved.
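You can confirm the logical order without relying on how the string is displayed by inspecting the code points directly. The sketch below assumes the string from the question consists of the code points shown (my reading of the example, written as escapes); `drop_last` is a hypothetical helper name.

```python
def drop_last(text):
    """Remove the final code point of a string, in logical order."""
    return text[:-1]

# Assumed code points for the question's example, ending in "]" + sof pasuq
text = u"\u05D5\u05EA\u05B5\u05D9\u05D4\u05B6\u05DD]\u05C3"
trimmed = drop_last(text)

# Logical order is unaffected by display direction
print("U+%04X" % ord(text[-1]))      # last character before trimming
print("U+%04X" % ord(trimmed[-1]))   # last character after trimming
print("U+%04X" % ord(trimmed[-2]))   # the character the bracket follows
```

After trimming, the bracket is the last character in the string and still directly follows the final mem (U+05DD), so the removal itself worked. Only the rendering changed.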

If the second-to-last character in your string were also a Hebrew character (or another strongly RTL character) rather than a ], or if the display context were predominantly RTL, then it would appear where you expect it to.
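If you cannot change the display context, one possible workaround (not something the question's code does) is to append U+200F RIGHT-TO-LEFT MARK, an invisible character with strong RTL directionality, so that the trailing bracket again has RTL characters on both sides. This is a sketch under that assumption; `with_rtl_anchor` is a made-up name, and whether the result looks right still depends on the renderer supporting the Unicode bidirectional algorithm.

```python
import unicodedata

RLM = u"\u200F"  # RIGHT-TO-LEFT MARK: invisible, strongly RTL

def with_rtl_anchor(text):
    """Return text with an RLM appended, for display purposes only."""
    return text + RLM

# Hypothetical short sample: he, segol, final mem, then the neutral "]"
stored = u"\u05D4\u05B6\u05DD]"
shown = with_rtl_anchor(stored)

# The bracket now sits between two strong RTL characters
print(unicodedata.bidirectional(shown[-3]))  # final mem
print(unicodedata.bidirectional(shown[-2]))  # the bracket itself
print(unicodedata.bidirectional(shown[-1]))  # the appended mark
```

Note that the mark is a real character in the string, so it is probably best added only at display time and left out of whatever you store or compare.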


answered Nov 14 '22 21:11

Ian Roberts

