Finding and removing Non-ASCII characters from an Oracle Varchar2

Tags:

We are currently migrating one of our oracle databases to UTF8 and we have found a few records that are near the 4000 byte varchar limit. When we try and migrate these record they fail as they contain characters that become multibyte UF8 characters. What I want to do within PL/SQL is locate these characters to see what they are and then either change them or remove them.

I would like to do :

SELECT REGEXP_REPLACE(COLUMN,'[^[:ascii:]],'')

but Oracle does not implement the [:ascii:] character class.

Is there a simple way doing what I want to do?

268

asked Feb 10 '10 11:02

Paul Gilfedder

2 Answers

I think this will do the trick:

SELECT REGEXP_REPLACE(COLUMN, '[^[:print:]]', '')

138

answered Oct 01 '22 13:10

Yuri Tkachenko

If you use the ASCIISTR function to convert the Unicode to literals of the form \nnnn, you can then use REGEXP_REPLACE to strip those literals out, like so...

UPDATE table SET field = REGEXP_REPLACE(ASCIISTR(field), '\\[[:xdigit:]]{4}', '')

...where field and table are your field and table names respectively.

answered Oct 01 '22 11:10

Robb Smith

Related questions
                            
                                filemtime "warning stat failed for"
                            
                                Regular expression in PostgreSQL LIKE clause
                            
                                How to negate the Groovy Match Operator?
                            
                                Regex replacement capture followed by digit
                            
                                how do I use f-string with regex in Python
                            
                                Why won't re.groups() give me anything for my one correctly-matched group?
                            
                                Return sql rows where field contains ONLY non-alphanumeric characters
                            
                                BeautifulSoup webscraping find_all( ): finding exact match
                            
                                Regex to find two words on the page
                            
                                get string between two strings with javascript [duplicate]
                            
                                Regular Expression to replace " {" with "(newline){" in xcode
                            
                                How to use sed to replace regex capture group?
                            
                                Regex capitalize first letter every word, also after a special character like a dash
                            
                                Difference between ".+" and ".+?"
                            
                                My Vim replace with a regex is throwing a `E488: Trailing characters`
                            
                                Regular expression to select all whitespace that isn't in quotes?
                            
                                Regular expressions (RegEx) and dplyr::filter()
                            
                                Regex to match an optional '+' symbol followed by any number of digits
                            
                                Javascript: highlight substring keeping original case but searching in case insensitive mode
                            
                                Guide on how to use regex in Nginx location block section?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Finding and removing Non-ASCII characters from an Oracle Varchar2

Tags:

regex

ascii

oracle

Paul Gilfedder

People also ask

2 Answers

Yuri Tkachenko

Robb Smith

Recent Activity

Donate For Us