I have a large table with phone numbers. The phone numbers are all strings and supposed to be '+9628789878' or similar. (a "+" sign followed by between 9 and 13 digits.) A user bug uncovered one row with the string '+987+9873678298'. Clearly it shouldn't be there and I'd like to find out how many other cases there are of this or other such errors. I tried this query but it's not doing the job. My thinking is anything that's not like this string. (Oh, the table is not indexed by phone_number.) <pre class="prettyprint"><code>SELECT user_key, first_name, last_name, phone_number FROM users u WHERE regexp_like(phone_number, '[^\+[0-9]*]') AND phone_number IS NOT NULL </code></pre>

If you need to find all the rows where <code>phone_number</code> is not made by exactly a <code>'+'</code> followed by 9-13 digits, this should do the work: <pre class="prettyprint"><code>select * from users where not regexp_like(phone_number, '^\+[0-9]{9,13}$') </code></pre> What it does: <ul> <li> <code>^</code> the beginning of the string, to avoid things like <code>'XX +123456789'</code> </li> <li> <code>\+</code> the '+'</li> <li> <code>[0-9]{9,13}</code> a sequence of 9-13 digits</li> <li> <code>$</code> the end of the string, to avoid strings like <code>'+123456789 XX'</code> </li> </ul> Another way, with no regexp, could be the following: <pre class="prettyprint"><code>where not ( /* strings of 10-14 chars */ length(phone_number) between 10 and 14 /* ... whose first is a + */ and substr(phone_number, 1, 1 ) = '+' /* ...and that become a '+' after removing all the digits */ and nvl(translate(phone_number, 'X0123456789', 'X'), '+') = '+' ) </code></pre> This could be faster than the regexp approach, even if it's based on more conditions, but I believe only a test will tell you which one is the best performing.

Not REGEXP_LIKE in Oracle

Tags:

regex

sql

oracle

I have a large table with phone numbers. The phone numbers are all strings and supposed to be '+9628789878' or similar. (a "+" sign followed by between 9 and 13 digits.)

A user bug uncovered one row with the string '+987+9873678298'. Clearly it shouldn't be there and I'd like to find out how many other cases there are of this or other such errors.

I tried this query but it's not doing the job. My thinking is anything that's not like this string. (Oh, the table is not indexed by phone_number.)

SELECT user_key,
       first_name,
       last_name,
       phone_number
FROM   users u
WHERE  regexp_like(phone_number, '[^\+[0-9]*]')
AND    phone_number IS NOT NULL

407

asked Mar 01 '17 15:03

SAR622

1 Answers

If you need to find all the rows where phone_number is not made by exactly a '+' followed by 9-13 digits, this should do the work:

select *
from users 
where not regexp_like(phone_number, '^\+[0-9]{9,13}$')

What it does:

^ the beginning of the string, to avoid things like 'XX +123456789'
\+ the '+'
[0-9]{9,13} a sequence of 9-13 digits
$ the end of the string, to avoid strings like '+123456789 XX'

Another way, with no regexp, could be the following:

where not (
                /* strings of 10-14 chars */
                length(phone_number) between 10 and 14 
                /* ... whose first is a + */
            and substr(phone_number, 1, 1 ) = '+' 
                /* ...and that become a '+' after removing all the digits */
            and nvl(translate(phone_number, 'X0123456789', 'X'), '+') = '+' 
          )

This could be faster than the regexp approach, even if it's based on more conditions, but I believe only a test will tell you which one is the best performing.

answered Sep 23 '22 06:09

Aleksej

Related questions
                            
                                sp_send_dbmail: FROM_ADDRESS
                            
                                Linq - Select Date from DateTime
                            
                                Split words with a capital letter in sql
                            
                                H2 - How to create a database trigger that log a row change to another table?
                            
                                How to get column names from a query in SQL Server
                            
                                How do I write a query that outputs the row number as a column?
                            
                                Select * from table1 that does not exist in table2 with conditional
                            
                                Recovery after wrong MySQL update query?
                            
                                SQL Server: use parameter in CREATE DATABASE
                            
                                MYSQL count related records one query
                            
                                Using a HAVING clause in an UPDATE statement
                            
                                SQL Server Update with group by
                            
                                Case Statement on INNER Join
                            
                                Getting two counts and then dividing them
                            
                                sqlite get name of attached databases
                            
                                How to find what foreign key references an index on table
                            
                                Powershell SQL SELECT output to variable
                            
                                The object name contains more than the maximum number of prefixes. The maximum is 3
                            
                                Sql server update multiple columns from another table
                            
                                Which type of binding does PL/SQL use?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With