For simple things is it better to use the <code>translate</code> function on the premise that it is less CPU intensive or is <code>regexp_replace</code> the way to go? This question comes forth from How can I replace brackets to hyphens within Oracle REGEXP_REPLACE function?

I think you're running into simple optimization. The regexp expression is so expensive to compute that the result is cached in the hope that it will be used again in the future. If you actually use distinct strings to convert, you will see that the modest translate is naturally faster because it is its specialized function. Here's my example, running on <code>11.1.0.7.0</code>: <pre class="prettyprint"><code>SQL> DECLARE 2 TYPE t IS TABLE OF VARCHAR2(4000); 3 l t; 4 l_level NUMBER := 1000; 5 l_time TIMESTAMP; 6 l_char VARCHAR2(4000); 7 BEGIN 8 -- init 9 EXECUTE IMMEDIATE 'ALTER SESSION SET PLSQL_OPTIMIZE_LEVEL=2'; 10 SELECT dbms_random.STRING('p', 2000) 11 BULK COLLECT 12 INTO l FROM dual 13 CONNECT BY LEVEL <= l_level; 14 -- regex 15 l_time := systimestamp; 16 FOR i IN 1 .. l.count LOOP 17 l_char := regexp_replace(l(i), '[]()[]', '-', 1, 0); 18 END LOOP; 19 dbms_output.put_line('regex :' || (systimestamp - l_time)); 20 -- tranlate 21 l_time := systimestamp; 22 FOR i IN 1 .. l.count LOOP 23 l_char := translate(l(i), '()[]', '----'); 24 END LOOP; 25 dbms_output.put_line('translate :' || (systimestamp - l_time)); 26 END; 27 / regex :+000000000 00:00:00.979305000 translate :+000000000 00:00:00.238773000 PL/SQL procedure successfully completed </code></pre> on <code>11.2.0.3.0</code> : <pre class="prettyprint"><code>regex :+000000000 00:00:00.617290000 translate :+000000000 00:00:00.138205000 </code></pre> Conclusion: In general I suspect <code>translate</code> will win.

For SQL, I tested this with the following script: <pre class="prettyprint"><code>set timing on select sum(length(x)) from ( select translate('(<FIO>)', '()[]', '----') x from ( select * from dual connect by level <= 2000000 ) ); select sum(length(x)) from ( select regexp_replace('[(<FIO>)]', '[\(\)\[]|\]', '-', 1, 0) x from ( select * from dual connect by level <= 2000000 ) ); </code></pre> and found that the performance of <code>translate</code> and <code>regexp_replace</code> were almost always the same, but it could be that the cost of the other operations is overwhelming the cost of the functions I'm trying to test. Next, I tried a PL/SQL version: <pre class="prettyprint"><code>set timing on declare x varchar2(100); begin for i in 1..2500000 loop x := translate('(<FIO>)', '()[]', '----'); end loop; end; / declare x varchar2(100); begin for i in 1..2500000 loop x := regexp_replace('[(<FIO>)]', '[\(\)\[]|\]', '-', 1, 0); end loop; end; / </code></pre> Here the <code>translate</code> version takes just under 10 seconds, while the <code>regexp_replace</code> version around 0.2 seconds -- around 2 orders of magnitude faster(!) Based on this result, I will be using regular expressions much more often in my performance critical code -- both SQL and PL/SQL.

Performance of regexp_replace vs translate in Oracle?

2 Answers

I think you're running into simple optimization. The regexp expression is so expensive to compute that the result is cached in the hope that it will be used again in the future. If you actually use distinct strings to convert, you will see that the modest translate is naturally faster because it is its specialized function.

Here's my example, running on 11.1.0.7.0:

SQL> DECLARE
  2     TYPE t IS TABLE OF VARCHAR2(4000);
  3     l       t;
  4     l_level NUMBER := 1000;
  5     l_time  TIMESTAMP;
  6     l_char  VARCHAR2(4000);
  7  BEGIN
  8     -- init
  9     EXECUTE IMMEDIATE 'ALTER SESSION SET PLSQL_OPTIMIZE_LEVEL=2';
 10     SELECT dbms_random.STRING('p', 2000)
 11       BULK COLLECT
 12       INTO l FROM dual
 13     CONNECT BY LEVEL <= l_level;
 14     -- regex
 15     l_time := systimestamp;
 16     FOR i IN 1 .. l.count LOOP
 17        l_char := regexp_replace(l(i), '[]()[]', '-', 1, 0);
 18     END LOOP;
 19     dbms_output.put_line('regex     :' || (systimestamp - l_time));
 20     -- tranlate
 21     l_time := systimestamp;
 22     FOR i IN 1 .. l.count LOOP
 23        l_char := translate(l(i), '()[]', '----');
 24     END LOOP;
 25     dbms_output.put_line('translate :' || (systimestamp - l_time));
 26  END;
 27  /

regex     :+000000000 00:00:00.979305000
translate :+000000000 00:00:00.238773000

PL/SQL procedure successfully completed

on 11.2.0.3.0 :

regex     :+000000000 00:00:00.617290000
translate :+000000000 00:00:00.138205000

Conclusion: In general I suspect translate will win.

165

answered Oct 02 '22 11:10

Vincent Malgrat

For SQL, I tested this with the following script:

set timing on

select sum(length(x)) from (
  select translate('(<FIO>)', '()[]', '----') x
  from (
    select *
    from dual
    connect by level <= 2000000
  )
);

select sum(length(x)) from (
  select regexp_replace('[(<FIO>)]', '[\(\)\[]|\]', '-', 1, 0) x
  from (
    select *
    from dual
    connect by level <= 2000000
  )
);

and found that the performance of translate and regexp_replace were almost always the same, but it could be that the cost of the other operations is overwhelming the cost of the functions I'm trying to test.

Next, I tried a PL/SQL version:

set timing on

declare
  x varchar2(100);
begin
  for i in 1..2500000 loop
    x := translate('(<FIO>)', '()[]', '----');
  end loop;
end;
/

declare
  x varchar2(100);
begin
  for i in 1..2500000 loop
    x := regexp_replace('[(<FIO>)]', '[\(\)\[]|\]', '-', 1, 0);
  end loop;
end;
/

Here the translate version takes just under 10 seconds, while the regexp_replace version around 0.2 seconds -- around 2 orders of magnitude faster(!)

Based on this result, I will be using regular expressions much more often in my performance critical code -- both SQL and PL/SQL.

answered Oct 02 '22 12:10

Colin 't Hart

Related questions
                            
                                Should I be using SQL transactions, while reading records?
                            
                                Multiple Alias names for a table
                            
                                SQL: Is it possible to 'group by' according to 'like' function's results?
                            
                                C# Prepared Statements - @ sign (at / strudel sign) queries
                            
                                Eliminate and reduce overlapping date ranges
                            
                                Fastest postgreSQL equivalent to MySQL UTC_DATE() (getting UTC date)?
                            
                                select mysql missing columns in php
                            
                                Group By Except For Certain Value
                            
                                Sum results from two select statements
                            
                                is "where (ParamID = @ParamID) OR (@ParamID = -1)" a good practice in sql selection
                            
                                Lock table while inserting
                            
                                How to reorder items in a table
                            
                                SQL select group query
                            
                                Role of selectivity in index scan/seek
                            
                                Most efficient way to save way points and do comparisons?
                            
                                SQL Server - returning xml child nodes for xml column
                            
                                How do I programmatically run a complex query on an as400?
                            
                                Postgres 9.2 PL/pgSQL simple update in loop
                            
                                How to use subquery into "from" clause in hibernate?
                            
                                Normalizing an extremely big table

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Performance of regexp_replace vs translate in Oracle?

Tags:

performance

regex

sql

oracle

plsql

Colin 't Hart

People also ask

2 Answers

Vincent Malgrat

Colin 't Hart

Recent Activity

Donate For Us