Why is a UDF so much slower than a subquery?

Tags:

I have a case where I need to translate (lookup) several values from the same table. The first way I wrote it, was using subqueries:

SELECT
    (SELECT id FROM user WHERE user_pk = created_by) AS creator,
    (SELECT id FROM user WHERE user_pk = updated_by) AS updater,
    (SELECT id FROM user WHERE user_pk = owned_by) AS owner,
    [name]
FROM asset

As I'm using this subquery a lot (that is, I have about 50 tables with these fields), and I might need to add some more code to the subquery (for example, "AND active = 1" ) I thought I'd put these into a user-defined function UDF and use that. But the performance using that UDF was abysmal.

CREATE FUNCTION dbo.get_user ( @user_pk INT )
RETURNS INT
AS BEGIN 
    RETURN ( SELECT id
             FROM   ice.dbo.[user]
             WHERE  user_pk = @user_pk )
END

SELECT dbo.get_user(created_by) as creator, [name]
FROM asset

The performance of #1 is less than 1 second. Performance of #2 is about 30 seconds...

Why, or more importantly, is there any way I can code in SQL server 2008, so that I don't have to use so many subqueries?

Edit:

Just a litte more explanation of when this is useful. This simple query (that is, get userid) gets a lot more complex when I want to have a text for a user, since I have to join with profile to get the language, with a company to see if the language should be fetch'ed from there instead, and with the translation table to get the translated text. And for most of these queries, performance is a secondary issue to readability and maintainability.

884

asked Feb 04 '09 10:02

devzero

1 Answers

As other posters have suggested, using joins will definitely give you the best overall performance.

However, since you've stated that that you don't want the headache of maintaining 50-ish similar joins or subqueries, try using an inline table-valued function as follows:

CREATE FUNCTION dbo.get_user_inline (@user_pk INT)
RETURNS TABLE AS
RETURN
(
    SELECT TOP 1 id
    FROM ice.dbo.[user]
    WHERE user_pk = @user_pk
        -- AND active = 1
)

Your original query would then become something like:

SELECT
    (SELECT TOP 1 id FROM dbo.get_user_inline(created_by)) AS creator,
    (SELECT TOP 1 id FROM dbo.get_user_inline(updated_by)) AS updater,
    (SELECT TOP 1 id FROM dbo.get_user_inline(owned_by)) AS owner,
    [name]
FROM asset

An inline table-valued function should have better performance than either a scalar function or a multistatement table-valued function.

The performance should be roughly equivalent to your original query, but any future changes can be made in the UDF, making it much more maintainable.

answered Oct 16 '22 07:10

LukeH

Related questions
                            
                                Make all store images the base, small and thumbnail images in Magento?
                            
                                Oracle SQL -- find the values NOT in a table
                            
                                Convert String to Clob in Java
                            
                                SQL join on junction table with many to many relation
                            
                                Why use "Y"/"N" instead of a bit field in Microsoft SQL Server?
                            
                                How much does wrapping inserts in a transaction help performance on Sql Server?
                            
                                Database VIEW does not reflect the data in the underying TABLE
                            
                                SQL: How can I update a value on a column only if that value is null?
                            
                                Does every table really need an auto-incrementing artificial primary key? [closed]
                            
                                concat two integers and result as string in SQL
                            
                                Microsoft SQL Server Management Studio - query result as text
                            
                                How to call Oracle MD5 hash function?
                            
                                How do I escape a literal question mark ('?') in a JDBC prepared statement
                            
                                Trigger Error: The current transaction cannot be committed and cannot support operations that write to the log file
                            
                                Please help me understand SQL vs C like programming?
                            
                                SQL: using WHERE AND instead of HAVING
                            
                                Using DATEADD in sqlalchemy
                            
                                SQL Server Creating a temp table for this query
                            
                                SQL date format convert? [dd.mm.yy to YYYY-MM-DD]
                            
                                sql select records that don't have relation in a second table

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why is a UDF so much slower than a subquery?

Tags:

performance

sql

sql-server

sql-server-2008

user-defined-functions

Edit:

devzero

People also ask

1 Answers

LukeH

Recent Activity

Donate For Us