Why are relational set-based queries better than cursors?

Tags:

When writing database queries in something like TSQL or PLSQL, we often have a choice of iterating over rows with a cursor to accomplish the task, or crafting a single SQL statement that does the same job all at once.

Also, we have the choice of simply pulling a large set of data back into our application and then processing it row by row, with C# or Java or PHP or whatever.

Why is it better to use set-based queries? What is the theory behind this choice? What is a good example of a cursor-based solution and its relational equivalent?

527

asked Aug 23 '08 12:08

Eric Z Beard

2 Answers

The main reason that I'm aware of is that set-based operations can be optimised by the engine by running them across multiple threads. For example, think of a quicksort - you can separate the list you're sorting into multiple "chunks" and sort each separately in their own thread. SQL engines can do similar things with huge amounts of data in one set-based query.

When you perform cursor-based operations, the engine can only run sequentially and the operation has to be single threaded.

200

answered Sep 21 '22 15:09

Matt Hamilton

Set based queries are (usually) faster because:

They have more information for the query optimizer to optimize
They can batch reads from disk
There's less logging involved for rollbacks, transaction logs, etc.
Less locks are taken, which decreases overhead
Set based logic is the focus of RDBMSs, so they've been heavily optimized for it (often, at the expense of procedural performance)

Pulling data out to the middle tier to process it can be useful, though, because it removes the processing overhead off the DB server (which is the hardest thing to scale, and is normally doing other things as well). Also, you normally don't have the same overheads (or benefits) in the middle tier. Things like transactional logging, built-in locking and blocking, etc. - sometimes these are necessary and useful, other times they're just a waste of resources.

A simple cursor with procedural logic vs. set based example (T-SQL) that will assign an area code based on the telephone exchange:

--Cursor DECLARE @phoneNumber char(7) DECLARE c CURSOR LOCAL FAST_FORWARD FOR    SELECT PhoneNumber FROM Customer WHERE AreaCode IS NULL OPEN c FETCH NEXT FROM c INTO @phoneNumber WHILE @@FETCH_STATUS = 0 BEGIN    DECLARE @exchange char(3), @areaCode char(3)    SELECT @exchange = LEFT(@phoneNumber, 3)     SELECT @areaCode = AreaCode     FROM AreaCode_Exchange     WHERE Exchange = @exchange     IF @areaCode IS NOT NULL BEGIN        UPDATE Customer SET AreaCode = @areaCode        WHERE CURRENT OF c    END    FETCH NEXT FROM c INTO @phoneNumber END CLOSE c DEALLOCATE c END  --Set UPDATE Customer SET     AreaCode = AreaCode_Exchange.AreaCode FROM Customer JOIN AreaCode_Exchange ON     LEFT(Customer.PhoneNumber, 3) = AreaCode_Exchange.Exchange WHERE     Customer.AreaCode IS NULL

answered Sep 21 '22 15:09

Mark Brackett

Related questions
                            
                                Creating a threaded private messaging system like facebook and gmail
                            
                                "Not supported for DML operations" with simple UPDATE query
                            
                                T-SQL Output Message During execution in SSMS
                            
                                SQL Server "cannot perform an aggregate function on an expression containing an aggregate or a subquery", but Sybase can
                            
                                Difference between different types of SQL? [closed]
                            
                                MySQL SELECT query string matching
                            
                                Maintaining Referential Integrity - Good or Bad?
                            
                                SQL Server - NOT IN
                            
                                Function with SQL query has no destination for result data
                            
                                Why does a query invoke a auto-flush in SQLAlchemy?
                            
                                Unknown column in 'having clause'
                            
                                How to get Return Value of a Stored Procedure
                            
                                insert multiple rows into DB2 database
                            
                                How to constraint no empty strings on an NVARCHAR column
                            
                                How can I detect a SQL table's existence in Java?
                            
                                Sql query for updating database if value is not null?
                            
                                how to convert numeric to nvarchar in sql command
                            
                                MYSQL Left Join how do I select NULL values?
                            
                                SQL only a throw inside if statement
                            
                                `show create table` equivalent in oracle sql

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why are relational set-based queries better than cursors?

Tags:

language-agnostic

sql

database-cursor

Eric Z Beard

People also ask

2 Answers

Matt Hamilton

Mark Brackett

Recent Activity

Donate For Us