In the process of trying to help out an app dev team with performance issues on a SQL 2000 server (from a bunch of Java applications on separate app servers), I ran a SQL trace and discovered that all calls to the database are full of API Server Cursor statements (sp_cursorprepexec, sp_cursorfetch, sp_cursorclose). Looks like they're specifying some connection string properties that force the use of server-side cursors, retrieving only 128 rows of data at a time: (From http://msdn.microsoft.com/en-us/library/Aa172588) <blockquote> When the API cursor attributes or properties are set to anything other than their defaults, the OLE DB provider for SQL Server and the SQL Server ODBC driver use API server cursors instead of default result sets. Each call to an API function that fetches rows generates a roundtrip to the server to fetch the rows from the API server cursor. </blockquote> UPDATE: The connection string at issue is a JDBC connection string parameter, <code>selectMethod=cursor</code> (which enables the server-side cursors we discussed above) vs the alternative <code>selectMethod=direct</code>. They have been using <code>selectMethod=cursor</code> as their standard connection string from all apps. From my DBA perspective, that's just annoying (it clutters the trace up with useless junk), and (I would speculate) is resulting in many extra app-to-SQL server round trips, reducing overall performance. They apparently did test changing (just one of about 60 different app connections) to <code>selectMethod=direct</code> but experienced some issues (of which I have no details) and are concerned about the application breaking. So, my questions are: <ul> <li>Can using <code>selectMethod=cursor</code> lower application performance, as I have tried to argue? (by increasing the number of round trips necessary on a SQL server that already has a very high queries/sec)</li> <li>Is <code>selectMethod=</code> an application-transparent setting on a JDBC connection? Could this break their app if we change it? </li> <li>More generally, when should you use <code>cursor</code> vs <code>direct</code>?</li> </ul> Also cross-posted to SF. EDIT: Received actual technical details that warrant a significant edit to title, question, and tags. EDIT: Added bounty. Also added bounty to the SF question (this question is focused on application behavior, the SF question is focused on SQL performance.) Thanks!!

Briefly, <ol> <li> <code>selectMethod=cursor</code> <ul> <li>theoretically requires more server-side resources than <code>selectMethod=direct</code> </li> <li>only loads at most batch-size records into client memory at once, resulting in a more predictable client memory footprint</li> </ul> </li> <li> <code>selectMethod=direct</code> <ul> <li>theoretically requires less server-side resources than <code>selectMethod=cursor</code> </li> <li>will read the entire result set into client memory (unless the driver natively supports asynchronous result set retrieval) before the client application can iterate over it; this can reduce performance in two ways: <ol> <li>reduced performance with large result sets if the client application is written in such a way as to stop processing after traversing only a fraction of the result set (with <code>direct</code> it has already paid the cost of retrieving data it will essentially throw away; with <code>cursor</code> the waste is limited to at most batch-size - 1 rows -- the early termination condition should probably be recoded in SQL anyway e.g. as <code>SELECT TOP</code> or window functions)</li> <li>reduced performance with large result sets because of potential garbage collection and/or out-of-memory issues associated with an increased memory footprint</li> </ol> </li> </ul> </li> </ol> In summary, <ul> <li> Can using <code>selectMethod=cursor</code> lower application performance? -- either method can lower performance, for different reasons. Past a certain resultset size, <code>cursor</code> may still be preferable. See below for when to use one or the other</li> <li> Is <code>selectMethod=</code> an application-transparent setting on a JDBC connection? -- it is transparent, but it can still break their app if memory usage grows significantly enough to hog their client system (and, correspondingly, your server) or crash the client altogether</li> <li> More generally, when should you use <code>cursor</code> vs <code>direct</code>? -- I personally use <code>cursor</code> when dealing with potentially large or otherwise unbounded result sets. The roundtrip overhead is then amortized given a large enough batch size, and my client memory footprint is predictable. I use <code>direct</code> when the size of the result set I expect is known to be inferior to whatever batch size I use with <code>cursor</code>, or bound in some way, or when memory is not an issue.</li> </ul>

JDBC connection to very busy SQL 2000: selectMethod=cursor vs selectMethod=direct?

Tags:

sql-server-2000

jdbc

database-cursor

selectmethod

In the process of trying to help out an app dev team with performance issues on a SQL 2000 server (from a bunch of Java applications on separate app servers), I ran a SQL trace and discovered that all calls to the database are full of API Server Cursor statements (sp_cursorprepexec, sp_cursorfetch, sp_cursorclose).

Looks like they're specifying some connection string properties that force the use of server-side cursors, retrieving only 128 rows of data at a time: (From http://msdn.microsoft.com/en-us/library/Aa172588)

When the API cursor attributes or properties are set to anything other than their defaults, the OLE DB provider for SQL Server and the SQL Server ODBC driver use API server cursors instead of default result sets. Each call to an API function that fetches rows generates a roundtrip to the server to fetch the rows from the API server cursor.

UPDATE: The connection string at issue is a JDBC connection string parameter, selectMethod=cursor (which enables the server-side cursors we discussed above) vs the alternative selectMethod=direct. They have been using selectMethod=cursor as their standard connection string from all apps.

From my DBA perspective, that's just annoying (it clutters the trace up with useless junk), and (I would speculate) is resulting in many extra app-to-SQL server round trips, reducing overall performance.

They apparently did test changing (just one of about 60 different app connections) to selectMethod=direct but experienced some issues (of which I have no details) and are concerned about the application breaking.

So, my questions are:

Can using selectMethod=cursor lower application performance, as I have tried to argue? (by increasing the number of round trips necessary on a SQL server that already has a very high queries/sec)
Is selectMethod= an application-transparent setting on a JDBC connection? Could this break their app if we change it?
More generally, when should you use cursor vs direct?

Also cross-posted to SF.

EDIT: Received actual technical details that warrant a significant edit to title, question, and tags.

EDIT: Added bounty. Also added bounty to the SF question (this question is focused on application behavior, the SF question is focused on SQL performance.) Thanks!!

662

asked Sep 01 '10 22:09

BradC

1 Answers

Briefly,

selectMethod=cursor
- theoretically requires more server-side resources than selectMethod=direct
- only loads at most batch-size records into client memory at once, resulting in a more predictable client memory footprint
selectMethod=direct
- theoretically requires less server-side resources than selectMethod=cursor
- will read the entire result set into client memory (unless the driver natively supports asynchronous result set retrieval) before the client application can iterate over it; this can reduce performance in two ways:
  1. reduced performance with large result sets if the client application is written in such a way as to stop processing after traversing only a fraction of the result set (with direct it has already paid the cost of retrieving data it will essentially throw away; with cursor the waste is limited to at most batch-size - 1 rows -- the early termination condition should probably be recoded in SQL anyway e.g. as SELECT TOP or window functions)
  2. reduced performance with large result sets because of potential garbage collection and/or out-of-memory issues associated with an increased memory footprint

In summary,

Can using selectMethod=cursor lower application performance? -- either method can lower performance, for different reasons. Past a certain resultset size, cursor may still be preferable. See below for when to use one or the other
Is selectMethod= an application-transparent setting on a JDBC connection? -- it is transparent, but it can still break their app if memory usage grows significantly enough to hog their client system (and, correspondingly, your server) or crash the client altogether
More generally, when should you use cursor vs direct? -- I personally use cursor when dealing with potentially large or otherwise unbounded result sets. The roundtrip overhead is then amortized given a large enough batch size, and my client memory footprint is predictable. I use direct when the size of the result set I expect is known to be inferior to whatever batch size I use with cursor, or bound in some way, or when memory is not an issue.

104

answered Nov 04 '22 07:11

vladr

Related questions
                            
                                Declaring multiple identical service in tnsnames.ora supported by oracle thin driver
                            
                                Unable to get db connection after Java 8 upgrade
                            
                                JDBC - Connect Multiple Databases
                            
                                Can not read response from server
                            
                                Printing interpolated SQL query in Slick
                            
                                SimpleJdbcTemplate and null parameters
                            
                                "System resource exceeded" during connection to Access file through Java jdbc odbc
                            
                                How to handle the transaction in J2EE 1.4 with plain JDBC
                            
                                Can I use multiple statements in a JDBC prepared query?
                            
                                Lock wait timeout exceeded; try restarting transaction using JDBC
                            
                                Tomcat 8 - java.sql.SQLException: Cannot create JDBC driver of class '' for connect URL 'jdbc:mysql://xxx/myApp'
                            
                                Hibernate and Multi-Tenant Database using Schemas in PostgreSQL
                            
                                Listagg function and ORA-01489: result of string concatenation is too long
                            
                                Data source rejected establishment of connection, message from server: "Too many connections" [closed]
                            
                                Is jdbcType necessary in a MyBatis mapper?
                            
                                javax.naming.NameNotFoundException: Name [jdbc/rhwebDB] is not bound in this Context. Unable to find [jdbc]
                            
                                JDBC connection with auto reconnect
                            
                                JDBC Prepared statement parameter inside json
                            
                                What is a Connection in JDBC?
                            
                                ClassNotFoundException - com.microsoft.jdbc.sqlserver.SQLServerDriver

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With