In the process of trying to help out an app dev team with performance issues on a SQL 2000 server (from a bunch of Java applications on separate app servers), I ran a SQL trace and discovered that all calls to the database are full of API Server Cursor statements (sp_cursorprepexec, sp_cursorfetch, sp_cursorclose).
Looks like they're specifying some connection string properties that force the use of server-side cursors, retrieving only 128 rows of data at a time: (From http://msdn.microsoft.com/en-us/library/Aa172588)
When the API cursor attributes or properties are set to anything other than their defaults, the OLE DB provider for SQL Server and the SQL Server ODBC driver use API server cursors instead of default result sets. Each call to an API function that fetches rows generates a roundtrip to the server to fetch the rows from the API server cursor.
UPDATE: The connection string at issue is a JDBC connection string parameter, selectMethod=cursor
(which enables the server-side cursors we discussed above) vs the alternative selectMethod=direct
. They have been using selectMethod=cursor
as their standard connection string from all apps.
From my DBA perspective, that's just annoying (it clutters the trace up with useless junk), and (I would speculate) is resulting in many extra app-to-SQL server round trips, reducing overall performance.
They apparently did test changing (just one of about 60 different app connections) to selectMethod=direct
but experienced some issues (of which I have no details) and are concerned about the application breaking.
So, my questions are:
selectMethod=cursor
lower application performance, as I have tried to argue? (by increasing the number of round trips necessary on a SQL server that already has a very high queries/sec)selectMethod=
an application-transparent setting on a JDBC connection? Could this break their app if we change it? cursor
vs direct
?Also cross-posted to SF.
EDIT: Received actual technical details that warrant a significant edit to title, question, and tags.
EDIT: Added bounty. Also added bounty to the SF question (this question is focused on application behavior, the SF question is focused on SQL performance.) Thanks!!
The selectMethod property is optional to the jdbc URL. When this property is set to "cursor", a database cursor is created. This is useful when reading very large result sets that cannot be contained in the clients memory.
Connect to SQL Server Using JDBC Driver and Command Linedatasource = "MSSQLServerAuth"; username = ""; password = ""; conn = database(datasource,username,password); Or, to connect without Windows authentication, use the configured JDBC data source and specify the user name username and the password pwd .
A cursor provides you with the ability to step through and process the rows in a ResultSet one by one. A java. sql. ResultSet object constitutes a cursor. You do not need to use a language construct, such as SQL-92's DECLARE CURSOR, to work with cursors in a Java application.
In our continued commitment to interoperability, Microsoft provides a Java Database Connectivity (JDBC) driver for use with SQL Server, Azure SQL Database, and Azure SQL Managed Instance.
Briefly,
selectMethod=cursor
selectMethod=direct
selectMethod=direct
selectMethod=cursor
direct
it has already paid the cost of retrieving data it will essentially throw away; with cursor
the waste is limited to at most batch-size - 1 rows -- the early termination condition should probably be recoded in SQL anyway e.g. as SELECT TOP
or window functions)In summary,
selectMethod=cursor
lower application performance? -- either method can lower performance, for different reasons. Past a certain resultset size, cursor
may still be preferable. See below for when to use one or the otherselectMethod=
an application-transparent setting on a JDBC connection? -- it is transparent, but it can still break their app if memory usage grows significantly enough to hog their client system (and, correspondingly, your server) or crash the client altogethercursor
vs direct
? -- I personally use cursor
when dealing with potentially large or otherwise unbounded result sets. The roundtrip overhead is then amortized given a large enough batch size, and my client memory footprint is predictable. I use direct
when the size of the result set I expect is known to be inferior to whatever batch size I use with cursor
, or bound in some way, or when memory is not an issue.If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With