
Why does Microsoft SQL Server 2012 query take minutes over JDBC 4.0 but second(s) in Management Studio?

Tags:

java

sql-server

sql-server-2012

jdbc

I am dealing with what is apparently a performance issue while retrieving a relatively large ResultSet from a remote Microsoft SQL Server 2012 to a Java client that uses Microsoft JDBC Driver 4.0.

When I run the corresponding query on the remote server's Microsoft SQL Server Management Studio, it returns approx. 220k rows almost instantaneously. When I issue the same query from the client, it stalls. The same test has worked fine also on the client with an earlier version of the database where only approx. 400 rows qualified.

I tried to tackle this by appending ";responseBuffering=adaptive" to the URL passed to DriverManager.getConnection(). After the connection is established, I see this property (among several others) in the result from connection.getMetaData().getURL(), but connection.getClientInfo("responseBuffering") returns null, and what is more, the client is still stalling.
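For reference, here is a minimal sketch of how such a URL can be assembled and how the property can be read back from the string that connection.getMetaData().getURL() returns. The host, port, and database name are placeholders, and buildUrl and urlProperty are hypothetical helpers, not part of the driver. As far as I know, getClientInfo() reports JDBC client-info properties (such as ApplicationName) rather than driver connection properties, so a null there does not by itself mean the setting was ignored.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class ConnectionUrlSketch {

    // Builds a SQL Server JDBC URL; host, port and database are placeholders.
    public static String buildUrl(String host, int port, String database,
                                  Map<String, String> props) {
        StringBuilder url = new StringBuilder("jdbc:sqlserver://" + host + ":" + port);
        url.append(";databaseName=").append(database);
        for (Map.Entry<String, String> e : props.entrySet()) {
            url.append(';').append(e.getKey()).append('=').append(e.getValue());
        }
        return url.toString();
    }

    // Reads one property back out of a URL string. Returns null if it is absent.
    // Simplification: values containing escaped semicolons or braces are not handled.
    public static String urlProperty(String url, String name) {
        String[] parts = url.split(";");
        for (int i = 1; i < parts.length; i++) {
            int eq = parts[i].indexOf('=');
            if (eq > 0 && parts[i].substring(0, eq).trim().equalsIgnoreCase(name)) {
                return parts[i].substring(eq + 1).trim();
            }
        }
        return null;
    }

    public static void main(String[] args) {
        Map<String, String> props = new LinkedHashMap<>();
        props.put("responseBuffering", "adaptive");
        String url = buildUrl("dbhost.example.com", 1433, "MyDb", props);
        System.out.println(url);
        System.out.println(urlProperty(url, "responseBuffering"));
        // With a live server: DriverManager.getConnection(url, user, password)
    }
}
```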

What could be going wrong here, and how can I instruct Microsoft SQL Server (not merely suggest to it, and programmatically in Java) that it must return rows in smaller chunks rather than all at once, or improve JDBC query times by some other measure?

Two further observations that seem somewhat strange and that perhaps point to a different root cause entirely:

When the client stalls it still shows only relatively light CPU load, unlike what I would expect from heavy garbage collection
"responseBuffering=adaptive" should be the normal default by now

UPDATE I've checked and found that switching from PreparedStatement to Statement does not improve things in my case (it apparently can help in other cases).

UPDATE Here is my current query:

select 
    PARENT.IDENTIFIER    as PARENT_IDENTIFIER,
    PARENT.CLASS         as PARENT_CLASS,
    CHILD.TYPE           as CHILD_TYPE,
    CHILD.IDENTIFIER     as CHILD_IDENTIFIER,
    PROPERTY.IDENTIFIER  as PROPERTY_IDENTIFIER,
    PROPERTY.DESCRIPTION as PROPERTY_DESCRIPTION,
    PROPERTY.TYPE        as PROPERTY_TYPE,
    PROPERTY.PP          as PROPERTY_PP,
    PROPERTY.STATUS      as PROPERTY_STATUS,
    PROPERTY.TARGET      as PROPERTY_TARGET -- a date
from
    OBJECTS as CHILD
    left outer join RELATIONS              on RELATIONS.CHILD = CHILD.IDENTIFIER
    left outer join OBJECTS    as PARENT   on RELATIONS.PARENT = PARENT.IDENTIFIER
    inner join      PROPERTIES as PROPERTY on PROPERTY.OBJECT = CHILD.IDENTIFIER
where
    PROPERTY.TARGET is not null
order by
    case when PARENT.IDENTIFIER is null then 1 else 0 end,
    PARENT.IDENTIFIER,
    CHILD.IDENTIFIER,
    PROPERTY.TARGET,
    PROPERTY.IDENTIFIER


asked Oct 22 '14 06:10

Drux

2 Answers

The adaptive buffering is a good answer. I would also recommend checking the connections' SET options via SQL Server Profiler.

When you start a trace, make sure ExistingConnections is selected. Compare a SPID from a JDBC connection with one from an SSMS connection. ARITHABORT comes to mind as one option that I have seen cause a difference in performance between SSMS and the JDBC driver. Microsoft briefly mentions it here: http://msdn.microsoft.com/en-us/library/ms190306.aspx. Stack Exchange information here: https://dba.stackexchange.com/questions/9840/why-would-set-arithabort-on-dramatically-speed-up-a-query
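If Profiler is not at hand, the session's SET options can also be read from the JDBC side. The following is only a sketch: it assumes an already opened java.sql.Connection and uses the T-SQL commands DBCC USEROPTIONS and SET ARITHABORT ON. The class and method names are made up for illustration, and the values in main are illustrative, not taken from your server.

```java
import java.sql.Connection;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;
import java.util.Map;
import java.util.Objects;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

public class SetOptionsSketch {

    // Runs DBCC USEROPTIONS and collects the option/value pairs of this session.
    public static Map<String, String> readUserOptions(Connection conn) throws SQLException {
        Map<String, String> options = new TreeMap<>();
        try (Statement stmt = conn.createStatement();
             ResultSet rs = stmt.executeQuery("DBCC USEROPTIONS")) {
            while (rs.next()) {
                options.put(rs.getString(1), rs.getString(2));
            }
        }
        return options;
    }

    // Lists options whose values differ between two sessions or exist in only one.
    public static Map<String, String> diff(Map<String, String> jdbc, Map<String, String> ssms) {
        Map<String, String> result = new TreeMap<>();
        Set<String> keys = new TreeSet<>(jdbc.keySet());
        keys.addAll(ssms.keySet());
        for (String key : keys) {
            String a = jdbc.get(key);
            String b = ssms.get(key);
            if (!Objects.equals(a, b)) {
                result.put(key, "jdbc=" + a + ", ssms=" + b);
            }
        }
        return result;
    }

    // Turns ARITHABORT on for the current session, to test whether it matters.
    public static void enableArithAbort(Connection conn) throws SQLException {
        try (Statement stmt = conn.createStatement()) {
            stmt.execute("SET ARITHABORT ON");
        }
    }

    public static void main(String[] args) {
        // Illustrative values only; fill these from readUserOptions() and from SSMS.
        Map<String, String> jdbc = new TreeMap<>();
        jdbc.put("quoted_identifier", "SET");
        Map<String, String> ssms = new TreeMap<>();
        ssms.put("quoted_identifier", "SET");
        ssms.put("arithabort", "SET");
        System.out.println(diff(jdbc, ssms));
    }
}
```

Calling enableArithAbort(conn) before running the slow query and timing it again is a cheap way to check whether this option is what makes the difference.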

On Oracle, I have seen huge impacts from playing with the setFetchSize method on the Statement / PreparedStatement object. Apparently, the SQL Server driver does not support that method. However, there is an internal method in the driver for it. See the question "Set a default row prefetch in SQL Server using JDBC driver" for details.

Also, what are you doing in your while (rs.next()) loop? Try doing nothing other than reading a column, like rs.getInt(1). See what happens. If it flies, that suggests the bottleneck is in your former processing of the result set. If it is still slow, then the problem must be in the driver or database.
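Such a stripped-down loop could look like the sketch below. It assumes the JDBC URL (including credentials) and the query are passed on the command line, and the class and method names are made up for illustration. It times the execute step and the fetch loop separately, which helps tell a slow plan on the server from slow row transfer.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;

public class DrainTimingSketch {

    // Iterates over the whole result set, touching only the first column,
    // and returns {rowCount, elapsedMillis}.
    public static long[] drain(ResultSet rs) throws SQLException {
        long start = System.nanoTime();
        long rows = 0;
        while (rs.next()) {
            rs.getObject(1);
            rows++;
        }
        long elapsedMillis = (System.nanoTime() - start) / 1_000_000;
        return new long[] { rows, elapsedMillis };
    }

    public static void main(String[] args) throws SQLException {
        if (args.length < 2) {
            System.out.println("usage: DrainTimingSketch <jdbc-url> <query>");
            return;
        }
        try (Connection conn = DriverManager.getConnection(args[0]);
             Statement stmt = conn.createStatement()) {
            long t0 = System.nanoTime();
            try (ResultSet rs = stmt.executeQuery(args[1])) {
                long executeMillis = (System.nanoTime() - t0) / 1_000_000;
                long[] result = drain(rs);
                System.out.println("execute: " + executeMillis + " ms");
                System.out.println("fetch:   " + result[1] + " ms for " + result[0] + " rows");
            }
        }
    }
}
```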

You could use SQL Server Profiler to compare the executions as they come in via JDBC and as you run it via SSMS. Compare the CPU, reads, writes and duration. If they are different, then the execution plan is probably different, which points me back to the first thing I mentioned: the SET options.


answered Oct 07 '22 19:10

Brandon

I'm simply going to toss out this suggestion, and leave it for you to test.

The JDBC driver may well be FETCHING all of the rows before it returns, whereas the other system is simply returning the open cursor.

I have seen this behavior on other databases with JDBC, but have no direct experience with SQL Server.

In the examples where I have seen it, setting the auto commit to false for the connection prevents it from loading the entire result set. There are other settings to have it load only portions, etc.

But that could well be the underlying issue you are facing.

answered Oct 07 '22 19:10

Will Hartung
