Understanding parallel TSQL connections

Tags: tsql, r, parallel-foreach

I managed to create parallel connections in R to a TSQL server using the code below:

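    library(RODBC)
    library(doParallel)  # also attaches foreach and parallel

    # facility (vector of location numbers) and facility_count are defined elsewhere.
    SQL_retrieve <- function(x) {
      con <- odbcDriverConnect(
        'driver={SQL Server};server=OPTMSLMSOFT02;database=Ad_History;trusted_connection=true')
      odbcGetInfo(con)
      rawData <- sqlQuery(con,
        paste("select * from AD_MDL_R_INPUT a where a.itm_lctn_num = ",
              facility[x]))
      odbcClose(con)
      return(rawData)
    }

    cl <- makeCluster(5)
    registerDoParallel(cl)
    # %dopar% must follow foreach() on the same line so R parses them as one expression.
    outputPar <- foreach(j = 1:facility_count, .packages = "RODBC") %dopar%
      SQL_retrieve(j)
    stopCluster(cl)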

I would expect to see all connections actively downloading in parallel, but the reality is that only one or two connections are active at a time (see image below).
Even with 32 connections, the total download time is cut by slightly more than 1/2 (should be closer to 1/32, in theory, right?). There are also large pauses between connection activity. Why is this?
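
For a rough sense of what those numbers imply, one can apply Amdahl's law. This is an assumption about how the work divides, not something measured in this post: s is a hypothetical fraction of the total time that runs serially no matter how many connections are open, and n is the number of connections.

```latex
S(n) = \frac{1}{s + \frac{1 - s}{n}}, \qquad
S(32) \approx 2 \;\Rightarrow\; s + \frac{1 - s}{32} \approx \frac{1}{2}
\;\Rightarrow\; 31\,s \approx 15 \;\Rightarrow\; s \approx 0.48
```

Under that model, a speedup of about 2 with 32 connections would mean roughly half of the run behaves as if it were serialized, which fits with seeing only one or two connections active at a time.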

[Image: Connection Utilization]

Some notes to keep in mind:

The TSQL server and R are both on the same server, so network latency is not an issue.
The TSQL server allows a maximum of ~32k connections, so we are not bumping into a session limit.

UPDATE 7/26/17: I took another stab at this problem and it now works (code unchanged). I am not sure what happened between the initial posting and now; perhaps some MS SQL Server settings changed, though that seems unlikely.

The time to pull 7.9 million rows follows the curve in the image below.

[Image: Time versus SQL Connections]

asked Aug 17 '16 14:08

Evan Larson

1 Answer

SQL Server uses "Connection Pooling."

Establishing a connection from scratch takes a lot of time.

Applications often make repeated identical connections, so pooling improves performance. A pooled connection is only half-closed when it is released, so the next identical connection starts with a leg up and is much quicker.

You probably don't want pooling in your case. You can try turning it off by adding "pooling=false;" to the connection string, as mentioned by @rene-lykke-dahl. That may resolve your issue.
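
A minimal sketch of that change in R, reusing the server and database names from the question. The helper build_conn_string is hypothetical (it is not part of RODBC), and it is an assumption that the driver honors the pooling keyword at all: that keyword is documented for ADO.NET connection strings, while ODBC pooling is normally configured in the driver manager, so the effect is worth verifying.

```r
# Hypothetical helper (not part of RODBC): assembles the connection string
# from the question and, when pooling is FALSE, appends "pooling=false".
build_conn_string <- function(server, database, pooling = FALSE) {
  parts <- c(
    "driver={SQL Server}",
    paste0("server=", server),
    paste0("database=", database),
    "trusted_connection=true"
  )
  # Assumption: the driver accepts this keyword; it may simply be ignored.
  if (!pooling) parts <- c(parts, "pooling=false")
  paste0(paste(parts, collapse = ";"), ";")
}

conn_string <- build_conn_string("OPTMSLMSOFT02", "Ad_History")

# Inside SQL_retrieve the string would then be passed to RODBC, e.g.:
# con <- RODBC::odbcDriverConnect(conn_string)
```

Building the string in one place makes it easy to time the same run with and without the keyword and see whether it changes anything.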

Read about connection pooling here:

answered Oct 07 '22 02:10

Jason Geiger
