
How to accurately detect if a SQL Server job is running and deal with the job already running?

Tags:

sql-server

tsql

sql-server-agent

sql-agent-job

I'm currently using code like this to detect whether a SQL Server job is running. (This is SQL Server 2005, all SPs.)

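-- sp_help_job reports current_execution_status = 4 when the job is idle.
-- Returns 0 if the job is idle, 1 otherwise (treated as "active").
return (select isnull(
(select top 1 CASE
    WHEN current_execution_status = 4 THEN 0
    ELSE 1
    END
from openquery(devtestvm, 'EXEC msdb.dbo.sp_help_job')
where current_execution_status = 4 and
    name = 'WQCheckQueueJob' + cast(@Index as varchar(10))
), 1)
)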

No problems there, and generally speaking, it works just fine.

But.... (always a but)

On occasion, I'll invoke this, get back a "job is not running" result, and then try to start the job via

exec msdb.dbo.sp_start_job @JobName

and SQL will return that "SQLAgent has refused to start the job because it already has a pending request".

OK. Also not a problem. It's conceivable that there's a slight window in which the target job gets started after this code has checked its status but before this code can start it. However, I can just wrap that up in a TRY...CATCH and ignore the error, right?

begin try
if dbo.WQIsQueueJobActive(@index) = 0 begin
    exec msdb.dbo.sp_start_job @JobName
    break
end         
end try begin catch
    -- nothing here
end catch

Here's the problem, though.

Nine times out of ten, this works just fine. SQL Agent raises the error, it's caught, and processing continues, since the job is already running. No harm, no foul.

But occasionally, I'll get a message in the Job History view (keep in mind the above code to detect if a specific job is running and start it if not is actually running from another job) saying that the job failed because "SQLAgent has refused to start the job because it already has a pending request".

Of course, this is the exact error that TRY CATCH is supposed to be handling!

When this happens, the executing job just dies, though not immediately as far as I can tell, just pretty close. I've put logging all over the place and there's no consistency: one time it fails at place A, the next time at place B. In some cases, place A and place B have nothing but a

select @var = 'message'

in between them. Very strange. Basically, the job appears to be unceremoniously dumped, and anything left to execute in the job is not executed at all.

However, if I remove the "exec StartJob" (or have it invoked exactly one time, when I KNOW that the target job can't already be running), everything works perfectly and all my processing in the job runs through.

The purpose behind all this is to have a job started as a result of a trigger (among other things), and, if the job is already started, there's really no need to "start it again".

Anyone ever run into behavior like this with SQL Agent's Job handling?

EDIT: Current flow of control is like so:

1. Change to a table (update or insert)...
2. fires trigger which calls...
3. a stored proc which calls...
4. sp_Start_Job which...
5. starts a specific job which...
6. calls another stored proc (called CheckQueue) which...
7. performs some processing and...
8. checks several tables and depending on their contents might...
9. invoke sp_start_job on another job to start up a second, simultaneous job to process the additional work (this second job calls the CheckQueue sproc also but the two invocations operate on completely separate sets of data)

229

asked May 02 '11 19:05

DarinH

2 Answers

First of all, have you had a chance to look at service broker? From your description, it sounds like that's what you actually want.

The difference would be that instead of starting a job, you put your data into a SB queue, and SB calls your processing proc asynchronously, completely side-stepping issues with already-running jobs. It automatically spawns and terminates additional threads as demand dictates, it takes care of ordering, and so on.
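To make that concrete, here is a minimal sketch of what the Service Broker version might look like. Every object name in it (WQMessage, WQContract, WQQueue, WQService, dbo.WQProcessQueue) is made up for illustration, the call to dbo.CheckQueue stands in for whatever your processing proc really needs, and the database must have Service Broker enabled. Treat it as a starting point, not a finished implementation.

```sql
-- Sketch only: all object names below are hypothetical.
CREATE MESSAGE TYPE WQMessage VALIDATION = WELL_FORMED_XML;
CREATE CONTRACT WQContract (WQMessage SENT BY INITIATOR);
CREATE QUEUE dbo.WQQueue;
CREATE SERVICE WQService ON QUEUE dbo.WQQueue (WQContract);
GO

-- Activation proc: Service Broker starts it when messages arrive.
CREATE PROCEDURE dbo.WQProcessQueue
AS
BEGIN
    DECLARE @handle UNIQUEIDENTIFIER, @type SYSNAME, @body VARBINARY(MAX);

    WHILE 1 = 1
    BEGIN
        -- Wait up to one second for a message; exit when the queue is empty
        -- so the activated reader can shut down.
        WAITFOR (
            RECEIVE TOP (1)
                @handle = conversation_handle,
                @type   = message_type_name,
                @body   = message_body
            FROM dbo.WQQueue
        ), TIMEOUT 1000;

        IF @@ROWCOUNT = 0 BREAK;

        -- Placeholder for the real work; CAST(@body AS XML) holds the request.
        IF @type = N'WQMessage'
            EXEC dbo.CheckQueue;

        -- Close our side for work messages and for EndDialog/Error messages.
        END CONVERSATION @handle;
    END
END
GO

-- Let Service Broker run up to two readers in parallel.
ALTER QUEUE dbo.WQQueue WITH ACTIVATION (
    STATUS = ON,
    PROCEDURE_NAME = dbo.WQProcessQueue,
    MAX_QUEUE_READERS = 2,
    EXECUTE AS OWNER);
GO

-- In the trigger (or the proc it calls), replace sp_start_job with a SEND.
DECLARE @h UNIQUEIDENTIFIER;
BEGIN DIALOG CONVERSATION @h
    FROM SERVICE WQService
    TO SERVICE N'WQService'
    ON CONTRACT WQContract
    WITH ENCRYPTION = OFF;
SEND ON CONVERSATION @h MESSAGE TYPE WQMessage (N'<work index="1" />');
```

With this shape there is nothing to check before sending: if a reader is already running it picks up the new message, and if none is running Service Broker activates one.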

Here's a good (and vaguely related) tutorial. http://www.sqlteam.com/article/centralized-asynchronous-auditing-with-service-broker

Let's assume that you can't use SB for whatever reason (but seriously, do!).

What about using the job spid's context_info?

1. Your job calls a wrapper proc that execs each step individually.

2. The first statement inside the wrapper proc is

    DECLARE @context_info VARBINARY(30)
    SET @context_info = CAST('MyJob1' AS VARBINARY(30))
    SET CONTEXT_INFO @context_info

3. When your proc finishes (or in your catch block)
```
SET CONTEXT_INFO 0x0
```

4. When you are looking at calling your job, do this:

    IF NOT EXISTS (SELECT * FROM master..sysprocesses WITH (NOLOCK) WHERE context_info=CAST('MyJob1' AS VARBINARY(30)))
        EXEC StartJob -- placeholder for your call to msdb.dbo.sp_start_job

When your wrapper proc terminates or the connection is closed, your context_info goes away.
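Putting the wrapper steps together, a rough sketch might look like the following. The proc name dbo.WQCheckQueueWrapper is invented here, and dbo.CheckQueue is assumed to take no parameters, so adjust both to your setup. The syntax is kept SQL Server 2005 compatible.

```sql
-- Sketch only: dbo.WQCheckQueueWrapper is a hypothetical name, and
-- dbo.CheckQueue is assumed to take no parameters.
CREATE PROCEDURE dbo.WQCheckQueueWrapper
AS
BEGIN
    DECLARE @context_info VARBINARY(30);
    DECLARE @msg NVARCHAR(2048);

    -- Mark this spid as "MyJob1 is running".
    SET @context_info = CAST('MyJob1' AS VARBINARY(30));
    SET CONTEXT_INFO @context_info;

    BEGIN TRY
        EXEC dbo.CheckQueue;
    END TRY
    BEGIN CATCH
        -- Clear the marker, then re-raise so the job step still fails.
        SET @msg = ERROR_MESSAGE();
        SET CONTEXT_INFO 0x0;
        RAISERROR('%s', 16, 1, @msg);
        RETURN;
    END CATCH

    -- Normal completion: clear the marker.
    SET CONTEXT_INFO 0x0;
END
```

The job step then only needs `EXEC dbo.WQCheckQueueWrapper`. Note that this narrows the window between checking and starting but does not remove it, since the marker is only set once the job's step is actually executing.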

You could also use a global temp table (e.g. ##JobStatus). It will disappear when all spids that reference it disconnect, or when it's explicitly dropped.
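A rough sketch of the global temp table variant follows. The table name ##WQJobStatus_1 and the job name WQCheckQueueJob1 are made up for illustration (one table per job index is just one way to do it).

```sql
-- Inside the job's wrapper proc: the table's existence is the "running" flag.
-- ##WQJobStatus_1 is a hypothetical name.
CREATE TABLE ##WQJobStatus_1 (started_at DATETIME NOT NULL);
INSERT INTO ##WQJobStatus_1 (started_at) VALUES (GETDATE());

-- ... do the job's work here ...

DROP TABLE ##WQJobStatus_1;
GO

-- In the caller: only start the job if the flag table is absent.
IF OBJECT_ID('tempdb..##WQJobStatus_1') IS NULL
    EXEC msdb.dbo.sp_start_job @job_name = N'WQCheckQueueJob1';
```

If the job's connection dies without reaching the DROP, the table should still go away once that session ends, which is the property that makes it usable as a flag.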

Just a few thoughts.

181

answered Sep 25 '22 01:09

Code Magician

I have a query that lists the running jobs; maybe it can help you. It has been working for me, but if you find any fault in it, let me know and I will try to fix it. Cheers.

-- get the running jobs
--marcelo miorelli
-- 10-dec-2013


SELECT sj.name
      ,DATEDIFF(SECOND,aj.start_execution_date,GetDate()) AS Seconds
 FROM msdb..sysjobactivity aj
 JOIN msdb..sysjobs sj on sj.job_id = aj.job_id
WHERE aj.stop_execution_date IS NULL -- job hasn't stopped running
 AND aj.start_execution_date IS NOT NULL -- job is currently running
--AND sj.name = 'JobName'
and not exists( -- make sure this is the most recent run
    select 1
    from msdb..sysjobactivity new
    where new.job_id = aj.job_id
      and new.start_execution_date > aj.start_execution_date )
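As a sketch of how a check like this could gate the call to sp_start_job: the job name below is only an example, and the session_id filter against msdb.dbo.syssessions is my own addition rather than part of the query above. It is meant to ignore rows left over from an earlier SQL Agent session that never recorded a stop time.

```sql
-- Sketch only: N'WQCheckQueueJob1' is an example job name.
DECLARE @JobName SYSNAME;
SET @JobName = N'WQCheckQueueJob1';

IF NOT EXISTS (
    SELECT 1
    FROM msdb.dbo.sysjobactivity aj
    JOIN msdb.dbo.sysjobs sj ON sj.job_id = aj.job_id
    WHERE sj.name = @JobName
      AND aj.start_execution_date IS NOT NULL  -- the job has started
      AND aj.stop_execution_date IS NULL       -- and has not stopped
      -- Assumption: only trust rows from the current Agent session.
      AND aj.session_id = (SELECT MAX(session_id) FROM msdb.dbo.syssessions)
)
    EXEC msdb.dbo.sp_start_job @job_name = @JobName;
```

There is still a gap between the check and the start, so this reduces how often the "pending request" error occurs rather than ruling it out.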

answered Sep 26 '22 01:09

Marcello Miorelli
