I need to design a Redis-driven scalable task scheduling system. Requirements: <ul> <li>Multiple worker processes.</li> <li>Many tasks, but long periods of idleness are possible.</li> <li>Reasonable timing precision.</li> <li>Minimal resource waste when idle.</li> <li>Should use synchronous Redis API.</li> <li>Should work for Redis 2.4 (i.e. no features from upcoming 2.6).</li> <li>Should not use other means of RPC than Redis.</li> </ul> Pseudo-API: <code>schedule_task(timestamp, task_data)</code>. Timestamp is in integer seconds. Basic idea: <ul> <li>Listen for upcoming tasks on list.</li> <li>Put tasks to buckets per timestamp. </li> <li>Sleep until the closest timestamp. </li> <li>If a new task appears with timestamp less than closest one, wake up.</li> <li>Process all upcoming tasks with timestamp ≤ now, in batches (assuming that task execution is fast). </li> <li>Make sure that concurrent worker wouldn't process same tasks. At the same time, make sure that no tasks are lost if we crash while processing them.</li> </ul> So far I can't figure out how to fit this in Redis primitives... Any clues? Note that there is a similar old question: Delayed execution / scheduling with Redis? In this new question I introduce more details (most importantly, many workers). So far I was not able to figure out how to apply old answers here — thus, a new question.

Here's another solution that builds on a couple of others [1]. It uses the redis WATCH command to remove the race condition without using lua in redis 2.6. The basic scheme is: <ul> <li>Use a redis zset for scheduled tasks and redis queues for ready to run tasks.</li> <li>Have a dispatcher poll the zset and move tasks that are ready to run into the redis queues. You may want more than 1 dispatcher for redundancy but you probably don't need or want many.</li> <li>Have as many workers as you want which do blocking pops on the redis queues.</li> </ul> I haven't tested it :-) The foo job creator would do: <pre class="prettyprint"><code>def schedule_task(queue, data, delay_secs): # This calculation for run_at isn't great- it won't deal well with daylight # savings changes, leap seconds, and other time anomalies. Improvements # welcome :-) run_at = time.time() + delay_secs # If you're using redis-py's Redis class and not StrictRedis, swap run_at & # the dict. redis.zadd(SCHEDULED_ZSET_KEY, run_at, {'queue': queue, 'data': data}) schedule_task('foo_queue', foo_data, 60) </code></pre> The dispatcher(s) would look like: <pre class="prettyprint"><code>while working: redis.watch(SCHEDULED_ZSET_KEY) min_score = 0 max_score = time.time() results = redis.zrangebyscore( SCHEDULED_ZSET_KEY, min_score, max_score, start=0, num=1, withscores=False) if results is None or len(results) == 0: redis.unwatch() sleep(1) else: # len(results) == 1 redis.multi() redis.rpush(results[0]['queue'], results[0]['data']) redis.zrem(SCHEDULED_ZSET_KEY, results[0]) redis.exec() </code></pre> The foo worker would look like: <pre class="prettyprint"><code>while working: task_data = redis.blpop('foo_queue', POP_TIMEOUT) if task_data: foo(task_data) </code></pre> [1] This solution is based on not_a_golfer's, one at http://www.saltycrane.com/blog/2011/11/unique-python-redis-based-queue-delay/, and the redis docs for transactions.

You didn't specify the language you're using. You have at least 3 alternatives of doing this without writing a single line of code in Python at least. <ol> <li>Celery has an optional redis broker. http://celeryproject.org/</li> <li>resque is an extremely popular redis task queue using redis. https://github.com/defunkt/resque</li> <li>RQ is a simple and small redis based queue that aims to "take the good stuff from celery and resque" and be much simpler to work with. http://python-rq.org/</li> </ol> You can at least look at their design if you can't use them. But to answer your question - what you want can be done with redis. I've actually written more or less that in the past. EDIT: As for modeling what you want on redis, this is what I would do: <ol> <li>queuing a task with a timestamp will be done directly by the client - you put the task in a sorted set with the timestamp as the score and the task as the value (see ZADD).</li> <li>A central dispatcher wakes every N seconds, checks out the first timestamps on this set, and if there are tasks ready for execution, it pushes the task to a "to be executed NOW" list. This can be done with ZREVRANGEBYSCORE on the "waiting" sorted set, getting all items with timestamp<=now, so you get all the ready items at once. pushing is done by RPUSH.</li> <li>workers use BLPOP on the "to be executed NOW" list, wake when there is something to work on, and do their thing. This is safe since redis is single threaded, and no 2 workers will ever take the same task.</li> <li>once finished, the workers put the result back in a response queue, which is checked by the dispatcher or another thread. You can add a "pending" bucket to avoid failures or something like that. </li> </ol> so the code will look something like this (this is just pseudo code): client: <pre class="prettyprint"><code>ZADD "new_tasks" <TIMESTAMP> <TASK_INFO> </code></pre> dispatcher: <pre class="prettyprint"><code>while working: tasks = ZREVRANGEBYSCORE "new_tasks" <NOW> 0 #this will only take tasks with timestamp lower/equal than now for task in tasks: #do the delete and queue as a transaction MULTI RPUSH "to_be_executed" task ZREM "new_tasks" task EXEC sleep(1) </code></pre> I didn't add the response queue handling, but it's more or less like the worker: worker: <pre class="prettyprint"><code>while working: task = BLPOP "to_be_executed" <TIMEOUT> if task: response = work_on_task(task) RPUSH "results" response </code></pre> EDit: stateless atomic dispatcher : <pre class="prettyprint"><code>while working: MULTI ZREVRANGE "new_tasks" 0 1 ZREMRANGEBYRANK "new_tasks" 0 1 task = EXEC #this is the only risky place - you can solve it by using Lua internall in 2.6 SADD "tmp" task if task.timestamp <= now: MULTI RPUSH "to_be_executed" task SREM "tmp" task EXEC else: MULTI ZADD "new_tasks" task.timestamp task SREM "tmp" task EXEC sleep(RESOLUTION) </code></pre>

Scalable delayed task execution with Redis

Tags:

redis

scalability

scheduled-tasks

I need to design a Redis-driven scalable task scheduling system.

Requirements:

Multiple worker processes.
Many tasks, but long periods of idleness are possible.
Reasonable timing precision.
Minimal resource waste when idle.
Should use synchronous Redis API.
Should work for Redis 2.4 (i.e. no features from upcoming 2.6).
Should not use other means of RPC than Redis.

Pseudo-API: schedule_task(timestamp, task_data). Timestamp is in integer seconds.

Basic idea:

Listen for upcoming tasks on list.
Put tasks to buckets per timestamp.
Sleep until the closest timestamp.
If a new task appears with timestamp less than closest one, wake up.
Process all upcoming tasks with timestamp ≤ now, in batches (assuming that task execution is fast).
Make sure that concurrent worker wouldn't process same tasks. At the same time, make sure that no tasks are lost if we crash while processing them.

So far I can't figure out how to fit this in Redis primitives...

Any clues?

Note that there is a similar old question: Delayed execution / scheduling with Redis? In this new question I introduce more details (most importantly, many workers). So far I was not able to figure out how to apply old answers here — thus, a new question.

315

asked Jun 03 '12 07:06

Alexander Gladysh

2 Answers

Here's another solution that builds on a couple of others [1]. It uses the redis WATCH command to remove the race condition without using lua in redis 2.6.

The basic scheme is:

Use a redis zset for scheduled tasks and redis queues for ready to run tasks.
Have a dispatcher poll the zset and move tasks that are ready to run into the redis queues. You may want more than 1 dispatcher for redundancy but you probably don't need or want many.
Have as many workers as you want which do blocking pops on the redis queues.

I haven't tested it :-)

The foo job creator would do:

def schedule_task(queue, data, delay_secs):
    # This calculation for run_at isn't great- it won't deal well with daylight
    # savings changes, leap seconds, and other time anomalies. Improvements
    # welcome :-)
    run_at = time.time() + delay_secs

    # If you're using redis-py's Redis class and not StrictRedis, swap run_at &
    # the dict.
    redis.zadd(SCHEDULED_ZSET_KEY, run_at, {'queue': queue, 'data': data})

schedule_task('foo_queue', foo_data, 60)

The dispatcher(s) would look like:

while working:
    redis.watch(SCHEDULED_ZSET_KEY)
    min_score = 0
    max_score = time.time()
    results = redis.zrangebyscore(
        SCHEDULED_ZSET_KEY, min_score, max_score, start=0, num=1, withscores=False)
    if results is None or len(results) == 0:
        redis.unwatch()
        sleep(1)
    else: # len(results) == 1
        redis.multi()
        redis.rpush(results[0]['queue'], results[0]['data'])
        redis.zrem(SCHEDULED_ZSET_KEY, results[0])
        redis.exec()

The foo worker would look like:

while working:
    task_data = redis.blpop('foo_queue', POP_TIMEOUT)
    if task_data:
        foo(task_data)

[1] This solution is based on not_a_golfer's, one at http://www.saltycrane.com/blog/2011/11/unique-python-redis-based-queue-delay/, and the redis docs for transactions.

answered Oct 22 '22 17:10

Dan Benamy

You didn't specify the language you're using. You have at least 3 alternatives of doing this without writing a single line of code in Python at least.

Celery has an optional redis broker. http://celeryproject.org/
resque is an extremely popular redis task queue using redis. https://github.com/defunkt/resque
RQ is a simple and small redis based queue that aims to "take the good stuff from celery and resque" and be much simpler to work with. http://python-rq.org/

You can at least look at their design if you can't use them.

But to answer your question - what you want can be done with redis. I've actually written more or less that in the past.

EDIT: As for modeling what you want on redis, this is what I would do:

queuing a task with a timestamp will be done directly by the client - you put the task in a sorted set with the timestamp as the score and the task as the value (see ZADD).
A central dispatcher wakes every N seconds, checks out the first timestamps on this set, and if there are tasks ready for execution, it pushes the task to a "to be executed NOW" list. This can be done with ZREVRANGEBYSCORE on the "waiting" sorted set, getting all items with timestamp<=now, so you get all the ready items at once. pushing is done by RPUSH.
workers use BLPOP on the "to be executed NOW" list, wake when there is something to work on, and do their thing. This is safe since redis is single threaded, and no 2 workers will ever take the same task.
once finished, the workers put the result back in a response queue, which is checked by the dispatcher or another thread. You can add a "pending" bucket to avoid failures or something like that.

so the code will look something like this (this is just pseudo code):

client:

ZADD "new_tasks" <TIMESTAMP> <TASK_INFO>

dispatcher:

while working:
   tasks = ZREVRANGEBYSCORE "new_tasks" <NOW> 0 #this will only take tasks with timestamp lower/equal than now
   for task in tasks:

       #do the delete and queue as a transaction
       MULTI
       RPUSH "to_be_executed" task
       ZREM "new_tasks" task
       EXEC

   sleep(1)

I didn't add the response queue handling, but it's more or less like the worker:

worker:

while working:
   task = BLPOP "to_be_executed" <TIMEOUT>
   if task:
      response = work_on_task(task)
      RPUSH "results" response

EDit: stateless atomic dispatcher :

while working:

   MULTI
   ZREVRANGE "new_tasks" 0 1
   ZREMRANGEBYRANK "new_tasks" 0 1
   task = EXEC

   #this is the only risky place - you can solve it by using Lua internall in 2.6
   SADD "tmp" task

   if task.timestamp <= now:
       MULTI
       RPUSH "to_be_executed" task
       SREM "tmp" task
       EXEC
   else:

       MULTI
       ZADD "new_tasks" task.timestamp task
       SREM "tmp" task
       EXEC

   sleep(RESOLUTION)

answered Oct 22 '22 16:10

Not_a_Golfer

Related questions
                            
                                Get Set value from Redis using RedisTemplate
                            
                                In Redis pubsub, is it possible to pass an object to the PUBLISH command?
                            
                                Redis data structure design for sorting time-based values
                            
                                How to use Redis within a C++ program?
                            
                                Redis server can't run more than 1024M maxheap
                            
                                Saving Redis query output to file
                            
                                How to store array of hashes in redis
                            
                                phpredis on windows 7 64bit xampp
                            
                                Does the redis pub/sub model require persistent connections to redis?
                            
                                Node Redis - SET with EX and NX?
                            
                                Why Getting NoClassDefFound error for JedisConnection when using Spring Redis
                            
                                RQ - Empty & Delete Queues
                            
                                Heroku Redis logs are noisy - how can they be filtered out?
                            
                                How can I support the Redis sentinel architecture using StackExchange.Redis?
                            
                                How does StackExchange.Redis use multiple endpoints and connections?
                            
                                Redis Cluster vs ZeroMQ in Pub/Sub, for horizontally scaled distributed systems
                            
                                Achieving JMS/AMQP messaging patterns using Redis
                            
                                Can redis fully replace mysql?
                            
                                How to check Resque worker status to determine whether it's dead or stale
                            
                                How to fix the WARNINGs when running the redis:alpine Docker image

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With