Task Parallel Library - Custom Task Schedulers

I have a requirement to fire off web service requests to an online api and I thought that Parallel Extensions would be a good fit for my needs.

The web service in question is designed to be called repeatedly, but has a mechanism that charges you if you go over a certain number of calls per second. I obviously want to minimize my charges, so I was wondering if anyone has seen a TaskScheduler that can cope with the following requirements:

  1. Limit the number of tasks scheduled per timespan. I guess if the number of requests exceeded this limit, it would need to throw away the task, or possibly block, to stop a backlog of tasks building up?
  2. Detect whether an identical request is already queued but has not yet executed, and if so, return the first task instead of queuing a second.

Do people feel that these are the sorts of responsibilities a task scheduler should be dealing with, or am I barking up the wrong tree? If you have alternatives, I am open to suggestions.

Fen asked Mar 20 '12 21:03


People also ask

What is the difference between Task.Run and Task.Factory.StartNew?

Task.Run(action) internally uses the default TaskScheduler, which means it always offloads the task to the thread pool. Task.Factory.StartNew(action), on the other hand, uses the current task's scheduler (TaskScheduler.Current), which may not use the thread pool at all!
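A minimal sketch of the difference described above; the console messages are illustrative, and passing the scheduler explicitly to StartNew is a common way to make its behavior predictable:

```csharp
using System;
using System.Threading;
using System.Threading.Tasks;

class SchedulerDemo
{
    static async Task Main()
    {
        // Task.Run always queues the work to the thread pool
        // (TaskScheduler.Default).
        await Task.Run(() => Console.WriteLine("Task.Run: thread pool"));

        // Task.Factory.StartNew uses TaskScheduler.Current, which can differ
        // when called from within a task on a custom scheduler. Passing the
        // scheduler explicitly removes the ambiguity:
        await Task.Factory.StartNew(
            () => Console.WriteLine("StartNew: explicit scheduler"),
            CancellationToken.None,
            TaskCreationOptions.DenyChildAttach,
            TaskScheduler.Default);
    }
}
```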

Does Task.WhenAll run in parallel?

Consider the Task.WhenAll() method in .NET Core, used after awaiting each call in turn: this will upload the first file, then the next file. There is no parallelism here, as "async Task" does not automatically make something run in parallel.
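To make the distinction concrete, here is a hedged sketch; UploadAsync, the file names, and the Task.Delay stand-in are invented for illustration:

```csharp
using System;
using System.Threading.Tasks;

class WhenAllDemo
{
    static async Task UploadAsync(string file)
    {
        await Task.Delay(100);               // stand-in for a real upload
        Console.WriteLine($"uploaded {file}");
    }

    static async Task Main()
    {
        // Sequential: the second upload starts only after the first finishes.
        await UploadAsync("a.txt");
        await UploadAsync("b.txt");

        // Concurrent: start both tasks first, then await them together.
        // Task.WhenAll does not start anything itself; it only waits.
        Task first = UploadAsync("a.txt");
        Task second = UploadAsync("b.txt");
        await Task.WhenAll(first, second);
    }
}
```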

What is TPL in programming?

The Task Parallel Library (TPL) is a set of public types and APIs in the System.Threading and System.Threading.Tasks namespaces. The purpose of the TPL is to make developers more productive by simplifying the process of adding parallelism and concurrency to applications.

Which of the following is not handled by the Task Parallel Library (TPL)?

The Task Parallel Library does not handle race conditions by default.


2 Answers

I agree with others that TPL Dataflow sounds like a good solution for this.

To limit the processing, you could create a TransformBlock that doesn't actually transform the data in any way; it just delays each item if it arrives too soon after the previous one:

static IPropagatorBlock<T, T> CreateDelayBlock<T>(TimeSpan delay)
{
    DateTime lastItem = DateTime.MinValue;
    return new TransformBlock<T, T>(
        async x =>
        {
            // Wait until at least `delay` has elapsed since the previous item.
            var waitTime = lastItem + delay - DateTime.UtcNow;
            if (waitTime > TimeSpan.Zero)
                await Task.Delay(waitTime);

            lastItem = DateTime.UtcNow;

            return x;
        },
        // BoundedCapacity = 1 keeps the block non-greedy, so back-pressure
        // propagates upstream instead of items queuing up here.
        new ExecutionDataflowBlockOptions { BoundedCapacity = 1 });
}

Then create a method that produces the data (for example integers starting from 0):

static async Task Producer(ITargetBlock<int> target)
{
    int i = 0;
    while (await target.SendAsync(i))
        i++;
}

It's written asynchronously, so that if the target block isn't able to process the items right now, it will wait.

Then write a consumer method:

static void Consumer(int i)
{
    Console.WriteLine(i);
}

And finally, link it all together and start it up:

var delayBlock = CreateDelayBlock<int>(TimeSpan.FromMilliseconds(500));

var consumerBlock = new ActionBlock<int>(
    (Action<int>)Consumer,
    new ExecutionDataflowBlockOptions { MaxDegreeOfParallelism = DataflowBlockOptions.Unbounded });

delayBlock.LinkTo(consumerBlock, new DataflowLinkOptions { PropagateCompletion = true });

Task.WaitAll(Producer(delayBlock), consumerBlock.Completion);

Here, delayBlock will accept at most one item every 500 ms and the Consumer() method can run multiple times in parallel. To finish processing, call delayBlock.Complete().

If you want to add some caching per your #2, you could create another TransformBlock, do the work there, and link it to the other blocks.
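One possible sketch of such a deduplicating block, assuming requests are keyed by a string and that callServiceAsync stands in for your real web-service call (both are assumptions, not part of the answer above):

```csharp
using System;
using System.Collections.Concurrent;
using System.Threading.Tasks;
using System.Threading.Tasks.Dataflow;

static class DedupBlock
{
    // A block that coalesces duplicate in-flight requests: if an identical
    // request is already running, later callers await the same task rather
    // than issuing a second call.
    public static TransformBlock<string, string> Create(
        Func<string, Task<string>> callServiceAsync)
    {
        var pending = new ConcurrentDictionary<string, Task<string>>();

        return new TransformBlock<string, string>(async request =>
        {
            // GetOrAdd ensures at most one stored task per distinct key.
            // (Under a race the factory can run more than once, but only
            // one resulting task is kept and awaited by everyone.)
            var task = pending.GetOrAdd(request, callServiceAsync);
            try
            {
                return await task;
            }
            finally
            {
                // Drop the entry once finished so a later identical
                // request is executed fresh rather than served stale.
                pending.TryRemove(request, out _);
            }
        },
        new ExecutionDataflowBlockOptions
        {
            MaxDegreeOfParallelism = DataflowBlockOptions.Unbounded
        });
    }
}
```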

svick answered Sep 29 '22 08:09


Honestly I would work at a higher level of abstraction and use the TPL Dataflow API for this. The only catch is you would need to write a custom block that will throttle the requests at the rate you need because, by default, blocks are "greedy" and will just process as fast as possible. The implementation would be something like this:

  1. Start with a BufferBlock<T> which is the logical block that you would post to.
  2. Link the BufferBlock<T> to a custom block which has the knowledge of requests/sec and throttling logic.
  3. Link the custom block from 2 to your ActionBlock<T>.

I don't have the time to write the custom block for #2 right this second, but I will check back later and try to fill in an implementation for you if you haven't already figured it out.
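The three steps above can be sketched as follows. The 500 ms interval, the int payload, and the simplified throttle (a pass-through TransformBlock that inserts a fixed delay rather than tracking an exact requests/sec budget) are illustrative assumptions, not the implementation the answer promised:

```csharp
using System;
using System.Threading.Tasks;
using System.Threading.Tasks.Dataflow;

class ThrottledPipeline
{
    static async Task Main()
    {
        var interval = TimeSpan.FromMilliseconds(500); // ~2 requests/sec

        // 1. The buffer block you logically post requests to.
        var buffer = new BufferBlock<int>();

        // 2. A custom throttling block: passes items through unchanged, but
        //    no faster than one per interval. BoundedCapacity = 1 keeps it
        //    non-greedy, so back-pressure reaches the buffer.
        var throttle = new TransformBlock<int, int>(async x =>
        {
            await Task.Delay(interval);
            return x;
        },
        new ExecutionDataflowBlockOptions { BoundedCapacity = 1 });

        // 3. The action block that would perform the actual web request.
        var worker = new ActionBlock<int>(i => Console.WriteLine($"request {i}"));

        var link = new DataflowLinkOptions { PropagateCompletion = true };
        buffer.LinkTo(throttle, link);
        throttle.LinkTo(worker, link);

        for (int i = 0; i < 5; i++)
            buffer.Post(i);

        buffer.Complete();
        await worker.Completion;
    }
}
```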

Drew Marsh answered Sep 29 '22 06:09