I have a SnowFlake script for Python, and I convert it to a Raku module, and call it 10,000,000 times, and it is very slow (file test.raku): <pre class="prettyprint"><code>use IdWorker; my $worker = IdWorker.new(worker_id => 10, sequence => 0); my @ids = gather for (1...10000000) { take $worker.get_id() }; my $duration = now - INIT now; say sprintf("%-8s %-8s %-20s", @ids.elems, Set(@ids).elems, $duration); </code></pre> As @codesections's answer says, it's <code>now</code> that takes so much time. Python takes about 12 seconds, while Raku takes minutes. How can I fix this? This empty for loop takes about 0.12 seconds: <pre class="prettyprint"><code>for (1...10000000) { ; } </code></pre> And the call <code>get_id()</code> on <code>$worker</code> takes minutes: <pre class="prettyprint"><code>for (1...10000000) { $worker.get_id(); } </code></pre>

I believe that the issue here does not come from constructing the array but rather from <code>now</code> itself – which seems to be oddly slow. For example, this code: <pre class="prettyprint"><code>no worries; # skip printing warning for useless `now` for ^10_000_000 { now } say now - INIT now; </code></pre> also takes minutes to run. This strikes me as a bug, and I'll open an issue [Edit: I located rakudo/rakudo#3620 on this issue. The good news is that there's already a plan for a fix.] Since your code calls <code>now</code> multiple times in each iteration, this issue impacts your loop even more. Apart from that, there are a few other areas where you could speed this code up: First, using an implicit return (that is, changing <code>return new_id;</code> to just <code>new_id</code>, and making similar changes for the other places where you use <code>return</code>) is generally slightly faster/lets the JIT optimize a bit better. Second, the line <pre class="prettyprint"><code>my @ids = gather for (1...10000000) { take $worker.get_id() }; </code></pre> is somewhat wastefully using <code>gather</code>/<code>take</code> (which adds support for lazy lists and is just a more complex construct). You can simplify this into <pre class="prettyprint"><code>my @ids = (1...10000000).map: { $worker.get_id() }; </code></pre> (This still constructs an intermediate <code>Seq</code>, though.) Third – and this one is more major from a performance impact, though literally as small as it's possible to be from a code change perspective – is to change the <code>(1...10000000)</code> into <code>(1..10000000)</code>. The difference is that <code>...</code> is the sequence operator while <code>..</code> is the range operator. Sequences have some supper powers compared to Ranges (see the docs if you're curious), but are significantly slower to iterate over in a loop like this. Again, though, these are minor issues; I believe the performance of <code>now</code> is the largest problem. The long-term solution for <code>now</code> being slow is for it to be fixed (we're working on it!) As a temporary workaround, though, if you don't mind dipping into a slightly lower level than is generally advisable for user code, you can use <code>nqp::time_n</code> to get a floating point number of seconds for the current time. Using this would make your <code>get_timestamp</code> method look like: <pre class="prettyprint"><code>method get_timestamp() { use nqp; (nqp::time_n() * 1000).Int; } </code></pre> With this workaround and the other refactorings I suggested above, your code now executes in around 55 seconds on my machine – still not nearly as fast as I'd like Raku to be, but well over an order of magnitude better than where we started.

`now` becomes slow in a 10 million iterations loop

Tags:

performance

raku

I have a SnowFlake script for Python, and I convert it to a Raku module, and call it 10,000,000 times, and it is very slow (file test.raku):

use IdWorker;

my $worker = IdWorker.new(worker_id => 10, sequence => 0);
my @ids = gather for (1...10000000) { take $worker.get_id() };

my $duration = now - INIT now;
say sprintf("%-8s %-8s %-20s", @ids.elems, Set(@ids).elems, $duration);

As @codesections's answer says, it's now that takes so much time.

Python takes about 12 seconds, while Raku takes minutes. How can I fix this?

This empty for loop takes about 0.12 seconds:

for (1...10000000) {
    ;
}

And the call get_id() on $worker takes minutes:

for (1...10000000) {
    $worker.get_id();
}

260

asked Mar 10 '21 13:03

chenyf

Video Answer

1 Answers

I believe that the issue here does not come from constructing the array but rather from now itself – which seems to be oddly slow.

For example, this code:

no worries; # skip printing warning for useless `now`
for ^10_000_000 { now }
say now - INIT now;

also takes minutes to run. This strikes me as a bug, and I'll open an issue [Edit: I located rakudo/rakudo#3620 on this issue. The good news is that there's already a plan for a fix.] Since your code calls now multiple times in each iteration, this issue impacts your loop even more.

Apart from that, there are a few other areas where you could speed this code up:

First, using an implicit return (that is, changing return new_id; to just new_id, and making similar changes for the other places where you use return) is generally slightly faster/lets the JIT optimize a bit better.

Second, the line

my @ids = gather for (1...10000000) { take $worker.get_id() };

is somewhat wastefully using gather/take (which adds support for lazy lists and is just a more complex construct). You can simplify this into

my @ids = (1...10000000).map: { $worker.get_id() };

(This still constructs an intermediate Seq, though.)

Third – and this one is more major from a performance impact, though literally as small as it's possible to be from a code change perspective – is to change the (1...10000000) into (1..10000000). The difference is that ... is the sequence operator while .. is the range operator. Sequences have some supper powers compared to Ranges (see the docs if you're curious), but are significantly slower to iterate over in a loop like this.

Again, though, these are minor issues; I believe the performance of now is the largest problem.

The long-term solution for now being slow is for it to be fixed (we're working on it!) As a temporary workaround, though, if you don't mind dipping into a slightly lower level than is generally advisable for user code, you can use nqp::time_n to get a floating point number of seconds for the current time. Using this would make your get_timestamp method look like:

method get_timestamp() {
    use nqp;
    (nqp::time_n() * 1000).Int;
}

With this workaround and the other refactorings I suggested above, your code now executes in around 55 seconds on my machine – still not nearly as fast as I'd like Raku to be, but well over an order of magnitude better than where we started.

answered Oct 23 '22 01:10

codesections

Related questions
                            
                                When should you use a script loader?
                            
                                Why does this V8/Javascript code perform so badly?
                            
                                Request and basic profiling information for Flask
                            
                                Is it better to maintain a separate count table vs running count query every time?
                            
                                UIView performance: opaque, backgroundColor, clearsContextBeforeDrawing?
                            
                                Xcode 4.3.2 and 100% CPU constantly in the idle time
                            
                                Entity Framework startup time
                            
                                Web application very slow in Tomcat 7
                            
                                Using WCF from WPF very slow on first use
                            
                                How can I Subtract these lists faster?
                            
                                DECLARE GLOBAL TEMPORARY TABLE Vs CREATE GLOBAL TEMPORARY TABLE in DB2
                            
                                Correct SQL index for Partition + Order to remove SORT
                            
                                Storing data into session and storing to database upon "major" action
                            
                                Efficiency of unfoldr versus zipWith
                            
                                Do C++11 delegated ctors perform worse than C++03 ctors calling init functions?
                            
                                Does Prolog have an alias "operator" like Haskell?
                            
                                Is slicing really slower in Python 3.4?
                            
                                how to measure loading time in Angular2?
                            
                                Is there a performance gain by using lambda expressions?
                            
                                Why Pylint is too slow while pep8 just takes a second to check the same code?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With