Speed of cos() and sin() function in GLSL shaders?

Tags:

optimization

opengl

glsl

shader

jogl

I'm interested in information about the speed of sin() and cos() in the OpenGL Shading Language (GLSL).

The GLSL Specification Document indicates that:

The built-in functions basically fall into three categories:

...

...

They represent an operation graphics hardware is likely to accelerate at some point. The trigonometry functions fall into this category.

EDIT:

As has been pointed out, counting clock cycles of individual operations like sin() and cos() doesn't really tell the whole performance story.

So to clarify my question, what I'm really interested in is whether it's worthwhile to optimize away sin() and cos() calls for common cases.

For example, in my application it'll be very common for the argument to be 0. So does something like this make sense:

float sina, cosa;

if ( rotation == 0.0 )
{
   sina = 0.0;
   cosa = 1.0;
}
else
{
   sina = sin( rotation );
   cosa = cos( rotation );
}

Or will the GLSL compiler or the sin() and cos() implementations take care of optimizations like that for me?

202

asked Apr 14 '12 15:04

ulmangt

4 Answers

For example, in my application it'll be very common for the argument to be 0. So does something like this make sense:

No.

Your compiler will do one of two things.

1. It will issue an actual conditional branch. In the best possible case, if 0 is a value that is coherent locally (such that groups of shaders will often hit 0 or non-zero together), then you might get improved performance.
2. It will evaluate both sides of the condition, and only store the result for the correct one of them. In which case, you've gained nothing.
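
To make the second case concrete, here is a rough sketch of what "evaluate both sides and keep one" looks like when written out by hand. The helper name `sinCosSelect` is hypothetical, and whether a particular compiler emits a real branch or something like this select depends on the driver and hardware.

```glsl
// Hypothetical helper: returns vec2(sin, cos) for the given rotation.
// It mimics the "compute both sides, store one" code a compiler may emit.
vec2 sinCosSelect(float rotation)
{
    // Both candidate results are computed unconditionally...
    vec2 shortcut = vec2(0.0, 1.0);                      // values for rotation == 0
    vec2 full     = vec2(sin(rotation), cos(rotation));  // general case

    // ...and only afterwards is one of them kept.
    return (rotation == 0.0) ? shortcut : full;
}
```

Written this way, the shortcut does at least as much work as calling sin() and cos() unconditionally, which is why nothing is gained in that case.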

In general, it's not a good idea to use conditional logic to dance around small performance costs like this. The saving needs to be really big to be worthwhile, like a discard or something.

Also, note that an exact floating-point equality test is unlikely to work unless you actually pass a uniform or vertex attribute containing exactly 0.0 to the shader. Even interpolating between 0 and a non-zero value will likely never produce exactly 0 for any fragment.
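
One way to see this for yourself is a small debug fragment shader along the following lines. It is only a sketch: the names `u_rotation` and `v_rotation` are assumptions, standing for a rotation uploaded by the application as a uniform and a rotation interpolated from differing per-vertex values.

```glsl
#version 120

uniform float u_rotation;  // hypothetical: set directly by the application
varying float v_rotation;  // hypothetical: interpolated across the primitive

void main()
{
    // Meaningful only if the application uploaded exactly 0.0.
    float uniformIsZero = (u_rotation == 0.0) ? 1.0 : 0.0;

    // When the per-vertex values differ, the interpolated value is
    // usually some small non-zero number, so this test rarely passes.
    float varyingIsZero = (v_rotation == 0.0) ? 1.0 : 0.0;

    // Debug output: red = uniform test passed, green = varying test passed.
    gl_FragColor = vec4(uniformIsZero, varyingIsZero, 0.0, 1.0);
}
```

With a uniform of exactly 0.0, the red channel should light up everywhere, while the green channel would be expected to stay dark for most or all fragments whenever the vertex values are not all zero.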

132

answered Oct 23 '22 16:10

Nicol Bolas

This is a good question. I too wondered this.

The links I found via Google say that cos and sin have been single-cycle operations on mainstream cards since 2005 or so.

answered Oct 23 '22 15:10

Will

You'd have to test this out yourself, but I'm pretty sure that branching in a shader is far more expensive than a sin or cos calculation. GLSL compilers are pretty good at optimizing shaders, so worrying about this is premature optimization. If you later find that your shaders are the bottleneck for the whole program, then you can worry about optimizing this.

If you want to take a look at the assembly code of your shader for a specific platform, I would recommend AMD GPU ShaderAnalyzer.

answered Oct 23 '22 16:10

Robert Rouhani

Not sure if this answers your question, but it's very difficult to tell you how many clocks/slots an instruction takes, as it depends very much on the GPU. Usually it's a single cycle. But even if not, the compiler may rearrange the order of instruction execution to hide the true cost. It's certainly slower to use texture lookups for sin/cos than it is to execute the instructions.

answered Oct 23 '22 16:10

Robinson
