Does GLSL really do unnecessary computations with uniform (not per-vertex) values?

Tags: optimization, compiler-optimization, opengl, glsl

For example, if I use a vertex shader like the following:

#version 400 core

uniform mat4 projM;
uniform mat4 viewM;
uniform mat4 modelM;

in vec4 in_Position;

out vec4 pass_position_model;

void main(void) {
    gl_Position = projM * viewM * modelM * in_Position;
    pass_position_model = modelM * in_Position;
}

Will it perform the projM * viewM * modelM matrix multiplication for each vertex, or is it smart enough to calculate it once and not recalculate it until the uniform variables change? If it isn't "smart enough", is there a way to optimize this other than computing all uniform-dependent values on the CPU and sending them to the GPU as uniform variables?
I'm also interested in solutions that can later be ported to OpenGL ES 2.0 without problems.

341

asked Mar 17 '13 14:03

Display Name

1 Answer

As I understand it, there is no general answer. I did run some tests on my own hardware, though. I have two GPUs available: an Intel HD Graphics 3000 and an NVIDIA GeForce GT 555M. I tested my program (written in Java/Scala) with the matrix multiplication done in the vertex shader, then moved the multiplication to the CPU side and tested again.

(sphereN is a continuously rotating sphere made of 2*N^2 quads, drawn with glDrawElements(GL_QUADS, ...) using one texture and no lighting or other effects.)

matrix multiplication in vertex shader:

intel:
    sphere400: 57.17552887364208 fps
    sphere40: 128.1394156842645 fps
nvidia:
    sphere400: 134.9527665317139 fps
    sphere40: 242.0135527589545 fps

matrix multiplication on cpu:

intel:
    sphere400: 57.37234652897303 fps
    sphere40: 128.2051282051282 fps
nvidia:
    sphere400: 142.28799089356858 fps
    sphere40: 247.1576866040534 fps

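The tests show that multiplying uniform matrices in the vertex shader is a bad idea, at least on this hardware: moving the multiplication to the CPU made no meaningful difference on the Intel GPU (under 1%), but gained about 5% on the NVIDIA GPU for sphere400 and about 2% for sphere40. So in general one cannot rely on the GLSL compiler to perform this optimization.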
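
To illustrate what "multiplication on the CPU" can look like, here is a minimal Java sketch. It is not the code used in the tests above. The class name UniformPrecompute, the helper methods, the uniform name mvpM, and the placeholder matrices are all assumptions made for the example. Matrices are assumed to be float[16] arrays in column-major order, which is the layout OpenGL expects when uploading with transpose set to false.

```java
import java.util.Arrays;

final class UniformPrecompute {
    // Shader variant that takes the combined matrix as one uniform.
    // "mvpM" is a hypothetical name; the original shader has no such uniform.
    static final String VERTEX_SHADER =
        "#version 400 core\n" +
        "uniform mat4 mvpM;   // projM * viewM * modelM, computed once on the CPU\n" +
        "uniform mat4 modelM;\n" +
        "in vec4 in_Position;\n" +
        "out vec4 pass_position_model;\n" +
        "void main(void) {\n" +
        "    gl_Position = mvpM * in_Position;\n" +
        "    pass_position_model = modelM * in_Position;\n" +
        "}\n";

    /** Returns the 4x4 identity matrix. */
    static float[] identity() {
        float[] m = new float[16];
        m[0] = m[5] = m[10] = m[15] = 1f;
        return m;
    }

    /** Returns a translation matrix (column-major: offsets live at indices 12..14). */
    static float[] translation(float x, float y, float z) {
        float[] m = identity();
        m[12] = x;
        m[13] = y;
        m[14] = z;
        return m;
    }

    /** Returns a * b for column-major 4x4 matrices (element (row, col) is at col * 4 + row). */
    static float[] multiply(float[] a, float[] b) {
        float[] r = new float[16];
        for (int col = 0; col < 4; col++) {
            for (int row = 0; row < 4; row++) {
                float sum = 0f;
                for (int k = 0; k < 4; k++) {
                    sum += a[k * 4 + row] * b[col * 4 + k];
                }
                r[col * 4 + row] = sum;
            }
        }
        return r;
    }

    /** Returns m * v for a column-major 4x4 matrix and a 4-component vector. */
    static float[] transform(float[] m, float[] v) {
        float[] r = new float[4];
        for (int row = 0; row < 4; row++) {
            for (int k = 0; k < 4; k++) {
                r[row] += m[k * 4 + row] * v[k];
            }
        }
        return r;
    }

    public static void main(String[] args) {
        // Placeholder matrices; a real program would use its own projection and camera.
        float[] projM = identity();
        float[] viewM = translation(0f, 0f, -5f);
        float[] modelM = translation(1f, 2f, 3f);

        // Done once per draw call (or whenever a matrix changes), not once per vertex.
        float[] mvpM = multiply(multiply(projM, viewM), modelM);

        // Sanity check: the combined matrix moves the origin the same way the chain would.
        float[] p = transform(mvpM, new float[] {0f, 0f, 0f, 1f});
        System.out.println(Arrays.toString(p)); // prints [1.0, 2.0, -2.0, 1.0]
    }
}
```

The resulting mvpM array would then be uploaded with the glUniformMatrix4fv call of whichever Java binding is in use, with transpose set to false; that call is omitted here because it depends on the binding. For an OpenGL ES 2.0 port, the shader string would also need the older attribute/varying qualifiers and a matching version directive instead of in/out and #version 400 core.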

123

answered Nov 01 '22 06:11

Display Name
