OpenGL ES 渲染性能

发布于 2024-12-12 02:53:49 字数 865 浏览 0 评论 0原文

我有一个关于 OpenGL ES 下渲染性能的简单问题。

假设我正在 iPhone 或 Samsung Galaxy S 等移动设备上渲染一个简单的 2D 粒子系统，假设有 1000 个粒子。

所有粒子都从相同的纹理渲染。粒子在其生命周期中会缩放和旋转。我们这里讨论的是 OpenGL ES。

更实用的方法是什么：

1）设置一批顶点并将每个粒子变换到其中（使用CPU进行所需的变换），然后对glDrawArrays进行1次调用以一次绘制所有粒子。

2）使用（伪！）代码绘制每个粒子，如下所示：

glPushMatrix();         
glColor4f(_act_color.r, _act_color.g, _act_color.b, _act_color.a);  
glTranslatef(_pos.x, _pos.y, 0.0f);
glRotatef(_rot, 0, 0, 1);
glVertexPointer(2, GL_FLOAT, sizeof(vertexVT), &verBuf[0].v[0]);
glTexCoordPointer(2, GL_FLOAT, sizeof(vertexVT), &verBuf[0].t[0]);
glDrawArrays(GL_TRIANGLE_STRIP, 0, 4);
glPopMatrix();

哪种方式更好。选择第一种方式时，它需要更多的 CPU 能力，但它在所有设备上的行为应该相同。第一种方法的一个缺点是我会得到一些顶点开销，因为我必须在每个粒子之间使用“退化”顶点。

第二种方式是在硬件中进行转换，但是所有 Open GL 突击队在不同平台上的行为方式是否相同？

您对每项实施有何看法？我想展示每种方式的优点和缺点。

原文

I have a simple question concerning the render performance under OpenGL ES.

Lets assume i am rendering a simple 2D particle system, with lets say 1000 particles, on a mobile device like an iPhone or Samsung Galaxy S.

All particles are rendered from the same textures.
Particles get scaled and rotated during their lifecycle.
We are talking about OpenGL ES here.

What is the more practicable way:

1) Setup a batch of vertices and transform each particle into it ( using the CPU to do the required transformation) then do 1 single call to glDrawArrays to draw all particles at once.

2) Draw each single particle using (pseudo!) code like this:

glPushMatrix();         
glColor4f(_act_color.r, _act_color.g, _act_color.b, _act_color.a);  
glTranslatef(_pos.x, _pos.y, 0.0f);
glRotatef(_rot, 0, 0, 1);
glVertexPointer(2, GL_FLOAT, sizeof(vertexVT), &verBuf[0].v[0]);
glTexCoordPointer(2, GL_FLOAT, sizeof(vertexVT), &verBuf[0].t[0]);
glDrawArrays(GL_TRIANGLE_STRIP, 0, 4);
glPopMatrix();

Which way is better. When choosing the first way, it requires more CPU power, but it should behave the same on all devices. One withdraw of the first way will be that I get some vertex overhead because I have to use "degenerated" vertices between every particle.

Second way does transformation in HW but will all the Open GL commandos behave the same way on different platforms?

What is your opinion to each implementation? I would like to show up the pros and contras of each way.

分享到QQ

分享到微博