How are 3D games so efficient? [closed]

17

190

There is something I have never understood. How can a great big PC game like GTA IV use 50% of my CPU and run at 60fps while a DX demo of a rotating Teapot @ 60fps uses a whopping 30%?

Volvulus answered 7/2, 2010 at 17:19 Comment(6)

I don't see what's wrong with this question -- it's perfectly natural to be curious about how other developers have accomplished certain things. We should be encouraging this sort of curiosity, not punishing it with close votes. – Uniaxial 7/2, 2010 at 17:31

@user146780: who asked the question... The best programmers I've met were working in CGI. Gurus from SGI, people working on parallelizing Adobe Photoshop, etc. People here don't realize how complicated it is to write a modern game nor how skilled these coders are. If you want a humbling experience, look at what the Germans from Crytek did with the Crysis engine. There are videos on YouTube. You simply won't believe it. It's not just about "using octrees". Typically these programmers are simply much more skilled than the average programmer. And you can bet that the GTA4 coders are very good. – Transfigure 7/2, 2010 at 18:50

You got GTA4 running at 60fps!? GW! GTA4 is a P.O.S. that runs quite poorly; I've heard Force Unleashed does too. I'd say Euphoria is the culprit. Honestly, "CPU usage" is a very poor way to compare: simply uncap the frame rate and see which one runs fastest, that's the proper way to do it. Also, remember that while this "complicated game" might render lots of stuff, there is still only a screen's worth of stuff, and if it's rendered in the right order, you might end up with nearly the same amount of pixel work as your "simple" demo, and pixel work is really what kills it. – Strenta 8/2, 2010 at 7:11

You need a profiler that shows you how much the GPU (Graphics Processing Unit) is used. I bet GTA IV shows you ~99%, and the demo 3%. – Desulphurize 7/7, 2010 at 1:53

From experience, about 10% of the game programmers I've worked with were any good, the rest were average at best. Some were utterly incompetent. – Innuendo 23/9, 2010 at 12:31

You could write a while loop that prints "hello" that would take up 50% CPU as well (if you have 2 cores). If you have 4 cores, then that same while(1){ puts("Hello"); } program would take up 25% CPU, regardless of how "hard" that code is to execute. It's not that drawing a teapot is "hard" for the CPU/GPU, it's that you are using a rapidly iterating while loop to do it, causing a CPU core to be tied up with the task. – Lifesize 9/7, 2011 at 15:16

70

In general, it's because

The games are being optimal about what they need to render, and
They take special advantage of your hardware.

For instance, one easy optimization you can make involves not actually trying to draw things that can't be seen. Consider a complex scene like a cityscape from Grand Theft Auto IV. The renderer isn't actually rendering all of the buildings and structures. Instead, it's rendering only what the camera can see. If you could fly around to the back of those same buildings, facing the original camera, you would see a half-built hollowed-out shell structure. Every point that the camera cannot see is not rendered -- since you can't see it, there's no need to try to show it to you.
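As a rough illustration of "don't draw what can't be seen", here is a minimal sketch of two common visibility tests: rejecting whole objects whose bounding sphere lies outside the view volume, and rejecting triangles that face away from the camera. All types and function names (`Vec3`, `Plane`, `Sphere`, `sphereVisible`, `facesCamera`) are hypothetical and not taken from any particular engine, and the plane normals are assumed to be unit length. Real games typically layer more elaborate schemes on top of checks like these.

```cpp
#include <cassert>
#include <vector>

// Minimal 3D vector. Every name in this sketch is illustrative only.
struct Vec3 { float x, y, z; };

inline Vec3 operator-(Vec3 a, Vec3 b) { return {a.x - b.x, a.y - b.y, a.z - b.z}; }
inline float dot(Vec3 a, Vec3 b) { return a.x * b.x + a.y * b.y + a.z * b.z; }

// A plane given by a unit normal n and offset d.
// Points p with dot(n, p) + d >= 0 count as being on the visible side.
struct Plane { Vec3 n; float d; };

// Bounding sphere around an object (assumed to be precomputed per model).
struct Sphere { Vec3 center; float radius; };

// View-volume culling: the object can be skipped as soon as its bounding
// sphere lies entirely behind any one of the planes bounding the view.
inline bool sphereVisible(const Sphere& s, const std::vector<Plane>& viewPlanes) {
    for (const Plane& p : viewPlanes) {
        if (dot(p.n, s.center) + p.d < -s.radius) {
            return false;  // completely outside this plane
        }
    }
    return true;  // inside or straddling every plane
}

// Back-face culling: a triangle whose normal points away from the camera
// shows only its back side, so it can be skipped for closed, opaque meshes.
inline bool facesCamera(Vec3 triangleNormal, Vec3 pointOnTriangle, Vec3 cameraPos) {
    return dot(triangleNormal, cameraPos - pointOnTriangle) > 0.0f;
}
```

A renderer built this way would call `sphereVisible` once per object before submitting any of its geometry, so an entire building behind the camera costs one cheap test instead of thousands of triangles.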

Furthermore, optimized instructions and special techniques exist when you're developing against a particular set of hardware, to enable even better speedups.

The other part of your question is why a demo uses so much CPU:

... while a DX demo of a rotating Teapot @ 60fps uses a whopping 30%?

It's common for demos of graphics APIs (like dxdemo) to fall back to what's called a software renderer when your hardware doesn't support all of the features needed to show a pretty example. These features might include things like shadows, reflection, ray-tracing, physics, et cetera.

The software renderer mimics a completely full-featured hardware device, one that is unlikely to exist, in order to show off all the features of the API. But since that hardware doesn't actually exist, the work runs on your CPU instead. That's far less efficient than delegating to a graphics card -- hence your high CPU usage.

Uniaxial answered 7/2, 2010 at 17:22 Comment(8)

A DX demo uses your hardware, too. So, what's 'special'? – Tiemannite 7/2, 2010 at 17:24

but a demo is unlikely to be optimal about it. – Stat 7/2, 2010 at 17:25

@tur1ng, the teapot demo, for example, may have enabled reflection, shadows and other effects. – Melliemelliferous 7/2, 2010 at 17:27

The teapot might have more polygons than a GTA4 scene. The fact is, the current bottleneck in graphics rendering is more about texture effects, like techniques derived from bump mapping that add detail, and other post-rendering effects. – Mailbox 7/2, 2010 at 17:31

@Klaim: That's true. I'm implicitly assuming above that the teapot is comparatively easier to render than the GTA4 scene. – Uniaxial 7/2, 2010 at 17:32

Textures - the teapot is being created from a large number of individual triangles all with normals and lighting interactions. What looks like an insanely complex 3d world in the game is often fairly simple large blocks covered with a detailed picture. A lot of the '3d' is clever shadow and perspective artistic effects in a static 2d image drawn on the 3d shape – Invitatory 7/2, 2010 at 19:2

True, but it does not answer the question - see my answer about vsync ;-) – Saury 11/7, 2010 at 21:30

Big studios get custom hardware support through development teams and driver patches after release. – Jarib 24/5, 2014 at 22:28

99

Patience, technical skill and endurance.

The first point is that a DX demo is primarily a teaching aid, so it's written for clarity, not speed of execution.

It's a pretty big subject to condense but games development is primarily about understanding your data and your execution paths to an almost pathological degree.

Your code is designed around two things: your data and your target hardware.
The fastest code is the code that never gets executed. Sort your data into batches and only do expensive operations on the data that needs them.
How you store your data is key. Aim for contiguous access; this allows you to batch process at high speed.
Parallelise everything you possibly can.
Modern CPUs are fast, modern RAM is very slow. Cache misses are deadly.
Push as much to the GPU as you can. It has fast local memory so it can blaze through the data, but you need to help it out by organising your data correctly.
Avoid doing lots of render-state switches (again, batch similar vertex data together), as these cause the GPU to stall.
Swizzle your textures and ensure their dimensions are powers of two. This improves texture cache performance on the GPU.
Use levels of detail as much as you can: keep low/medium/high versions of 3D models and switch between them based on distance from the camera/player. There is no point rendering a high-res version if it's only 5 pixels on screen.
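Two of the points above, batching by render state and level-of-detail selection, can be sketched in a few lines. This is only an illustration under stated assumptions: `DrawCall`, `stateKey`, `batchByState`, `selectLod` and the distance thresholds are all made-up names and values, with `stateKey` standing in for whatever combination of shader, texture and blend state a real renderer would switch on.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical draw request. stateKey stands in for a shader/texture/blend
// combination; meshId identifies the geometry to draw.
struct DrawCall {
    std::uint32_t stateKey;
    std::uint32_t meshId;
};

// Count how often the render state changes if draws are issued in this order.
inline std::size_t countStateSwitches(const std::vector<DrawCall>& draws) {
    std::size_t switches = 0;
    for (std::size_t i = 1; i < draws.size(); ++i) {
        if (draws[i].stateKey != draws[i - 1].stateKey) {
            ++switches;
        }
    }
    return switches;
}

// Batch the draws: a stable sort by state key puts draws that share a state
// next to each other while keeping their relative order within each state.
inline void batchByState(std::vector<DrawCall>& draws) {
    std::stable_sort(draws.begin(), draws.end(),
                     [](const DrawCall& a, const DrawCall& b) {
                         return a.stateKey < b.stateKey;
                     });
}

enum class Lod { High, Medium, Low };

// Pick a model version from the distance to the camera. The default
// thresholds are placeholders; a real game would tune them per asset.
inline Lod selectLod(float distance, float mediumFrom = 50.0f, float lowFrom = 200.0f) {
    if (distance >= lowFrom) return Lod::Low;
    if (distance >= mediumFrom) return Lod::Medium;
    return Lod::High;
}
```

For example, five draws whose state keys alternate 2, 1, 2, 1, 2 cause four state switches when issued in that order, but only one after `batchByState`. Sorting purely by state is a simplification: transparent geometry usually has to be drawn in depth order, so real renderers sort on a combined key.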

Cummine answered 7/2, 2010 at 17:42 Comment(0)
