How prevalent is branch prediction on current CPUs?

Because of its huge impact on performance, I never wonder whether my present-day desktop CPU has branch prediction. Of course it does. But what about the various ARM offerings? Do iPhones or Android phones have branch prediction? The older Nintendo DS? How about the PowerPC-based Wii? The PS3?

Whether they have a complex prediction unit is not so important. What matters is whether they have at least some dynamic prediction, and whether they speculatively execute instructions that follow a predicted branch.

What is the cutoff for CPUs with branch prediction? A handheld calculator from decades ago obviously doesn't have it, while my desktop does. But can anyone outline more clearly where one can expect dynamic branch prediction?

To be clear, I mean the kind of prediction that adapts at runtime: as the behavior of the condition changes, so does the predicted path.

asked Nov 23 '11 11:11

porgarmingduod

1 Answer

Any CPU with a pipeline longer than a few stages needs at least some primitive branch prediction; otherwise it has to stall at every conditional branch until the result that decides the direction has been computed. The Intel Atom is an in-order core, but it has a fairly deep pipeline and therefore needs a pretty decent branch predictor.
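To make "primitive branch prediction" concrete, here is a minimal sketch of a two-bit saturating counter, a common textbook scheme for dynamic prediction. It is not tied to any particular CPU mentioned here; the function name, the initial state, and the two branch traces are assumptions chosen for illustration.

```python
def simulate_two_bit_predictor(outcomes, initial_state=2):
    """Count mispredictions of a single two-bit saturating counter.

    States 0 and 1 predict "not taken"; states 2 and 3 predict "taken".
    Each actual outcome moves the counter one step toward that outcome.
    """
    state = initial_state
    mispredictions = 0
    for taken in outcomes:
        predicted_taken = state >= 2
        if predicted_taken != taken:
            mispredictions += 1
        # Saturate at 0 and 3 so a single surprise does not flip the prediction.
        state = min(state + 1, 3) if taken else max(state - 1, 0)
    return mispredictions


# Hypothetical traces: a loop branch taken 9 times and then not taken
# (repeated 10 times), and a branch that strictly alternates.
loop_branch = ([True] * 9 + [False]) * 10
alternating = [True, False] * 50

print(simulate_two_bit_predictor(loop_branch))   # 10 misses out of 100
print(simulate_two_bit_predictor(alternating))   # 50 misses out of 100
```

Even this tiny predictor handles a typical loop branch well, missing only on each loop exit, while a strictly alternating pattern defeats it. Real predictors add history bits and tables indexed by branch address to cope with such patterns.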

Old ARM7 designs had only three pipeline stages. With a pipeline that short, or with an architecture that uses branch delay slots (required on MIPS, optional on SPARC), branch prediction isn't so useful.

Incidentally, when the MIPS designers went for more performance by lengthening the pipeline beyond the classic five stages, the branch delay slot became an annoyance. In the original design the slot was necessary because there was no branch predictor: the instruction immediately after a branch always executes, so you had to place the branch one instruction ahead of the last instruction you wanted executed before control transferred. With the longer pipeline they needed a branch predictor, which made the delay slot unnecessary, but they still had to honor it in order to run older code.

The problem with a branch delay slot is that it can be filled with a useful instruction only about 50% of the time. The rest of the time, you either fill it with an instruction whose result is likely to be thrown away, or you use a NOP.
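The trade-offs above can be put into rough numbers with a back-of-the-envelope model. This is only a sketch: the two helper functions are hypothetical, and the branch frequency, penalties, and misprediction rate are assumed values, not measurements of any real CPU. Only the 50% fill rate comes from the estimate above.

```python
def cpi_with_delay_slot(branch_fraction, fill_rate, base_cpi=1.0):
    """Cycles per useful instruction when unfilled delay slots cost one cycle."""
    return base_cpi + branch_fraction * (1.0 - fill_rate)


def cpi_with_predictor(branch_fraction, mispredict_rate, penalty_cycles,
                       base_cpi=1.0):
    """Cycles per instruction when each mispredicted branch costs a flush.

    A mispredict_rate of 1.0 models a core that always stalls on branches.
    """
    return base_cpi + branch_fraction * mispredict_rate * penalty_cycles


BRANCH_FRACTION = 0.2  # assumed: one instruction in five is a branch

# Short pipeline with one delay slot, filled usefully half the time.
short_delay_slot = cpi_with_delay_slot(BRANCH_FRACTION, fill_rate=0.5)

# Deep pipeline (assumed 15-cycle penalty), stalling on every branch.
deep_no_predictor = cpi_with_predictor(BRANCH_FRACTION, 1.0, 15)

# Same deep pipeline with a predictor that is wrong 5% of the time (assumed).
deep_with_predictor = cpi_with_predictor(BRANCH_FRACTION, 0.05, 15)

print(short_delay_slot, deep_no_predictor, deep_with_predictor)
```

Under these assumptions the short pipeline loses little without a predictor, while the deep pipeline is several times slower unless most branches are predicted correctly. That is the reason deep in-order cores still need good predictors.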

answered Nov 10 '22 17:11

Timothy Miller

Tags: cpu-architecture, branch-prediction, arm
