I want to know how Intel i7 processor's branch prediction works? Currenly, I know the predictor called "dynamic branch prediction". For 1-bit predictor: The hardware always predicts a branch instruction to take the same direction it took the last time it was executed. A refined version working better in practice is the 2-bit predictor. In order to further improve the prediction accuracy, 2-bit prediction schemes were introduced. In these schemes the prediction must be wrong twice before it is changed. Does i7 have the same predictor as the above?

Most of what we know about the branch predictor comes from testing. Intel has not released much in the way of details. The misprediction penalty is about 18 clock cycles, so accurate branch prediction is important. Intel uses a two level branch predictor. The inner level is believed to be unchanged from the Core 2 CPUs. The outer level is more sophisticated and can even correctly predict loops with fixed counts up to 64. Two 18-bit global history buffers are used. One contains all jumps that have been taken at least once. The other contains the most important jumps. (The number of entries in these buffers is unknown.) Note that indirect jumps and calls have their own predictor.

About Branch Prediction of i7

Tags:

cpu-architecture

branch

architecture

I want to know how Intel i7 processor's branch prediction works?

Currenly, I know the predictor called "dynamic branch prediction".

For 1-bit predictor: The hardware always predicts a branch instruction to take the same direction it took the last time it was executed.

A refined version working better in practice is the 2-bit predictor. In order to further improve the prediction accuracy, 2-bit prediction schemes were introduced. In these schemes the prediction must be wrong twice before it is changed.

Does i7 have the same predictor as the above?

957

asked Jun 30 '12 06:06

Fan Zhang

1 Answers

Most of what we know about the branch predictor comes from testing. Intel has not released much in the way of details. The misprediction penalty is about 18 clock cycles, so accurate branch prediction is important.

Intel uses a two level branch predictor. The inner level is believed to be unchanged from the Core 2 CPUs.

The outer level is more sophisticated and can even correctly predict loops with fixed counts up to 64. Two 18-bit global history buffers are used. One contains all jumps that have been taken at least once. The other contains the most important jumps. (The number of entries in these buffers is unknown.)

Note that indirect jumps and calls have their own predictor.

answered Oct 23 '22 17:10

David Schwartz

Related questions
                            
                                Who should learn the "old" system?
                            
                                create mention like twitter or convore with php
                            
                                Sample N-tier ASP.NET MVC3 application with best practices (using EF 4.1)
                            
                                Should our MySQL DB be separate from our Apache servers?
                            
                                Parse Cloud Code Structure
                            
                                How to avoid CORS preflight requests in Single Page Applications?
                            
                                Authorization & User info in a Service Layer (.NET application)
                            
                                Is it good practice to blank out inherited functionality that will not be used?
                            
                                Entity Framework in layered architecture
                            
                                How do architect an ASP.Net MVC app with EF?
                            
                                CRUD in DDD Application Services?
                            
                                How do you organise a Java solution into multiple projects, like in Visual Studio?
                            
                                Implementing dynamically updating upvote/downvote
                            
                                Redux data structuring
                            
                                Android MVP explanation
                            
                                Best way to connect n nodes to a single node?
                            
                                System stories for agile architecture [closed]
                            
                                CouchDB: "Database-per-user" or "One-Database-For-All" design?
                            
                                Operators and inheritance
                            
                                Using multiple allocators efficiently

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With