I've heard the phrase 'priority inversion' in reference to development of operating systems. What exactly is priority inversion? What is the problem it's meant to solve, and how does it solve it?

Imagine three (3) tasks of different priority: tLow, tMed and tHigh. tLow and tHigh access the same critical resource at different times; tMed does its own thing. <ol> <li>tLow is running, tMed and tHigh are presently blocked (but not in critical section).</li> <li>tLow comes along and enters the critical section.</li> <li>tHigh unblocks and since it is the highest priority task in the system, it runs.</li> <li>tHigh then attempts to enter the critical resource but blocks as tLow is in there.</li> <li>tMed unblocks and since it is now the highest priority task in the system, it runs.</li> </ol> tHigh can not run until tLow gives up the resource. tLow can not run until tMed blocks or ends. The priority of the tasks has been inverted; tHigh though it has the highest priority is at the bottom of the execution chain. To "solve" priority inversion, the priority of tLow must be bumped up to be at least as high as tHigh. Some may bump its priority to the highest possible priority level. Just as important as bumping up the priority level of tLow, is dropping the priority level of tLow at the appropriate time(s). Different systems will take different approaches. When to drop the priority of tLow ... <ol> <li>No other tasks are blocked on any of the resources that tLow has. This may be due to timeouts or the releasing of resources.</li> <li>No other tasks contributing to the raising the priority level of tLow are blocked on the resources that tLow has. This may be due to timeouts or the releasing of resources.</li> <li>When there is a change in which tasks are waiting for the resource(s), drop the priority of tLow to match the priority of the highest priority level task blocked on its resource(s).</li> </ol> Method #2 is an improvement over method #1 in that it shortens the length of time that tLow has had its priority level bumped. Note that its priority level stays bumped at tHigh's priority level during this period. Method #3 allows the priority level of tLow to step down in increments if necessary instead of in one all-or-nothing step. Different systems will implement different methods depending upon what factors they consider important. <ul> <li>memory footprint</li> <li>complexity</li> <li>real time responsiveness</li> <li>developer knowledge</li> </ul> Hope this helps.

Suppose an application has three threads: <pre class="prettyprint"><code>Thread 1 has high priority. Thread 2 has medium priority. Thread 3 has low priority. </code></pre> Let's assume that Thread 1 and Thread 3 share the same critical section code Thread 1 and thread 2 are sleeping or blocked at the beginning of the example. Thread 3 runs and enters a critical section. At that moment, thread 2 starts running, preempting thread 3 because thread 2 has a higher priority. So, thread 3 continues to own a critical section. Later, thread 1 starts running, preempting thread 2. Thread 1 tries to enter the critical section that thread 3 owns, but because it is owned by another thread, thread 1 blocks, waiting for the critical section. At that point, thread 2 starts running because it has a higher priority than thread 3 and thread 1 is not running. Thread 3 never releases the critical section that thread 1 is waiting for because thread 2 continues to run. Therefore, the highest-priority thread in the system, thread 1, becomes blocked waiting for lower-priority threads to run.

What is priority inversion?

4 Answers

Imagine three (3) tasks of different priority: tLow, tMed and tHigh. tLow and tHigh access the same critical resource at different times; tMed does its own thing.

tLow is running, tMed and tHigh are presently blocked (but not in critical section).
tLow comes along and enters the critical section.
tHigh unblocks and since it is the highest priority task in the system, it runs.
tHigh then attempts to enter the critical resource but blocks as tLow is in there.
tMed unblocks and since it is now the highest priority task in the system, it runs.

tHigh can not run until tLow gives up the resource. tLow can not run until tMed blocks or ends. The priority of the tasks has been inverted; tHigh though it has the highest priority is at the bottom of the execution chain.

To "solve" priority inversion, the priority of tLow must be bumped up to be at least as high as tHigh. Some may bump its priority to the highest possible priority level. Just as important as bumping up the priority level of tLow, is dropping the priority level of tLow at the appropriate time(s). Different systems will take different approaches.

When to drop the priority of tLow ...

No other tasks are blocked on any of the resources that tLow has. This may be due to timeouts or the releasing of resources.
No other tasks contributing to the raising the priority level of tLow are blocked on the resources that tLow has. This may be due to timeouts or the releasing of resources.
When there is a change in which tasks are waiting for the resource(s), drop the priority of tLow to match the priority of the highest priority level task blocked on its resource(s).

Method #2 is an improvement over method #1 in that it shortens the length of time that tLow has had its priority level bumped. Note that its priority level stays bumped at tHigh's priority level during this period.

Method #3 allows the priority level of tLow to step down in increments if necessary instead of in one all-or-nothing step.

Different systems will implement different methods depending upon what factors they consider important.

memory footprint
complexity
real time responsiveness
developer knowledge

Hope this helps.

106

answered Oct 13 '22 21:10

Sparky

Priority inversion is a problem, not a solution. The typical example is a low priority process acquiring a resource that a high priority process needs, and then being preempted by a medium priority process, so the high priority process is blocked on the resource while the medium priority one finishes (effectively being executed with a lower priority).

A rather famous example was the problem experienced by the Mars Pathfinder rover: http://www.cs.duke.edu/~carla/mars.html, it's a pretty interesting read.

answered Oct 13 '22 21:10

Dmitri

Suppose an application has three threads:

Thread 1 has high priority.
Thread 2 has medium priority.
Thread 3 has low priority.

Let's assume that Thread 1 and Thread 3 share the same critical section code

Thread 1 and thread 2 are sleeping or blocked at the beginning of the example. Thread 3 runs and enters a critical section.

At that moment, thread 2 starts running, preempting thread 3 because thread 2 has a higher priority. So, thread 3 continues to own a critical section.

Later, thread 1 starts running, preempting thread 2. Thread 1 tries to enter the critical section that thread 3 owns, but because it is owned by another thread, thread 1 blocks, waiting for the critical section.

At that point, thread 2 starts running because it has a higher priority than thread 3 and thread 1 is not running. Thread 3 never releases the critical section that thread 1 is waiting for because thread 2 continues to run.

Therefore, the highest-priority thread in the system, thread 1, becomes blocked waiting for lower-priority threads to run.

answered Oct 13 '22 22:10

Pratik Bhat

It is the problem rather than the solution.

It describes the situation that when low-priority threads obtain locks during their work, high-priority threads will have to wait for them to finish (which might take especially long since they are low-priority). The inversion here is that the high-priority thread cannot continue until the low-priority thread does, so in effect it also has low priority now.

A common solution is to have the low-priority threads temporarily inherit the high priority of everyone who is waiting on locks they hold.

answered Oct 13 '22 22:10

Thilo

Related questions
                            
                                Check the open FD limit for a given process in Linux
                            
                                which one should I use: os.sep or os.path.sep?
                            
                                Why is the CPU not needed to service I/O requests?
                            
                                Difference between sequential write and random write
                            
                                How does the OS scheduler regain control of CPU?
                            
                                How closely are Mac OS X and BSD related?
                            
                                Resources to develop an operating system [closed]
                            
                                List Directories and get the name of the Directory
                            
                                Why is Windows 32-bit called Windows x86 and not Windows x32? [closed]
                            
                                How to get the "friendly" OS Version Name?
                            
                                What is the return value of os.system() in Python?
                            
                                DNS caching in linux
                            
                                What is the difference between a stack overflow and buffer overflow?
                            
                                Available and used System Memory in Python? [duplicate]
                            
                                Is OSX a POSIX OS?
                            
                                What is the difference between Full, Para and Hardware assisted virtualization?
                            
                                How to make parent wait for all child processes to finish?
                            
                                Difference between multitasking, multithreading and multiprocessing? [closed]
                            
                                Best way to find os name and version in Unix/Linux platform
                            
                                Why was the comment that said "Don't format a floppy at the same time" funny when talking about threads and processes?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What is priority inversion?

Tags:

operating-system

priority-inversion

paxdiablo

People also ask