What is paging?

Paging is explained here, slide #6:

http://www.cs.ucc.ie/~grigoras/CS2506/Lecture_6.pdf

in my lecture notes, but I cannot for the life of me understand it. I know it's a way of translating virtual addresses to physical addresses. So the virtual addresses, which are on disk, are divided into chunks of 2^k. I am really confused after this. Can someone please explain it to me in simple terms?

744

asked May 11 '11 23:05

John Curtsy

2 Answers

Paging is, as you've noted, a type of virtual memory. To answer the question raised by @John Curtsy: it's covered separately from virtual memory in general because there are other types of virtual memory, although paging is now (by far) the most common.

Paged virtual memory is pretty simple: you split all of your physical memory up into blocks (pages), mostly of equal size (though having a selection of two or three sizes is fairly common in practice). Making the blocks equal-sized makes them interchangeable.

Then you have addressing. You start by breaking each address up into two pieces. One is an offset within a page. You normally use the least significant bits for that part. If you use (say) 4K pages, you need 12 bits for the offset. With (say) a 32-bit address space, that leaves 20 more bits.
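As a minimal sketch of that split, assuming the 4K page size and 32-bit addresses from the example above (the name `split_address` is just made up for illustration):

```python
PAGE_SIZE = 4096                          # 4K pages, as in the example above
OFFSET_BITS = PAGE_SIZE.bit_length() - 1  # 12 bits select a byte within a page
OFFSET_MASK = PAGE_SIZE - 1               # 0xFFF keeps only the low 12 bits


def split_address(addr):
    """Split a 32-bit address into (upper 20 bits, lower 12 bits)."""
    return addr >> OFFSET_BITS, addr & OFFSET_MASK


page_number, offset = split_address(0x12345678)
print(hex(page_number), hex(offset))  # 0x12345 0x678
```

The shift and mask are all there is to it: the low bits pass through untouched, and only the upper bits take part in the lookup.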

From there, things are really a lot simpler than they initially seem. You basically build a small "descriptor" to describe each page of memory. This will have a linear address (the address used by the client application to address that memory), and a physical address for the memory, as well as a Present bit. There will (at least usually) be a few other things like permissions to indicate whether data in that page can be read, written, executed, etc.
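To make that concrete, here is a rough sketch of what such a descriptor holds. The field names and layout are my own invention for illustration; real hardware packs this into a few bits of a page table entry, and the exact format differs between CPUs.

```python
from dataclasses import dataclass


@dataclass
class PageDescriptor:
    """Illustrative descriptor for one page; not any real CPU's layout."""
    linear_page: int          # upper 20 bits of the address the program uses
    physical_page: int        # upper 20 bits of the address in physical memory
    present: bool = True      # False once the page has been written out to disk
    readable: bool = True
    writable: bool = False
    executable: bool = False


# A hypothetical read-only, executable page of program code.
code_page = PageDescriptor(linear_page=0x00400, physical_page=0x1A2B3,
                           executable=True)
print(code_page)
```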

Then, when client code uses an address, the CPU starts by splitting the page offset off from the rest of the address. It then takes the rest of the linear address (the upper 20 bits, in this example) and looks through the page descriptors to find the physical address that goes with that linear address. Then, to address the physical memory, it uses the upper 20 bits of the physical address with the lower 12 bits of the linear address, and together they form the actual physical address that goes out on the processor pins and gets data from the memory chip.
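The lookup described above can be sketched in a few lines. This assumes 4K pages and uses a plain dictionary as a stand-in for the page descriptors; the page numbers in `page_table` are invented for the example.

```python
OFFSET_BITS = 12                        # 4K pages
OFFSET_MASK = (1 << OFFSET_BITS) - 1

# Hypothetical mappings: linear page number -> physical page number.
page_table = {0x00400: 0x1A2B3, 0x00401: 0x00042}


def translate(linear_addr):
    """Swap the upper 20 bits via the table; keep the lower 12 bits as-is."""
    linear_page = linear_addr >> OFFSET_BITS
    offset = linear_addr & OFFSET_MASK
    physical_page = page_table[linear_page]   # KeyError here means "unmapped"
    return (physical_page << OFFSET_BITS) | offset


print(hex(translate(0x00400ABC)))  # 0x1a2b3abc
```

Note that two adjacent linear pages (0x00400 and 0x00401) land in completely unrelated physical pages, which is exactly why equal-sized, interchangeable blocks matter.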

Now, we get to the part where we get "true" virtual memory. When programs are using more memory than is actually available, the OS takes the data in some of those pages and writes it out to the disk drive. It then clears the "Present" bit in the descriptor for each such page. The physical page of memory is now free for some other purpose.

When the client program tries to refer to that memory, the CPU checks that the Present bit is set. If it's not, the CPU raises an exception (a page fault). When that happens, the OS's exception handler frees up a block of physical memory as above, reads the data for the current page back in from disk, and fills in the page descriptor with the address of the physical page where it's now located. When it's done all that, it returns from the exception, and the CPU restarts execution of the instruction that caused the exception to start with -- except now, the Present bit is set, so using the memory will work.
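Here is a toy simulation of that cycle. Everything in it is an assumption made for the sketch: two physical frames, a dictionary playing the role of the disk, and "evict the oldest page" as the replacement policy (real OSes use more elaborate policies). The names `cpu_read`, `handle_fault` and `read` are made up.

```python
class PageFault(Exception):
    """Stands in for the exception the CPU raises when Present is clear."""


NUM_FRAMES = 2
frames = [None] * NUM_FRAMES   # "physical memory": frame number -> contents
disk = {}                      # swapped-out contents: linear page -> contents
table = {}                     # linear page -> {"frame": ..., "present": ...}
resident = []                  # pages currently in memory, oldest first


def cpu_read(page):
    """What the CPU does: check Present, then use the frame."""
    entry = table.setdefault(page, {"frame": None, "present": False})
    if not entry["present"]:
        raise PageFault(page)
    return frames[entry["frame"]]


def handle_fault(page):
    """What the OS does: free a frame if needed, load the page, set Present."""
    if len(resident) < NUM_FRAMES:
        frame = len(resident)             # a frame is still unused
    else:
        victim = resident.pop(0)          # evict the oldest resident page
        frame = table[victim]["frame"]
        disk[victim] = frames[frame]      # write its data out to "disk"
        table[victim]["present"] = False  # clear its Present bit
    # Read the page back from disk, or invent contents on first touch.
    frames[frame] = disk.pop(page, f"data for page {page}")
    table[page] = {"frame": frame, "present": True}
    resident.append(page)


def read(page):
    try:
        return cpu_read(page)
    except PageFault:
        handle_fault(page)
        return cpu_read(page)             # restart the access that faulted


for page in (0, 1, 2, 0):
    print(page, read(page), "on disk:", sorted(disk))
```

With only two frames, touching page 2 pushes page 0 out to disk, and touching page 0 again pushes page 1 out and brings page 0 back with its data intact. The program calling `read` never sees any of this, which is the whole point.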

There is one more detail that you probably need to know: the page descriptors are normally arranged into page tables, and (the important part) you normally have a separate set of page tables for each process in the system (and another for the OS kernel itself). Having separate page tables for each process means that each process can use the same set of linear addresses, but those get mapped to a different set of physical addresses as needed. You can also map the same physical memory to more than one process by just creating two separate page descriptors (one for each process) that contain the same physical address. Most OSes use this so that, for example, if you have two or three copies of the same program running, it'll really only have one copy of the executable code for that program in memory -- but it'll have two or three sets of page descriptors that point to that same code so all of them can use it without making separate copies for each.
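A small sketch of the per-process idea, again with dictionaries standing in for page tables. The process names and page numbers are invented: both processes use the same linear addresses, the code page is shared, and the data pages are private.

```python
OFFSET_BITS = 12
OFFSET_MASK = (1 << OFFSET_BITS) - 1

# One page table per process: linear page -> physical page (made-up numbers).
page_tables = {
    "proc_a": {0x00400: 0x00100, 0x00800: 0x00200},
    "proc_b": {0x00400: 0x00100, 0x00800: 0x00300},
}


def translate(process, linear_addr):
    """Translate using the page table that belongs to the given process."""
    physical_page = page_tables[process][linear_addr >> OFFSET_BITS]
    return (physical_page << OFFSET_BITS) | (linear_addr & OFFSET_MASK)


code_a = translate("proc_a", 0x00400010)
code_b = translate("proc_b", 0x00400010)
data_a = translate("proc_a", 0x00800010)
data_b = translate("proc_b", 0x00800010)

print(code_a == code_b)  # True: one shared copy of the code
print(data_a == data_b)  # False: same linear address, separate physical memory
```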

Of course, I'm simplifying a lot -- quite a few complete (and often fairly large) books have been written about virtual memory. There's also a fair amount of variation among machines, with various embellishments added, minor changes in parameters made (e.g., whether a page is 4K or 8K), and so on. Nonetheless, this is at least a general idea of the core of what happens (and it's still at a high enough level to apply about equally to an ARM, x86, MIPS, SPARC, etc.)

184

answered Oct 03 '22 15:10

Jerry Coffin

Simply put, it's a way of using far more memory than your physical RAM would normally allow. E.g., if each process has a 32-bit address space, it can address 2^32 bytes (4 GiB) of virtual memory even on a machine with less physical RAM than that, because pages that don't fit in RAM are kept on disk.

answered Oct 03 '22 15:10

soandos

Tags: memory, ram, virtual, translation, paging
