How are percpu pointers implemented in the Linux kernel?

Tags: linux, linux-kernel, smp

On a multiprocessor system, each core can have its own variables. I assume these are distinct variables at different addresses, even though they are in the same process and have the same name.

But how does the kernel implement this? Does it set aside a region of memory to hold all the per-CPU pointers and, on each access, redirect the pointer to the right address with a shift or something similar?

661

asked Jun 07 '13 07:06

dspjm

1 Answer

Normal global variables are not per-CPU. Automatic variables live on the stack, and different CPUs use different stacks, so they naturally get separate variables.

I guess you're referring to Linux's per-CPU variable infrastructure.
Most of the magic is here (asm-generic/percpu.h):

extern unsigned long __per_cpu_offset[NR_CPUS];

#define per_cpu_offset(x) (__per_cpu_offset[x])

/* Separate out the type, so (int[3], foo) works. */
#define DEFINE_PER_CPU(type, name) \
    __attribute__((__section__(".data.percpu"))) __typeof__(type) per_cpu__##name

/* var is in discarded region: offset to particular copy we want */
#define per_cpu(var, cpu) (*RELOC_HIDE(&per_cpu__##var, __per_cpu_offset[cpu]))
#define __get_cpu_var(var) per_cpu(var, smp_processor_id())

The macro RELOC_HIDE(ptr, offset) simply advances ptr by the given offset in bytes (regardless of the pointer type).
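To make the "offset in bytes" point concrete, here is a minimal user-space sketch. SHIFT_BYTES and nth_by_bytes are hypothetical names made up for this illustration; this is not the kernel's actual RELOC_HIDE definition, only the same pointer arithmetic written in plain C (using the GCC/Clang __typeof__ extension).

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for RELOC_HIDE: go through char * so the offset is
 * counted in bytes, then cast back to the original pointer type. */
#define SHIFT_BYTES(ptr, off) ((__typeof__(ptr))((char *)(ptr) + (off)))

/* Returns a pointer to element n of an int array by adding a byte offset.
 * Ordinary pointer arithmetic (base + n) would scale by sizeof(int) itself;
 * here the scaling is done by hand because the offset is in bytes. */
static int *nth_by_bytes(int *base, size_t n)
{
    assert(base != NULL);
    return SHIFT_BYTES(base, n * sizeof(int));
}
```

Calling nth_by_bytes(v, 2) on an int array v yields the same address as &v[2]. The per_cpu() macro does the same kind of thing, except that the byte offset comes from __per_cpu_offset[cpu] rather than from an array index.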

What does it do?

1. When defining DEFINE_PER_CPU(int, x), an integer per_cpu__x is created in the special .data.percpu section.
2. When the kernel is loaded, this section is loaded multiple times, once per CPU (this part of the magic isn't in the code above).
3. The __per_cpu_offset array is filled with the distances between the copies. Supposing 1000 bytes of per-CPU data are used, __per_cpu_offset[n] would contain 1000*n.
4. The symbol per_cpu__x is relocated, during load, to CPU 0's per_cpu__x.
5. __get_cpu_var(x), when running on CPU 3, translates to *RELOC_HIDE(&per_cpu__x, __per_cpu_offset[3]). This starts with the address of CPU 0's x, adds the offset between CPU 0's data and CPU 3's, and finally dereferences the resulting pointer.
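The steps above can be imitated in user space. The sketch below is only a simulation under stated assumptions: NR_CPUS is fixed at 4, a struct stands in for the .data.percpu section, malloc stands in for the loader, and all sim_* names, CPU0_PTR, and percpu_template are hypothetical. The real kernel does this with linker sections and boot-time setup code, not with a struct.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

#define NR_CPUS 4  /* assumed CPU count for the simulation */

/* Stands in for the .data.percpu section: one field per per-CPU variable. */
struct percpu_section {
    int x;
    long counter;
};

/* Initial values, as they would appear in the kernel image. */
static struct percpu_section percpu_template = { .x = 7, .counter = 0 };

static char *percpu_area;                          /* NR_CPUS copies, back to back */
static unsigned long sim_per_cpu_offset[NR_CPUS];  /* distance from CPU 0's copy */

/* Address of CPU 0's copy of a member, playing the role of &per_cpu__name. */
#define CPU0_PTR(member) (&((struct percpu_section *)percpu_area)->member)

/* Same shape as per_cpu(): CPU 0's address plus a byte offset, dereferenced. */
#define sim_per_cpu(member, cpu) \
    (*(__typeof__(CPU0_PTR(member)))((char *)CPU0_PTR(member) + sim_per_cpu_offset[cpu]))

/* Steps 2 and 3: copy the section once per CPU and record the offsets. */
static int sim_setup_per_cpu_areas(void)
{
    size_t size = sizeof(struct percpu_section);

    percpu_area = malloc(NR_CPUS * size);
    if (!percpu_area)
        return -1;
    for (int cpu = 0; cpu < NR_CPUS; cpu++) {
        memcpy(percpu_area + cpu * size, &percpu_template, size);
        sim_per_cpu_offset[cpu] = cpu * size;
    }
    return 0;
}

/* Step 5: write CPU 3's copy, then return CPU 0's copy, which is unaffected. */
static int sim_demo(void)
{
    if (sim_setup_per_cpu_areas() != 0)
        return -1;
    sim_per_cpu(x, 3) = 42;
    assert(sim_per_cpu(x, 3) == 42);
    int cpu0 = sim_per_cpu(x, 0);
    free(percpu_area);
    percpu_area = NULL;
    return cpu0;
}
```

Here sim_per_cpu_offset[n] is n * sizeof(struct percpu_section), the counterpart of 1000*n in the example above. Writing through sim_per_cpu(x, 3) leaves every other CPU's copy of x at its initial value.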

200

answered Oct 04 '22 09:10

ugoren
