What is the use of fork() - ing before exec()?

In *nix systems, processes are created with the fork() system call. Consider, for example, the init process creating another process. First it forks itself, creating a process whose context is a copy of init's. Only when that child calls exec() does it become a new program. So why is the intermediate step (creating a child with the same context as the parent) needed? Isn't that a waste of time and resources, since we create a context (which costs time and memory) and then overwrite it?

Why is this not implemented as allocating a vacant memory area and then calling exec()? That would save time and resources, right?

asked Apr 04 '13 17:04

shar

2 Answers

The intermediate step enables you to set up shared resources in the child process without the external program being aware of it. The canonical example is constructing a pipe:

// read output of "ls"
// (error checking omitted for brevity; needs <stdio.h> and <unistd.h>)
int pipe_fd[2];
pipe(pipe_fd);           // pipe() takes the array itself, not its address
if (fork() == 0) {       // child:
    close(pipe_fd[0]);   // we don't want to read from the pipe
    dup2(pipe_fd[1], 1); // redirect stdout to the write end of the pipe
    close(pipe_fd[1]);   // the original descriptor is no longer needed
    execlp("ls", "ls", (char *) NULL);
    _exit(127);          // in case exec fails
}
// parent:
close(pipe_fd[1]);       // otherwise the parent never sees EOF
FILE *fp = fdopen(pipe_fd[0], "r");
char line[256];
while (fgets(line, sizeof line, fp) != NULL) {  // stops at EOF or error
    ...
}

Note how the redirection of standard output to the pipe is done in the child, between fork and exec. Of course, for this simple case, there could be a spawning API that would simply do this automatically, given the proper parameters. But the fork() design enables arbitrary manipulation of per-process resources in the child — one can close unwanted file descriptors, modify per-process limits, drop privileges, manipulate signal masks, and so on. Without fork(), the API for spawning processes would end up either extremely fat or not very useful. And indeed, the process spawning calls of competing operating systems typically fall somewhere in between.
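To make the "arbitrary manipulation" point concrete, here is a minimal sketch of a child that adjusts two per-process settings before exec: a CPU-time limit and a redirect of standard output to /dev/null. The helper name run_limited() and the exit codes 126/127 are my own choices for illustration, not part of any standard API.

```c
#include <fcntl.h>
#include <sys/resource.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical helper: run argv[0] with stdout discarded and a CPU-time
 * limit. Returns the program's exit status, or -1 on failure. */
int run_limited(char *const argv[], unsigned cpu_seconds)
{
    pid_t pid = fork();
    if (pid < 0)
        return -1;

    if (pid == 0) {
        /* Child: everything done here affects only the new process. */
        struct rlimit lim = { cpu_seconds, cpu_seconds };
        if (setrlimit(RLIMIT_CPU, &lim) != 0)
            _exit(126);

        int devnull = open("/dev/null", O_WRONLY);
        if (devnull < 0 || dup2(devnull, STDOUT_FILENO) < 0)
            _exit(126);
        if (devnull != STDOUT_FILENO)
            close(devnull);

        execvp(argv[0], argv);
        _exit(127);              /* reached only if exec fails */
    }

    /* Parent: its own limits and stdout are untouched. */
    int status;
    if (waitpid(pid, &status, 0) < 0)
        return -1;
    return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```

Calling run_limited() with an argv such as { "ls", NULL } and a limit of a few seconds runs the program silently and hands back its exit status. The executed program needs no knowledge of the limit or the redirection, and supporting another setting means adding one more ordinary system call in the child branch rather than another parameter to a spawn function.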

As for the waste of memory, it is avoided with the copy-on-write technique. fork() doesn't allocate new memory for the child process; it points the child at the parent's memory, with instructions to copy a page only if that page is ever written to. This makes fork() not only memory-efficient but also fast, because it only needs to copy a "table of contents" (the page tables) rather than the memory itself.
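The visible effect of copy-on-write can be checked with a small sketch: the child overwrites a buffer it inherited, and the parent's copy stays intact. This only demonstrates the semantics (each process sees its own data after a write); it does not measure how many pages the kernel actually copied. The function name is hypothetical.

```c
#include <stdlib.h>
#include <string.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical demo: returns 1 if the parent's buffer is unchanged after
 * the child overwrote its own view of it, 0 if not, -1 on error. */
int parent_buffer_survives_child_write(size_t size)
{
    if (size == 0)
        return -1;
    char *buf = malloc(size);
    if (buf == NULL)
        return -1;
    memset(buf, 'P', size);          /* parent's contents */

    pid_t pid = fork();
    if (pid < 0) {
        free(buf);
        return -1;
    }
    if (pid == 0) {
        /* Child: pages are shared until this write touches them. */
        memset(buf, 'C', size);
        _exit(buf[0] == 'C' ? 0 : 1);
    }

    int status;
    if (waitpid(pid, &status, 0) < 0) {
        free(buf);
        return -1;
    }
    int child_ok = WIFEXITED(status) && WEXITSTATUS(status) == 0;

    /* Parent: every byte should still be 'P'. */
    int unchanged = 1;
    for (size_t i = 0; i < size; i++) {
        if (buf[i] != 'P') {
            unchanged = 0;
            break;
        }
    }
    free(buf);
    return child_ok && unchanged;
}
```

In the fork-then-exec case the child typically writes to very few pages before exec replaces its address space, so very little is ever copied.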

answered Sep 21 '22 18:09

user4815162342

This is an old complaint. Many people have asked "Why fork() first?", and they typically suggest an operation that will both create a new process from scratch and run a program in it. This operation is called something like spawn().

And they always say, "Won't that be faster?"

And in fact, every system other than the Unix family does go the "spawn" way. Only Unix is based on fork() and exec().

But it's funny, Unix has always been much faster than other full-featured systems. It has always handled way more users and load.

And Unix has been made even faster over the years. fork() no longer really duplicates the address space; it just shares it using a technique called copy-on-write. (A very old fork optimization called vfork() is also still around.)
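For comparison, a spawn-style call does exist on POSIX systems as posix_spawn(). Below is a minimal sketch that runs a program with stdout sent to /dev/null. The helper name spawn_quiet() is made up for illustration. Note that the redirection has to be described up front through a "file actions" object, since there is no point at which your own code runs inside the child.

```c
#include <fcntl.h>
#include <spawn.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

extern char **environ;

/* Hypothetical helper: run argv[0] with stdout discarded, using the
 * spawn-style API. Returns the exit status, or -1 on failure. */
int spawn_quiet(char *const argv[])
{
    posix_spawn_file_actions_t actions;
    if (posix_spawn_file_actions_init(&actions) != 0)
        return -1;

    /* Ask for /dev/null to be opened as fd 1 in the new process. */
    int rc = posix_spawn_file_actions_addopen(&actions, STDOUT_FILENO,
                                              "/dev/null", O_WRONLY, 0);
    pid_t pid;
    if (rc == 0)
        rc = posix_spawnp(&pid, argv[0], &actions, NULL, argv, environ);
    posix_spawn_file_actions_destroy(&actions);
    if (rc != 0)
        return -1;

    int status;
    if (waitpid(pid, &status, 0) < 0)
        return -1;
    return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```

Anything the file actions and spawn attributes don't cover cannot be expressed this way, which is the trade-off against fork() followed by exec().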

Drink the Kool-Aid.

answered Sep 23 '22 18:09

DigitalRoss

Tags: c, unix, process
