Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Difference between array type and array allocated with malloc

Today I was helping a friend of mine with some C code, and I've found some strange behavior that I couldn't explain him why it was happening. We had TSV file with a list of integers, with an int each line. The first line was the number of lines the list had.

We also had a c file with a very simple "readfile". The first line was read to n, the number of lines, then there was an initialization of:

int list[n] 

and finally a for loop of n with a fscanf.

For small n's (till ~100.000), everything was fine. However, we've found that when n was big (10^6), a segfault would occur.

Finally, we changed the list initialization to

int *list = malloc(n*sizeof(int)) 

and everything when well, even with very large n.

Can someone explain why this occurred? what was causing the segfault with int list[n], that was stopped when we start using list = malloc(n*sizeof(int))?

like image 979
Jorge Leitao Avatar asked May 13 '12 21:05

Jorge Leitao


People also ask

What is the difference between array and malloc?

The array is created at compile time while with malloc memory is allocated during run time; unless you grab all the memory with one malloc but then there is no advantage to using malloc.

What is the type of array allocation *?

the allocated type is T and the size of the request is simply sizeof(T) . For an array new-expression of the form: new T [n]; the allocated type is T [n] (array with n elements of type T ), and the size of the request is at least n * sizeof(T) .

Can you malloc an array?

int *array = malloc(10 * sizeof(int)); This computes the number of bytes that ten integers occupy in memory, then requests that many bytes from malloc and assigns the result to a pointer named array (due to C syntax, pointers and arrays can be used interchangeably in some situations).


2 Answers

There are several different pieces at play here.

The first is the difference between declaring an array as

int array[n]; 

and

int* array = malloc(n * sizeof(int)); 

In the first version, you are declaring an object with automatic storage duration. This means that the array lives only as long as the function that calls it exists. In the second version, you are getting memory with dynamic storage duration, which means that it will exist until it is explicitly deallocated with free.

The reason that the second version works here is an implementation detail of how C is usually compiled. Typically, C memory is split into several regions, including the stack (for function calls and local variables) and the heap (for malloced objects). The stack typically has a much smaller size than the heap; usually it's something like 8MB. As a result, if you try to allocate a huge array with

int array[n]; 

Then you might exceed the stack's storage space, causing the segfault. On the other hand, the heap usually has a huge size (say, as much space as is free on the system), and so mallocing a large object won't cause an out-of-memory error.

In general, be careful with variable-length arrays in C. They can easily exceed stack size. Prefer malloc unless you know the size is small or that you really only do want the array for a short period of time.

Hope this helps!

like image 171
templatetypedef Avatar answered Sep 22 '22 12:09

templatetypedef


int list[n] 

Allocates space for n integers on the stack, which is usually pretty small. Using memory on the stack is much faster than the alternative, but it is quite small and it is easy to overflow the stack (i.e. allocate too much memory) if you do things like allocate huge arrays or do recursion too deeply. You do not have to manually deallocate memory allocated this way, it is done by the compiler when the array goes out of scope.

malloc on the other hand allocates space in the heap, which is usually very large compared to the stack. You will have to allocate a much larger amount of memory on the heap to exhaust it, but it is a lot slower to allocate memory on the heap than it is on the stack, and you must deallocate it manually via free when you are done using it.

like image 24
Seth Carnegie Avatar answered Sep 23 '22 12:09

Seth Carnegie