Why use array size 1 instead of pointer?

Tags:

c++

c

In one C++ open source project, I see this.

struct SomeClass {
  ...
  size_t data_length;
  char data[1];
  ...
};

What are the advantages of doing so rather than using a pointer?

struct SomeClass {
  ...
  size_t data_length;
  char* data;
  ...
};

The only thing I can think of is with the size 1 array version, users aren't expected to see NULL. Is there anything else?

895

asked Jun 17 '11 18:06

Russell

6 Answers

With this, you don't have to allocate the memory elsewhere and make the pointer point to that.

No extra memory management
Accesses to the memory are (much) more likely to hit the cache

The trick is to allocate more memory than sizeof (SomeClass), and make a SomeClass* point to it. Then the initial memory will be used by your SomeClass object, and the remaining memory can be used by the data. That is, you can say p->data[0] but also p->data[1] and so on up until you hit the end of memory you allocated.

One can argue that this results in undefined behavior, because the array is declared with only one element but is accessed as if it contained more. In practice, real compilers allow this with the expected meaning, because C++ has no alternative syntax to express it (C99 does; it's called a "flexible array member" there).
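A minimal sketch of the over-allocation trick described above, assuming a struct with only the two members shown in the question. The names `Blob`, `blob_create`, and `blob_destroy` are hypothetical and do not come from the project being asked about. As noted, indexing past `data[0]` is formally undefined behavior in C++, so this relies on the compiler accepting the idiom.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <cstring>

// Hypothetical stand-in for the question's SomeClass.
struct Blob {
    std::size_t data_length;
    char data[1];  // first payload byte; the rest follows in the same block
};

// Allocate the header and an n-byte payload as one block.
// Returns nullptr if the allocation fails.
Blob* blob_create(const char* src, std::size_t n) {
    assert(src != nullptr || n == 0);
    // sizeof(Blob) already counts data[0], so this over-allocates slightly,
    // which keeps the size computation simple.
    Blob* p = static_cast<Blob*>(std::malloc(sizeof(Blob) + n));
    if (p == nullptr) return nullptr;
    p->data_length = n;
    if (n > 0) std::memcpy(p->data, src, n);
    return p;
}

// One free releases both the header and the payload.
void blob_destroy(Blob* p) { std::free(p); }
```

A caller would write something like `Blob* b = blob_create("hello", 5);`, read `b->data[0]` through `b->data[4]`, and finish with `blob_destroy(b);`.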

193

answered Oct 01 '22 17:10

Johannes Schaub - litb

This is usually a quick (and dirty?) way of avoiding multiple memory allocations and deallocations, though it's more C style than C++.

That is, instead of this:

/* Two allocations: one for the struct, one for the payload. */
struct SomeClass *foo = malloc(sizeof *foo);
foo->data = malloc(data_len);
memcpy(foo->data, data, data_len);

...
free(foo->data);
free(foo);

You do something like this:

/* One allocation: the payload is stored directly after the struct. */
struct SomeClass *foo = malloc(sizeof *foo + data_len);
memcpy(foo->data, data, data_len);

...
free(foo);

In addition to saving (de)allocation calls, this can also save a bit of memory: no space is needed for a pointer, and the data can even occupy space that would otherwise have been struct padding.
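The memory saving can be made concrete by comparing how many bytes each layout requests. The following is a rough sketch under assumptions: `InlineBuf`, `PointerBuf`, and both helper functions are hypothetical names, the structs contain only the two members from the question, and the allocator's own per-allocation overhead is ignored.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical layouts mirroring the two versions of SomeClass in the question.
struct InlineBuf {
    std::size_t data_length;
    char data[1];   // payload starts here, inside the same allocation
};

struct PointerBuf {
    std::size_t data_length;
    char* data;     // payload lives in a second, separate allocation
};

// Bytes requested for an n-byte payload with the inline layout.
// The payload starts at offsetof(InlineBuf, data), so it can reuse what
// would otherwise be tail padding, but the block is never smaller than
// the struct itself.
std::size_t inline_total_bytes(std::size_t n) {
    assert(n <= SIZE_MAX - offsetof(InlineBuf, data));  // no overflow
    return std::max(sizeof(InlineBuf), offsetof(InlineBuf, data) + n);
}

// Bytes requested with the pointer layout: the struct plus the payload.
std::size_t pointer_total_bytes(std::size_t n) {
    assert(n <= SIZE_MAX - sizeof(PointerBuf));  // no overflow
    return sizeof(PointerBuf) + n;
}
```

On common platforms, `inline_total_bytes(n)` comes out smaller than `pointer_total_bytes(n)` for any non-empty payload, since the pointer member is gone and the first payload bytes sit where padding would have been.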

answered Oct 01 '22 18:10

Lyke

They are semantically different in your example.

char data[1] is a valid array of char with one uninitialized element, stored inside the struct itself (wherever the struct lives). You could write data[0] = 'w' and your program would be correct.

char* data; simply declares a pointer that is invalid until initialized to point to a valid address.
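To illustrate the difference, here is a small sketch that checks where the bytes of each member actually live. All names (`WithArray`, `WithPointer`, `lies_within`, and the two test functions) are hypothetical, and comparing addresses as integers is an assumption that holds on mainstream flat-memory platforms.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

struct WithArray {
    std::size_t data_length;
    char data[1];   // one char of storage embedded in the object
};

struct WithPointer {
    std::size_t data_length;
    char* data;     // only an address; the chars live somewhere else
};

// True when address p falls within the size bytes starting at object.
bool lies_within(const void* object, std::size_t size, const void* p) {
    const auto begin = reinterpret_cast<std::uintptr_t>(object);
    const auto addr = reinterpret_cast<std::uintptr_t>(p);
    return addr >= begin && addr - begin < size;
}

bool array_storage_is_inline() {
    WithArray a{};      // data[0] is usable right away, no allocation needed
    a.data[0] = 'w';
    return lies_within(&a, sizeof a, a.data);
}

bool pointer_storage_is_inline() {
    char elsewhere = 'w';
    WithPointer p{1, &elsewhere};   // must be pointed at valid storage first
    assert(p.data != nullptr);
    return lies_within(&p, sizeof p, p.data);
}
```

The array's element is part of the object, while the pointer version only records where some other storage is.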

answered Oct 01 '22 16:10

Ed S.

Usually you see this as the final member of a structure. Then whoever mallocs the structure will allocate all the data bytes consecutively in memory as one block to "follow" the structure.

So if you need 16 bytes of data, you'd allocate an instance like this:

SomeClass *pObj = (SomeClass *)malloc(sizeof(SomeClass) + (16 - 1));

Then you can access the data as if it were an array:

pObj->data[12] = 0xAB;

And you can free all the stuff with one call, of course, as well.

The data member is a single-item array by convention because older C compilers (and apparently the current C++ standard) don't allow a zero-sized array. Nice further discussion here: http://gcc.gnu.org/onlinedocs/gcc/Zero-Length.html
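The `sizeof(SomeClass) + (16 - 1)` computation generalizes to a small helper. This is a sketch only: `Record` and `record_alloc_size` are hypothetical names, and the struct is assumed to have just the two members from the question. The one subtlety is a payload of zero bytes, where an unsigned `n - 1` would wrap around.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical stand-in for the question's SomeClass.
struct Record {
    std::size_t data_length;
    char data[1];
};

// Allocation size for a Record holding n payload bytes, following the
// sizeof(...) + (n - 1) convention. data[1] already provides one byte, so
// n == 0 and n == 1 both need just sizeof(Record). Guarding n == 0 matters:
// with an unsigned n, (n - 1) would wrap around and the sum would come out
// smaller than the struct itself.
std::size_t record_alloc_size(std::size_t n) {
    const std::size_t extra = (n > 0) ? n - 1 : 0;
    assert(extra <= SIZE_MAX - sizeof(Record));  // the sum must not overflow
    return sizeof(Record) + extra;
}
```

The result would be passed straight to `malloc`, for example `malloc(record_alloc_size(16))` for the 16-byte case above.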

answered Oct 01 '22 16:10

Ben Zotto

The structure can be simply allocated as a single block of memory instead of multiple allocations that must be freed.
It actually uses less memory because it doesn't need to store the pointer itself.
There may also be performance advantages with caching due to the memory being contiguous.

answered Oct 01 '22 17:10

Jonathan Wood

The idea behind this idiom is that the rest of data is stored in memory directly after the struct. Of course, you could just do that anyway.

answered Oct 01 '22 17:10

Puppy
