In many code samples, people usually use <code>'\0'</code> after creating a new char array like this: <pre class="prettyprint"><code>string s = "JustAString"; char* array = new char[s.size() + 1]; strncpy(array, s.c_str(), s.size()); array[s.size()] = '\0'; </code></pre> Why should we use <code>'\0'</code> here?

The title of your question references C strings. C++ <code>std::string</code> objects are handled differently than standard C strings. <code>\0</code> is important when using C strings, and when I use the term <code>string</code> here, I'm referring to standard C strings. <code>\0</code> acts as a string terminator in C. It is known as the null character, or NUL. It signals code that processes strings - standard libraries but also your own code - where the end of a string is. A good example is <code>strlen</code> which returns the length of a string. When you declare a constant string with: <pre class="prettyprint"><code>const char *str = "JustAString"; </code></pre> then the <code>\0</code> is appended automatically for you. In other cases, where you'll be managing a non-constant string as with your array example, you'll sometimes need to deal with it yourself. The docs for strncpy, which is used in your example, are a good illustration: <code>strncpy</code> copies over the null termination characters except in the case where the specified length is reached before the entire string is copied. Hence you'll often see <code>strncpy</code> combined with the possibly redundant assignment of a null terminator. <code>strlcpy</code> and <code>strcpy_s</code> were designed to address the potential problems that arise from neglecting to handle this case. In your particular example, <code>array[s.size()] = '\0';</code> is one such redundancy: since <code>array</code> is of size <code>s.size() + 1</code>, and <code>strncpy</code> is copying <code>s.size()</code> characters, the function will append the <code>\0</code>. The documentation for standard C string utilities will indicate when you'll need to be careful to include such a null terminator. But read the documentation carefully: as with <code>strncpy</code> the details are easily overlooked, leading to potential buffer overflows.

<blockquote> Why are strings in C++ usually terminated with <code>'\0'</code>? </blockquote> Note that C++ Strings and C strings are not the same. In C++ string refers to std::string which is a template class and provides a lot of intuitive functions to handle the string. Note that C++ std::string are not <code>\0</code> terminated, but the class provides functions to fetch the underlying string data as <code>\0</code> terminated c-style string. In C a string is collection of characters. This collection usually ends with a <code>\0</code>. Unless a special character like <code>\0</code> is used there would be no way of knowing when a string ends. It is also aptly known as the string null terminator. Ofcourse, there could be other ways of bookkeeping to track the length of the string, but using a special character has two straight advantages: <ul> <li>It is more intuitive and </li> <li>There are no additional overheads</li> </ul> Note that <code>\0</code> is needed because most of Standard C library functions operate on strings assuming they are <code>\0</code> terminated. For example: While using <code>printf()</code> if you have an string which is not <code>\0</code>terminated then <code>printf()</code> keeps writing characters to <code>stdout</code> until a <code>\0</code> is encountered, in short it might even print garbage. <blockquote> Why should we use <code>'\0'</code> here? </blockquote> There are two scenarios when you do not need to <code>\0</code> terminate a string: <ul> <li>In any usage if you are explicitly bookkeeping length of the string and </li> <li>If you are using some standard library api will implicitly add a <code>\0</code> to strings. </li> </ul> In your case you already have the second scenario working for you. <pre class="prettyprint"><code>array[s.size()] = '\0'; </code></pre> The above code statement is redundant in your example. For your example using <code>strncpy()</code> makes it useless. <code>strncpy()</code> copies <code>s.size()</code> characters to your <code>array</code>, Note that it appends a null termination if there is any space left after copying the strings. Since <code>array</code>is of size <code>s.size() + 1</code> a <code>\0</code> is automagically added.

Why are strings in C++ usually terminated with '\0'?

Tags:

c++

c

string

null-terminated

In many code samples, people usually use '\0' after creating a new char array like this:

string s = "JustAString";
char* array = new char[s.size() + 1];
strncpy(array, s.c_str(), s.size());
array[s.size()] = '\0';

Why should we use '\0' here?

592

asked Jun 08 '12 04:06

Kingfisher Phuoc

3 Answers

The title of your question references C strings. C++ std::string objects are handled differently than standard C strings. \0 is important when using C strings, and when I use the term string here, I'm referring to standard C strings.

\0 acts as a string terminator in C. It is known as the null character, or NUL. It signals code that processes strings - standard libraries but also your own code - where the end of a string is. A good example is strlen which returns the length of a string.

When you declare a constant string with:

const char *str = "JustAString";

then the \0 is appended automatically for you. In other cases, where you'll be managing a non-constant string as with your array example, you'll sometimes need to deal with it yourself. The docs for strncpy, which is used in your example, are a good illustration: strncpy copies over the null termination characters except in the case where the specified length is reached before the entire string is copied. Hence you'll often see strncpy combined with the possibly redundant assignment of a null terminator. strlcpy and strcpy_s were designed to address the potential problems that arise from neglecting to handle this case.

In your particular example, array[s.size()] = '\0'; is one such redundancy: since array is of size s.size() + 1, and strncpy is copying s.size() characters, the function will append the \0.

The documentation for standard C string utilities will indicate when you'll need to be careful to include such a null terminator. But read the documentation carefully: as with strncpy the details are easily overlooked, leading to potential buffer overflows.

195

answered Oct 03 '22 11:10

pb2q

Why are strings in C++ usually terminated with '\0'?

Note that C++ Strings and C strings are not the same.
In C++ string refers to std::string which is a template class and provides a lot of intuitive functions to handle the string.
Note that C++ std::string are not \0 terminated, but the class provides functions to fetch the underlying string data as \0 terminated c-style string.

In C a string is collection of characters. This collection usually ends with a \0.
Unless a special character like \0 is used there would be no way of knowing when a string ends.
It is also aptly known as the string null terminator.

Ofcourse, there could be other ways of bookkeeping to track the length of the string, but using a special character has two straight advantages:

It is more intuitive and
There are no additional overheads

Note that \0 is needed because most of Standard C library functions operate on strings assuming they are \0 terminated.
For example:
While using printf() if you have an string which is not \0terminated then printf() keeps writing characters to stdout until a \0 is encountered, in short it might even print garbage.

Why should we use '\0' here?

There are two scenarios when you do not need to \0 terminate a string:

In any usage if you are explicitly bookkeeping length of the string and
If you are using some standard library api will implicitly add a \0 to strings.

In your case you already have the second scenario working for you.

array[s.size()] = '\0';

The above code statement is redundant in your example.

For your example using strncpy() makes it useless. strncpy() copies s.size() characters to your array, Note that it appends a null termination if there is any space left after copying the strings. Since arrayis of size s.size() + 1 a \0 is automagically added.

answered Oct 03 '22 09:10

Alok Save

'\0' is the null termination character. If your character array didn't have it and you tried to do a strcpy you would have a buffer overflow. Many functions rely on it to know when they need to stop reading or writing memory.

answered Oct 03 '22 09:10

evanmcdonnal

Related questions
                            
                                error LNK2001: unresolved external symbol "private: static class
                            
                                Why is new int[n] valid when int array[n] is not?
                            
                                What does ~0 mean in this code?
                            
                                Does GCC have a built-in compile time assert?
                            
                                Why must C/C++ string literal declarations be single-line?
                            
                                Why isn't cin >> string working with Visual C++ 2010? [closed]
                            
                                WPARAM and LPARAM parameters
                            
                                How can I force a compile error in C++?
                            
                                C++ Program in Xcode not outputting simple text file using outFile
                            
                                std::map access operator deprecated? no operator [] matches these operands
                            
                                optimized memcpy
                            
                                Advice for dealing with code maintenance [closed]
                            
                                Multiple Inheritance: same variable name
                            
                                vtable for .. referenced from compile error xcode
                            
                                Unresolved externals in C++ when using vectors and find
                            
                                How do I populate values of a static QMap in C++ Qt?
                            
                                How to (portably) get DBL_EPSILON in C and C++
                            
                                How to retrieve the thread id from a boost::thread?
                            
                                XORing "Hello World!" cuts off string
                            
                                How to remove a particular substring from a string?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With