C++ Primer says <blockquote> For most applications, in addition to being safer, it is also more efficient to use library strings rather then C-style strings </blockquote> Safety is understood. Why is C++ strings library more efficient? After all, underneath it all, aren't strings still represented as character arrays? To clarify, does author talk about programmer efficiency (understood) or processing efficiency?

C-strings are usually faster, because they do not call malloc/new. But there are cases where <code>std::string</code> is faster. Function <code>strlen()</code> is O(N), but <code>std::string::size()</code> is O(1). Also when you search for substring, in C strings you need to check for <code>'\0'</code> on every cycle, in <code>std::string</code> - you don't. In a naive substring search algorithm it doesn't matter much, because instead of checking for <code>'\0'</code> you need to check for <code>i<s.size()</code>. But modern high-performance substring search algorithms traverse strings in multibyte steps. And the need for a <code>'\0'</code> check in every byte slows them down. This is the reason why GLIBC <code>memmem</code> is x2 times faster than <code>strstr</code>. I did a lot of benchmarking of substring algorithms. This is true not only for substring search algorithm. Many other string processing algorithms are slower for zero-terminated strings.

<blockquote> Why is C++ strings library more efficient? After all, underneath it all, aren't strings still represented as character arrays? </blockquote> Because the code which uses <code>char*</code> or <code>char[]</code> is more likely to be inefficent if not written carefully. For example, have you seen loop like this: <pre class="prettyprint"><code>char *get_data(); char const *s = get_data(); for(size_t i = 0 ; i < strlen(s) ; ++i) //Is it efficent loop? No. { //do something } </code></pre> Is that efficient? No. The time-complexity of <code>strlen()</code> is <code>O(N)</code>, and furthermore, it is computed in each iteration, in the above code. Now you may say "I can make it efficient if I call <code>strlen()</code> just once.". Of course, you can. But you have to do all that sort of optimization yourself and conciously. If you missed something, you missed CPU cycles. But with <code>std::string</code>, many such optimization is done by the class itself. So you can write this: <pre class="prettyprint"><code>std::string get_data(); std::string const & s = get_data(); //avoid copy if you don't need it for(size_t i = 0 ; i < s.size() ; ++i) //Is it efficent loop? Yes. { //do something } </code></pre> Is that efficient? Yes. The time-complexity of <code>size()</code> is <code>O(1)</code>. No need to optimize it manually which often makes code look ugly and hard to read. The resulting code with <code>std::string</code> is almost always neat and clean in comparison to <code>char*</code>. Also note that <code>std::string</code> not only makes your code efficent in terms of CPU cycles, but it also increases programmer efficency!

Efficiency of C-String vs C++Strings

2 Answers

C-strings are usually faster, because they do not call malloc/new. But there are cases where std::string is faster. Function strlen() is O(N), but std::string::size() is O(1).

Also when you search for substring, in C strings you need to check for '\0' on every cycle, in std::string - you don't. In a naive substring search algorithm it doesn't matter much, because instead of checking for '\0' you need to check for i<s.size(). But modern high-performance substring search algorithms traverse strings in multibyte steps. And the need for a '\0' check in every byte slows them down. This is the reason why GLIBC memmem is x2 times faster than strstr. I did a lot of benchmarking of substring algorithms.

This is true not only for substring search algorithm. Many other string processing algorithms are slower for zero-terminated strings.

175

answered Sep 17 '22 23:09

Leonid Volnitsky

Why is C++ strings library more efficient? After all, underneath it all, aren't strings still represented as character arrays?

Because the code which uses char* or char[] is more likely to be inefficent if not written carefully. For example, have you seen loop like this:

char *get_data();  char const *s = get_data();   for(size_t i = 0 ; i < strlen(s) ; ++i) //Is it efficent loop? No. {    //do something }

Is that efficient? No. The time-complexity of strlen() is O(N), and furthermore, it is computed in each iteration, in the above code.

Now you may say "I can make it efficient if I call strlen() just once.". Of course, you can. But you have to do all that sort of optimization yourself and conciously. If you missed something, you missed CPU cycles. But with std::string, many such optimization is done by the class itself. So you can write this:

std::string get_data();  std::string const & s = get_data(); //avoid copy if you don't need  it  for(size_t i = 0 ; i < s.size() ; ++i) //Is it efficent loop? Yes. {    //do something }

Is that efficient? Yes. The time-complexity of size() is O(1). No need to optimize it manually which often makes code look ugly and hard to read. The resulting code with std::string is almost always neat and clean in comparison to char*.

Also note that std::string not only makes your code efficent in terms of CPU cycles, but it also increases programmer efficency!

answered Sep 16 '22 23:09

Nawaz

Related questions
                            
                                Difference between code object and executable file
                            
                                How to compare the signature of two functions?
                            
                                Is there any advantage in using static_cast rather than C-style casting for non-pointer types?
                            
                                What is the difference between NULL and __null in C++?
                            
                                C++: Where to write the code documentation: in .cpp or in .hpp files? [closed]
                            
                                Header file included only once in entire program?
                            
                                stl::multimap - how do i get groups of data?
                            
                                How do I convert wchar_t* to std::string?
                            
                                C++11 and the lack of polymorphic lambdas - why?
                            
                                ‘numeric_limits’ was not declared in this scope, no matching function for call to ‘max()’
                            
                                How to use SDL2 and SDL_image with cmake
                            
                                Why does printf() promote a float to a double?
                            
                                Why can't a data member be in a lambda capture list
                            
                                Why can't I use strerror?
                            
                                Syntax highlighting in MS Word document [closed]
                            
                                How do I create and use a class arrow operator?
                            
                                Compiling C and C++ files together using GCC
                            
                                How to call a template member function? [duplicate]
                            
                                Library function for Permutation and Combination in C++
                            
                                Error LNK2019 when using GetFileVersionInfoSize()

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Efficiency of C-String vs C++Strings

Tags:

c++

string

James Leonard

People also ask

2 Answers

Leonid Volnitsky

Nawaz

Recent Activity

Donate For Us