C++ string::find complexity

Tags:

Why the c++'s implemented string::find() doesn't use the KMP algorithm (and doesn't run in O(N + M)) and runs in O(N * M)? Is that corrected in C++0x? If the complexity of current find is not O(N * M), what is that?

so what algorithm is implemented in gcc? is that KMP? if not, why? I've tested that and the running time shows that it runs in O(N * M)

864

asked Jan 15 '12 12:01

Farzam

1 Answers

Why the c++'s implemented string::substr() doesn't use the KMP algorithm (and doesn't run in O(N + M)) and runs in O(N * M)?

I assume you mean find(), rather than substr() which doesn't need to search and should run in linear time (and only because it has to copy the result into a new string).

The C++ standard doesn't specify implementation details, and only specifies complexity requirements in some cases. The only complexity requirements on std::string operations are that size(), max_size(), operator[], swap(), c_str() and data() are all constant time. The complexity of anything else depends on the choices made by whoever implemented the library you're using.

The most likely reason for choosing a simple search over something like KMP is to avoid needing extra storage. Unless the string to be found is very long, and the string to search contains a lot of partial matches, the time taken to allocate and free that would likely be much more than the cost of the extra complexity.

Is that corrected in c++0x?

No, C++11 doesn't add any complexity requirements to std::string, and certainly doesn't add any mandatory implementation details.

If the complexity of current substr is not O(N * M), what is that?

That's the worst-case complexity, when the string to search contains a lot of long partial matches. If the characters have a reasonably uniform distribution, then the average complexity would be closer to O(N). So by choosing an algorithm with better worst-case complexity, you may well make more typical cases much slower.

148

answered Sep 23 '22 01:09

Mike Seymour

Related questions
                            
                                Ndk-build: CreateProcess: make (e=87): The parameter is incorrect
                            
                                Compiling a static executable with CMake
                            
                                C++ static constexpr field with incomplete type
                            
                                Are symbols from the C standard library reserved in C++?
                            
                                Where to put the enum in a cpp program?
                            
                                Is there a standard date/time class in C++?
                            
                                Deleting an object in C++
                            
                                Sorting two corresponding arrays
                            
                                Decimal points with std::stringstream?
                            
                                C++ string declaration
                            
                                Tensorflow Different ways to Export and Run graph in C++
                            
                                Does UINT_MAX have all bits set to 1?
                            
                                C++ check if statement can be evaluated constexpr
                            
                                std::unique_ptr of base class holding reference of derived class does not show warning in gcc compiler while naked pointer shows it. Why?
                            
                                Why place headers in a separate directory? [duplicate]
                            
                                Cannot use .begin() or .end() on an array
                            
                                Find maximum value of a cv::Mat
                            
                                std::vector removing elements which fulfill some conditions
                            
                                Keeping the first N elements of a std::vector<> and removing the rest
                            
                                lldb: Couldn't materialize: couldn't get the value of variable

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

C++ string::find complexity

Tags:

c++

string

substring

algorithm

time-complexity

Farzam

People also ask

1 Answers

Mike Seymour

Recent Activity

Donate For Us