I am reading "Write Great Code Volume 2" and it shows the following strlen implementation:
int myStrlen( char *s )
{
    char *start;

    start = s;
    while( *s != 0 )
    {
        ++s;
    }
    return s - start;
}
The book says that this implementation is typical for an inexperienced C programmer. I have been coding in C for the past 11 years and I can't see how to write a better function than this in C (I can think of writing a better one in assembly). How is it possible to write better code than this in C? I looked at the standard library implementation of the strlen function in glibc and I couldn't understand most of it. Where can I find better information on how to write highly optimized code?
strlen() on C-style strings can be replaced by C++ std::string. sizeof() in C, used as an argument to functions like malloc(), memcpy() or memset(), can be replaced in C++ (use new, std::copy(), and std::fill() or constructors).
The strlen() function calculates the length of a given string. It takes a string as an argument and returns its length as a value of type size_t (an unsigned integer type). It is declared in the <string.h> header file.
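For reference, a minimal usage sketch (the string contents here are just illustrative):

    #include <stdio.h>
    #include <string.h>

    int main(void)
    {
        const char *msg = "hello";   /* 5 bytes plus the terminating '\0' */
        size_t len = strlen(msg);    /* scans until the '\0'; yields 5 */
        printf("%zu\n", len);
        return 0;
    }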
OK, I need to add some explanation. My application gets a string from shared memory (which has some fixed length), so it can be represented as an array of characters. If there is a bug in the library writing this string, the string would not be zero-terminated, and strlen could fail.
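For that specific worry (my suggestion, not something from the book): POSIX provides strnlen(), which bounds the scan, so a missing terminator can never run past the end of the mapped region:

    #include <stddef.h>
    #include <string.h>   /* strnlen() is POSIX, not ISO C */

    /* Returns the string length, or region_size if no '\0' was found
       within the region, i.e. the string is not properly terminated. */
    size_t shared_string_len(const char *region, size_t region_size)
    {
        return strnlen(region, region_size);
    }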
From Optimising strlen(), a blogpost by Colm MacCarthaigh:
Unfortunately in C, we’re doomed to an O(n) implementation, best case, but we’re still not done … we can do something about the very size of n.
It gives a good example of the directions in which you can work to speed it up. And another quote from the same post:
Sometimes going really really fast just makes you really really insane.
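One direction the post points at is shrinking (or eliminating) n itself by remembering the length once you have computed it. A minimal sketch of that idea (my illustration, not code from the post):

    #include <stddef.h>
    #include <string.h>

    /* Carry the length with the data so it is computed at most once. */
    struct lstring {
        const char *data;
        size_t      len;
    };

    struct lstring lstring_from_cstr(const char *s)
    {
        struct lstring ls = { s, strlen(s) };  /* O(n) once, O(1) afterwards */
        return ls;
    }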
Victor, take a look at this:
http://en.wikipedia.org/wiki/Strlen#Implementation
P.S. The reason you don't understand the glibc version is probably that it reads the string one word at a time and uses bit tricks to find the \0.
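The core trick (this is the classic test from the "Bit Twiddling Hacks" collection; glibc's real code is more elaborate) detects whether any byte of a word is zero without examining the bytes one at a time:

    #include <stdint.h>

    /* Nonzero iff some byte of w is 0x00: subtracting 1 from each byte
       borrows into its top bit only when the byte was zero (or wrapped),
       and the & ~w term masks off the wrap-around cases. */
    static int has_zero_byte(uint32_t w)
    {
        return ((w - 0x01010101u) & ~w & 0x80808080u) != 0;
    }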
For starters, this is worthless for encodings like UTF-8... that is, calculating the number of characters in a UTF-8 string is more complicated, whereas the number of bytes is, of course, just as easy to calculate as in, say, an ASCII string.
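To make that concrete: counting code points in UTF-8 amounts to skipping continuation bytes (those of the form 10xxxxxx). A minimal sketch, assuming valid, NUL-terminated UTF-8 input:

    #include <stddef.h>

    /* Counts UTF-8 code points by counting only the bytes that start
       a sequence, i.e. skipping continuation bytes (10xxxxxx). */
    size_t utf8_strlen(const char *s)
    {
        size_t count = 0;
        for (; *s != '\0'; ++s) {
            if (((unsigned char)*s & 0xC0) != 0x80)
                ++count;
        }
        return count;
    }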
In general, you can optimize on some platforms by reading into larger registers. Since the other links posted so far don't have an example of that, here's a sketch for a little-endian machine:
#include <stddef.h>
#include <stdint.h>

size_t fastStrlen(const char *yourstring)  /* must be suitably aligned for 32-bit loads */
{
    size_t size = 0;
    const uint32_t *caststring = (const uint32_t *) yourstring;
    for (;;) {
        uint32_t x = *caststring++;
        if (!(x & 0x000000ffu)) return size;      /* first byte in this word is 0 */
        if (!(x & 0x0000ff00u)) return size + 1;  /* second byte is 0 */
        if (!(x & 0x00ff0000u)) return size + 2;  /* third byte is 0 */
        if (!(x & 0xff000000u)) return size + 3;  /* fourth byte is 0 */
        size += sizeof (uint32_t);                /* no zero byte: advance a word */
    }
}
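Two caveats with this approach: the wide load can read up to three bytes past the terminator, which is safe only because an aligned 32-bit load never crosses a page boundary (a production version, like glibc's, first handles any unaligned leading bytes one at a time), and the order of the byte tests must be reversed on a big-endian target.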