I want to check if string is empty and parse the string in time. Please find the below code <pre class="prettyprint"><code>valueStr = strings.Replace(string(valueStr), " ", "", -1) valueStr = strings.Replace(string(valueStr), "\t", "", -1) valueStr = strings.Replace(string(valueStr), "\n", "", -1) valueStr = strings.Replace(string(valueStr), "\r", "", -1) var re = regexp.MustCompile(`\s`) valueStr = re.ReplaceAllString(valueStr, "") if valueStr != "" { fmt.Printf("-------- valueStr %c: \n", valueStr) // o/p => -------- valueStr %!c(string= ): fmt.Printf("-------- valueStr %#v: \n", valueStr) // o/p => -------- valueStr "\x00": fmt.Printf("-------- valueStr %x: \n", valueStr) // o/p => -------- valueStr 00: fmt.Println("-------- valueStr length: ", len(valueStr)) // o/p => -------- valueStr length: 1 // considering valueStr is not empty, parse string to time time, err := time.Parse(TIME_FORMAT, strings.TrimSpace(valueStr)) if err != nil { fmt.Println("-------- Error converting time: ", err) // o/p => -------- Error converting time: parsing time " " as "15:04:05": cannot parse " " as "15" return } } else { // another code } </code></pre> How to remove this empty character from string? Or check if string contains this empty character?

You can remove <code>\x00</code> runes from a string the same way you can remove any other runes: <pre class="prettyprint"><code>valueStr = strings.Replace(valueStr, "\x00", "", -1) </code></pre> Example: <pre class="prettyprint"><code>s := "a\x00b" fmt.Printf("%q\n", s) s = strings.Replace(s, "\x00", "", -1) fmt.Printf("%q\n", s) </code></pre> Output (try it on the Go Playground): <pre class="prettyprint"><code>"a\x00b" "ab" </code></pre> <h3>Using <code>strings.Replacer</code> </h3> Also note that you can substitute the multiple replaces with a single operation by using <code>strings.Replacer</code>, and it will also be more efficient as it only iterates over the input once (and there will be only one <code>string</code> allocated for the result, no matter how many substrings you want to replace). For example: <pre class="prettyprint"><code>s := " \t\n\rabc\x00" fmt.Printf("%q\n", s) r := strings.NewReplacer(" ", "", "\t", "", "\n", "", "\r", "", "\x00", "") s = r.Replace(s) fmt.Printf("%q\n", s) </code></pre> Output (try it on the Go Playground): <pre class="prettyprint"><code>" \t\n\rabc\x00" "abc" </code></pre> Also note that it's enough to create a <code>string.Replacer</code> once, and you can store it in a (global) variable and reuse it, it is even safe to use it concurrently from multiple goroutines. <h3>Using <code>strings.Map()</code> </h3> Also note that if you only want to replace (remove) single <code>rune</code>s and not multi-rune (or multi-byte) substrings, you can also use <code>strings.Map()</code> which might be even more efficient than <code>strings.Replacer</code>. First define a function that tells which <code>rune</code>s to replace (or remove if you return a negative value): <pre class="prettyprint"><code>func remove(r rune) rune { switch r { case ' ', '\t', '\n', '\r', 0: return -1 } return r } </code></pre> And then using it: <pre class="prettyprint"><code>s := " \t\n\rabc\x00" fmt.Printf("%q\n", s) s = strings.Map(remove, s) fmt.Printf("%q\n", s) </code></pre> Output (try it on the Go Playground): <pre class="prettyprint"><code>" \t\n\rabc\x00" "abc" </code></pre> <h3>Benchmarks</h3> We might think <code>strings.Map()</code> will be superior as it only have to deal with <code>rune</code>s which are just <code>int32</code> numbers, while <code>strings.Replacer</code> have to deal with <code>string</code> values which are headers (length+data pointer) plus a series of bytes. But we should know that <code>string</code> values are stored as UTF-8 byte sequences in memory, which means <code>strings.Map()</code> have to decode the <code>rune</code>s from the UTF-8 byte sequence (and encode the runes back to UTF-8 in the end), while <code>strings.Replacer</code> does not: it may simply look for byte sequence matches without decoding the <code>rune</code>s. And <code>strings.Replacer</code> is highly optimized to take advantage of such "tricks". So let's create a benchmark to compare them: We'll use these for the benchmarks: <pre class="prettyprint"><code>var r = strings.NewReplacer(" ", "", "\t", "", "\n", "", "\r", "", "\x00", "") func remove(r rune) rune { switch r { case ' ', '\t', '\n', '\r', 0: return -1 } return r } </code></pre> And we run benchmarks on different input strings: <pre class="prettyprint"><code>func BenchmarkReplaces(b *testing.B) { cases := []struct { title string input string }{ { title: "None", input: "abc", }, { title: "Normal", input: " \t\n\rabc\x00", }, { title: "Long", input: "adsfWR \t\rab\nc\x00 \t\n\rabc\x00asdfWER\n\r", }, } for _, c := range cases { b.Run("Replacer-"+c.title, func(b *testing.B) { for i := 0; i < b.N; i++ { r.Replace(c.input) } }) b.Run("Map-"+c.title, func(b *testing.B) { for i := 0; i < b.N; i++ { strings.Map(remove, c.input) } }) } } </code></pre> And now let's see the benchmark results: <pre class="prettyprint"><code>BenchmarkReplaces/Replacer-None-4 100000000 12.3 ns/op 0 B/op 0 allocs/op BenchmarkReplaces/Map-None-4 100000000 16.1 ns/op 0 B/op 0 allocs/op BenchmarkReplaces/Replacer-Normal-4 20000000 92.7 ns/op 6 B/op 2 allocs/op BenchmarkReplaces/Map-Normal-4 20000000 92.4 ns/op 16 B/op 2 allocs/op BenchmarkReplaces/Replacer-Long-4 5000000 234 ns/op 64 B/op 2 allocs/op BenchmarkReplaces/Map-Long-4 5000000 235 ns/op 80 B/op 2 allocs/op </code></pre> Despite expectations, <code>string.Replacer</code> performs pretty good, just as good as <code>strings.Map()</code> due to it not having to decode and encode runes.

remove null character from string

Tags:

string

replace

go

removing-whitespace

I want to check if string is empty and parse the string in time.

Please find the below code

valueStr = strings.Replace(string(valueStr), " ", "", -1)
valueStr = strings.Replace(string(valueStr), "\t", "", -1)
valueStr = strings.Replace(string(valueStr), "\n", "", -1)
valueStr = strings.Replace(string(valueStr), "\r", "", -1)
var re = regexp.MustCompile(`\s`)
valueStr = re.ReplaceAllString(valueStr, "")

if valueStr != "" {
    fmt.Printf("-------- valueStr %c: \n", valueStr)         // o/p =>  -------- valueStr %!c(string= ):
    fmt.Printf("-------- valueStr %#v: \n", valueStr)        // o/p => -------- valueStr "\x00":
    fmt.Printf("-------- valueStr %x: \n", valueStr)         // o/p =>  -------- valueStr 00:
    fmt.Println("-------- valueStr length: ", len(valueStr)) // o/p => -------- valueStr length:  1

    // considering valueStr is not empty, parse string to time

    time, err := time.Parse(TIME_FORMAT, strings.TrimSpace(valueStr))
    if err != nil {
        fmt.Println("-------- Error converting time: ", err) // o/p => -------- Error converting time:  parsing time " " as "15:04:05": cannot parse " " as "15"
        return
    }
} else {
    // another code
}

How to remove this empty character from string? Or check if string contains this empty character?

277

asked Jan 21 '19 07:01

Bhavana

1 Answers

You can remove \x00 runes from a string the same way you can remove any other runes:

valueStr = strings.Replace(valueStr, "\x00", "", -1)

Example:

s := "a\x00b"
fmt.Printf("%q\n", s)
s = strings.Replace(s, "\x00", "", -1)
fmt.Printf("%q\n", s)

Output (try it on the Go Playground):

"a\x00b"
"ab"

Using `strings.Replacer`

Also note that you can substitute the multiple replaces with a single operation by using strings.Replacer, and it will also be more efficient as it only iterates over the input once (and there will be only one string allocated for the result, no matter how many substrings you want to replace).

For example:

s := " \t\n\rabc\x00"
fmt.Printf("%q\n", s)

r := strings.NewReplacer(" ", "", "\t", "", "\n", "", "\r", "", "\x00", "")
s = r.Replace(s)
fmt.Printf("%q\n", s)

Output (try it on the Go Playground):

" \t\n\rabc\x00"
"abc"

Also note that it's enough to create a string.Replacer once, and you can store it in a (global) variable and reuse it, it is even safe to use it concurrently from multiple goroutines.

Using `strings.Map()`

Also note that if you only want to replace (remove) single runes and not multi-rune (or multi-byte) substrings, you can also use strings.Map() which might be even more efficient than strings.Replacer.

First define a function that tells which runes to replace (or remove if you return a negative value):

func remove(r rune) rune {
    switch r {
    case ' ', '\t', '\n', '\r', 0:
        return -1
    }
    return r
}

And then using it:

s := " \t\n\rabc\x00"
fmt.Printf("%q\n", s)

s = strings.Map(remove, s)
fmt.Printf("%q\n", s)

Output (try it on the Go Playground):

" \t\n\rabc\x00"
"abc"

Benchmarks

We might think strings.Map() will be superior as it only have to deal with runes which are just int32 numbers, while strings.Replacer have to deal with string values which are headers (length+data pointer) plus a series of bytes.

But we should know that string values are stored as UTF-8 byte sequences in memory, which means strings.Map() have to decode the runes from the UTF-8 byte sequence (and encode the runes back to UTF-8 in the end), while strings.Replacer does not: it may simply look for byte sequence matches without decoding the runes. And strings.Replacer is highly optimized to take advantage of such "tricks".

So let's create a benchmark to compare them:

We'll use these for the benchmarks:

var r = strings.NewReplacer(" ", "", "\t", "", "\n", "", "\r", "", "\x00", "")

func remove(r rune) rune {
    switch r {
    case ' ', '\t', '\n', '\r', 0:
        return -1
    }
    return r
}

And we run benchmarks on different input strings:

func BenchmarkReplaces(b *testing.B) {
    cases := []struct {
        title string
        input string
    }{
        {
            title: "None",
            input: "abc",
        },
        {
            title: "Normal",
            input: " \t\n\rabc\x00",
        },
        {
            title: "Long",
            input: "adsfWR \t\rab\nc\x00 \t\n\rabc\x00asdfWER\n\r",
        },
    }

    for _, c := range cases {
        b.Run("Replacer-"+c.title, func(b *testing.B) {
            for i := 0; i < b.N; i++ {
                r.Replace(c.input)
            }
        })
        b.Run("Map-"+c.title, func(b *testing.B) {
            for i := 0; i < b.N; i++ {
                strings.Map(remove, c.input)
            }
        })
    }

}

And now let's see the benchmark results:

BenchmarkReplaces/Replacer-None-4    100000000   12.3 ns/op    0 B/op  0 allocs/op
BenchmarkReplaces/Map-None-4         100000000   16.1 ns/op    0 B/op  0 allocs/op
BenchmarkReplaces/Replacer-Normal-4  20000000    92.7 ns/op    6 B/op  2 allocs/op
BenchmarkReplaces/Map-Normal-4       20000000    92.4 ns/op   16 B/op  2 allocs/op
BenchmarkReplaces/Replacer-Long-4     5000000   234 ns/op     64 B/op  2 allocs/op
BenchmarkReplaces/Map-Long-4          5000000   235 ns/op     80 B/op  2 allocs/op

Despite expectations, string.Replacer performs pretty good, just as good as strings.Map() due to it not having to decode and encode runes.

answered Oct 01 '22 08:10

icza

Related questions
                            
                                String literal without need to escape backslash
                            
                                What is the difference between " as string" and "stringvalue" in swift?
                            
                                How do I convert from a char array [char; N] to a string slice &str?
                            
                                Faster way to trim a long character vector in R [closed]
                            
                                Understanding String heap size in Javascript / V8
                            
                                PHP "&curren" string turns into weird symbol
                            
                                Swift remove ONLY trailing spaces from string
                            
                                Does making a char* point to another string literal leak memory?
                            
                                Python how to get value from argparse from variable, but not the name of the variable?
                            
                                Why is the time complexity of this example from "Cracking the Coding Interview" O(k c^k)?
                            
                                const volatile char string not printing properly
                            
                                Case insensitive string matching in awk
                            
                                Best Algorithm to make correction typos in text
                            
                                identify if QVariant is String or Int
                            
                                Return a local C-string for function with C++ string return value
                            
                                Forcing dict keys to be used as argument specifiers with str.format
                            
                                Split string and convert the substrings to integers in Go
                            
                                TypeError: Type aliases cannot be used with isinstance()
                            
                                Count spaces (ASCII code 32) at the beginning and end of string
                            
                                Interlacing two vectors [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

remove null character from string

Tags:

string

replace

go

removing-whitespace

Bhavana

People also ask

1 Answers

Using `strings.Replacer`

Using `strings.Map()`

Benchmarks

icza

Recent Activity

Donate For Us

remove null character from string

Tags:

string

replace

go

removing-whitespace

Bhavana

People also ask

1 Answers

Using strings.Replacer

Using strings.Map()

Benchmarks

icza

Related questions

Recent Activity

Donate For Us

Using `strings.Replacer`

Using `strings.Map()`