Removing non-alphanumeric characters with sed

Tags:

regex

bash

replace

sed

I am trying to validate some inputs by removing a set of characters. Only alphanumeric characters plus period, underscore, and hyphen are allowed. I've tested the regex [^\w.-] at http://gskinner.com/RegExr/ and it matches what I want removed, so I'm not sure why sed is returning the opposite. What am I missing?

My end goal is to input "Â10.41.89.50 " and get "10.41.89.50".

I've tried:

echo "Â10.41.89.50 " | sed s/[^\w.-]//g returns Â...

echo "Â10.41.89.50 " | sed s/[\w.-]//g and echo "Â10.41.89.50 " | sed s/[\w^.-]//g returns Â10418950

I attempted the answer found in "Skip/remove non-ascii character with sed", but nothing was removed.

786

asked Nov 15 '13 17:11

wanderingandy

3 Answers

tr's -c (complement) flag may be an option

echo "Â10.41.89.50-._ " | tr -cd '[:alnum:]._-'
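If this is needed in more than one place, the same tr call could be wrapped in a small function. This is only a sketch: the name sanitize is made up for illustration, and forcing LC_ALL=C is an assumption on my part so that [:alnum:] means ASCII letters and digits regardless of the caller's locale.

```shell
# sanitize is a hypothetical helper name, not part of the original answer.
# It keeps only ASCII letters, digits, period, underscore and hyphen.
sanitize() {
  # -c complements the set, -d deletes everything in the complement.
  # The hyphen is last in the set so tr reads it literally, not as a range.
  printf '%s' "$1" | LC_ALL=C tr -cd '[:alnum:]._-'
}

clean=$(sanitize "Â10.41.89.50-._ ")
printf '%s\n' "$clean"   # prints 10.41.89.50-._
```

Note that tr -cd with this set also deletes newlines, so it suits single values better than multi-line text.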

178

answered Oct 07 '22 12:10

iruvar

You might want to use the [:alpha:] class instead:

echo "Â10.41.89.50 " | sed "s/[[:alpha:]]//g"

should strip the Â (the trailing space is left alone). If not, you might need to change your locale settings.
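Some background on why a POSIX class is needed at all, as far as I understand it: inside a bracket expression sed treats the backslash literally, so the [\w.-] from the question means "backslash, w, period or hyphen" rather than "word character". The locale caveat above can also be sidestepped by forcing the C locale, where [:alnum:] only covers ASCII. A quick sketch of both points, assuming a POSIX or GNU sed:

```shell
# Inside [...] the backslash is literal: this deletes backslashes,
# the letter w, periods and hyphens, and nothing else.
printf '%s\n' 'w\x.y-z' | sed 's/[\w.-]//g'    # prints xyz

# Whitelist for the question's input. LC_ALL=C restricts [:alnum:] to
# ASCII letters and digits, so the bytes of the accented letter are removed.
printf '%s\n' 'Â10.41.89.50 ' | LC_ALL=C sed 's/[^[:alnum:]._-]//g'    # prints 10.41.89.50
```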

On the other hand, if you only want to keep digits, hyphens and periods:

echo "Â10.41.89.50 " | sed "s/[^[:digit:].-]//g"

If your string is in a variable, you can use pure bash and parameter expansions for that:

$ dirty="Â10.41.89.50 "
$ clean=${dirty//[^[:digit:].-]/}
$ echo "$clean"
10.41.89.50

or

$ dirty="Â10.41.89.50 "
$ clean=${dirty//[[:alpha:]]/}
$ echo "$clean"
10.41.89.50
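The first expansion above could be wrapped in a function that also reports whether anything had to be stripped. This is a rough sketch, not a definitive implementation: the function name strip_to_ip_chars and the warning text are invented for illustration.

```shell
# Requires bash: ${var//pattern/} is not available in plain POSIX sh.
# strip_to_ip_chars is a hypothetical name wrapping the expansion shown above.
strip_to_ip_chars() {
  local dirty=$1
  # Delete every character that is not a digit, a period or a hyphen.
  printf '%s\n' "${dirty//[^[:digit:].-]/}"
}

input="Â10.41.89.50 "
clean=$(strip_to_ip_chars "$input")

# If the cleaned value differs, the input contained disallowed characters.
if [[ $clean != "$input" ]]; then
  echo "warning: input contained disallowed characters" >&2
fi
echo "$clean"   # prints 10.41.89.50
```

Comparing the cleaned value with the original is a cheap way to decide between silently fixing the input and rejecting it.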

You can also have a look at 1_CR's answer.

answered Oct 07 '22 12:10

gniourf_gniourf

To remove all characters except alphanumerics and "-", use this code:

echo "a b-1_2" | sed "s/[^[:alnum:]-]//g"
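For comparison, here is what that pattern does to the sample input, next to a variant that also allows period and underscore as the question asks. The second input string is made up for illustration. In both patterns the hyphen sits last inside the brackets so that it is taken literally instead of forming a range.

```shell
# The pattern above: the space and the underscore are removed, the hyphen is kept.
echo "a b-1_2" | sed 's/[^[:alnum:]-]//g'        # prints ab-12

# Period and underscore added to the allowed set; hyphen still last.
echo "a b-1_2.txt" | sed 's/[^[:alnum:]._-]//g'  # prints ab-1_2.txt
```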

answered Oct 07 '22 12:10

panticz
