I am trying to validate some inputs to remove a set of characters. Only alphanumeric characters plus, period, underscore, hyphen are allowed. I've tested the regex expression [^\w.-]
here http://gskinner.com/RegExr/ and it matches what I want removed so I not sure why sed
is returning the opposite. What am I missing?
My end goal is to input "Â10.41.89.50 "
and get "10.41.89.50
".
I've tried:
echo "Â10.41.89.50 " | sed s/[^\w.-]//g
returns Â...
echo "Â10.41.89.50 " | sed s/[\w.-]//g
and echo "Â10.41.89.50 " | sed s/[\w^.-]//g
returns Â10418950
I attempted the answer found here Skip/remove non-ascii character with sed but nothing was removed.
Just use [a-zA-Z0-9_-] .
tr's -c
(complement) flag may be an option
echo "Â10.41.89.50-._ " | tr -cd '[:alnum:]._-'
You might want to use the [:alpha:]
class instead:
echo "Â10.41.89.50 " | sed "s/[[:alpha:].-]//g"
should work. If not, you might need to change your local settings.
On the other hand, if you only want to keep the digits, the hyphens and the period::
echo "Â10.41.89.50 " | sed "s/[^[:digit:].-]//g"
If your string is in a variable, you can use pure bash and parameter expansions for that:
$ dirty="Â10.41.89.50 "
$ clean=${dirty//[^[:digit:].-]/}
$ echo "$clean"
10.41.89.50
or
$ dirty="Â10.41.89.50 "
$ clean=${dirty//[[:alpha:]]/}
$ echo "$clean"
10.41.89.50
You can also have a look at 1_CR
's answer.
To remove all characters except of alphanumeric and "-" use this code:
echo "a b-1_2" | sed "s/[^[:alnum:]-]//g"
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With