How to remove all non - ASCII characters from a string in Ruby

Tags:

ruby

watir

I seems to be a very simple and much needed method. I need to remove all non ASCII characters from a string. e.g Â© etc. See the following example.

#coding: utf-8
s = " Hello this a mixed string Â© that I made."
puts s.encoding
puts s.encode

output:

UTF-8
Hello this a mixed str

ing ┬⌐ that I made.

When I feed this to Watir, it produces following error:incompatible character encodings: UTF-8 and ASCII-8BIT

So my problem is that I want to get rid of all non ASCII characters before using it. I will not know which encoding the source string "s" uses.

I have been searching and experimenting for quite some time now.

If I try to use

  puts s.encode('ASCII-8BIT')

It gives the error:

 : "\xC2\xA9" from UTF-8 to ASCII-8BIT (Encoding::UndefinedConversionError)

787

asked Jul 08 '10 04:07

Nick

1 Answers

You can just literally translate what you asked into a Regexp. You wrote:

I want to get rid of all non ASCII characters

We can rephrase that a little bit:

I want to substitue all characters which don't thave the ASCII property with nothing

And that's a statement that can be directly expressed in a Regexp:

s.gsub!(/\P{ASCII}/, '')

As an alternative, you could also use String#delete!:

s.delete!("^\u{0000}-\u{007F}")

147

answered Oct 26 '22 11:10

Jörg W Mittag

Related questions
                            
                                Initializing hashes
                            
                                Convert a string of 0-F into a byte array in Ruby
                            
                                How do i get request.uri in model in Rails?
                            
                                What is a good shopping cart gem for Rails?
                            
                                "__rvm_do_with_env_before" and "__rvm_after_cd" when doing "cd"
                            
                                Get own IP address
                            
                                How to parse a URL and extract the required substring
                            
                                Getting the model class from active record relation
                            
                                write csv in ruby 1.9 and CSV::Writer
                            
                                Creating a model that has a tree structure
                            
                                Bundler: how to use without rails?
                            
                                Get model or 404 on Rails
                            
                                The InstanceMethods module inside ActiveSupport::Concern.. Deprecation Warning
                            
                                Rails Carrierwave Base64 image upload
                            
                                Positive lookahead doesn't stop at first occurrence
                            
                                Ruby turn string into symbol
                            
                                Unique foreign key in rails migration
                            
                                rbenv and bundler: "bad interpreter: No such file or directory"
                            
                                Check if Internet Connection Exists with Ruby?
                            
                                how to create wizard forms in ruby on rails

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With