Possible Duplicates:
Regex to match URL
regex to remove the webpage part of a url in ruby
I am in search of a regular expression for parsing all the urls in a file.
i tried many of the regular expression i got after googling but it fails in one or the other case . my idea is to write one which checks the presense of http or https at the begening and it will match everything untill it sees a blank space .
any ideas ?
NOTE : i dont need to parse the url but erase all the urls from a file or atleast make it unreadable .
The standard URI library provides URI.regexp
which is the regular expression for url string.
require 'uri'
string.scan(URI.regexp)
http://ruby-doc.org/stdlib/libdoc/uri/rdoc/index.html
You can try this:
/https?:\/\/[\S]+/
The \S
means any non-whitespace character.
(Rubular)
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With