Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Dynamic Regex in Ruby

Tags:

regex

ruby

I am trying to find a way to let me dynamically create a regexp object from a string (taken from the database) and then use that to filter another string. This example is to extract data from a git commit message, but in theory any valid regexp could be present in the database as a string.

What happens

>> string = "[ALERT] Project: Revision ...123456 committed by Me <[email protected]>\n on 2009-   07-28 21:21:47\n\n    Fixed typo\n"
>> r = Regexp.new("[A-Za-z]+: Revision ...[\w]+ committed by [A-Za-z\s]+")
>> string[r]
=> nil

What I want to happen

>> string = "[ALERT] Project: Revision ...123456 committed by Me <[email protected]>\n on 2009-   07-28 21:21:47\n\n    Fixed typo\n"
>> string[/[A-Za-z]+: Revision ...[\w]+ committed by [A-Za-z\s]+/]
=> "Project: Revision 123456 committed by Me"
like image 444
davidsmalley Avatar asked Aug 01 '09 07:08

davidsmalley


1 Answers

You're only missing one thing:

>> Regexp.new "\w"
=> /w/
>> Regexp.new "\\w"
=> /\w/

Backslashes are escape characters in strings. If you want a literal backslash you have to double it.

>> string = "[ALERT] Project: Revision ...123456 committed by Me <[email protected]>\n on 2009-   07-28 21:21:47\n\n    Fixed typo\n"
=> "[ALERT] Project: Revision ...123456 committed by Me <[email protected]>\n on 2009-   07-28 21:21:47\n\n    Fixed typo\n"
>> r = Regexp.new("[A-Za-z]+: Revision ...[\\w]+ committed by [A-Za-z\\s]+")
=> /[A-Za-z]+: Revision ...[\w]+ committed by [A-Za-z\s]+/
>> string[r]
=> "Project: Revision ...123456 committed by Me "

Typically, if you'd pasted the output from your "broken" lines, rather than just the input, you'd probably have spotted that the w and s weren't escaped properly

like image 59
Gareth Avatar answered Nov 11 '22 22:11

Gareth