Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Optional Group Capture with Lua Pattern Matching

I am trying to parse chemical formulas in Lua using simple pattern matching. However, I do not know how to specify a capture group as being optional. Here is the pattern I have come up with:

pattern = "(%u%l*)(%d*)"

The first group captures the atomic symbol (i.e. "H", "He", etc..) and the second group captures the number of that atom in the molecule. This value is usually an integer value, but if it is 1, it is often omitted, such as in:

formula = "C2H6O"

When I attempt to do a global match, if there is no match the result of count is '' instead of what I would anticipate of nil.

compound = {}
for atom,count in string.gmatch(formula, pattern) do
    compound[atom] = count or 1
end

Obviously I could just check if count = ''but I was curious if there was an optional capturing group in Lua.

like image 825
Moop Avatar asked Sep 25 '14 17:09

Moop


2 Answers

if there was an optional capturing group in Lua.

No; pattern items don't list captures as acceptable options, so you can't have, for example, (%d*)? like you'd do in Perl.

like image 114
Paul Kulchenko Avatar answered Oct 23 '22 18:10

Paul Kulchenko


There is no optional capturing group in Lua.

count is the empty string instead of nil because the empty string matches %d*.

Try this instead:

compound[atom] = tonumber(count) or 1

Note that tonumber will return nil if count is the empty string, which is what you want to check.

like image 26
lhf Avatar answered Oct 23 '22 20:10

lhf