I am trying to parse chemical formulas in Lua using simple pattern matching. However, I do not know how to specify a capture group as being optional. Here is the pattern I have come up with:
pattern = "(%u%l*)(%d*)"
The first group captures the atomic symbol (i.e. "H", "He", etc..) and the second group captures the number of that atom in the molecule. This value is usually an integer value, but if it is 1, it is often omitted, such as in:
formula = "C2H6O"
When I attempt to do a global match, if there is no match the result of count
is ''
instead of what I would anticipate of nil
.
compound = {}
for atom,count in string.gmatch(formula, pattern) do
compound[atom] = count or 1
end
Obviously I could just check if count = ''
but I was curious if there was an optional capturing group in Lua.
if there was an optional capturing group in Lua.
No; pattern items don't list captures as acceptable options, so you can't have, for example, (%d*)?
like you'd do in Perl.
There is no optional capturing group in Lua.
count
is the empty string instead of nil because the empty string matches %d*
.
Try this instead:
compound[atom] = tonumber(count) or 1
Note that tonumber
will return nil if count is the empty string, which is what you want to check.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With