We're trying to build a wrapper script over a command line tool we're using. We would like to set some tool arguments based on options in our wrapper scripts. We would also like to have the possibility to pass native arguments to the command line tool directly as they are written on the command line. Here is what we came up with: <pre class="prettyprint"><code>import argparse parser = argparse.ArgumentParser() parser.add_argument('positional') parser.add_argument('-f', '--foo', action='store_true') parser.add_argument('-b', '--bar', action='store_true') parser.add_argument('native_arg', nargs='*') args = parser.parse_args() print (args) </code></pre> <code>positional</code> is mandatory. Based on the options <code>-f</code> and <code>-b</code> we would add some extra options to our tool call. Anything that is left afterwards (if anything) should be treated as a native tool argument and given to the tool directly. Calling our script with <code>-h</code> produces the following usage: <pre class="prettyprint"><code>usage: test.py [-h] [-f] [-b] positional [native_arg [native_arg ...]] </code></pre> The trick is that these native arguments are themselves options for the tool and contain leading dashes, for example <code>-native0</code> and <code>-native1</code>. We already know about the trick with the double dash to stop argparse from looking for more options. The following call: <pre class="prettyprint"><code>./test.py pos -- -native0 -native1 </code></pre> produces the expected parsed arguments: <pre class="prettyprint"><code>Namespace(bar=False, foo=False, native_arg=['-native0', '-native1'], positional='pos') </code></pre> Trying to add an option after the first positional argument doesn't work, though. More specifically, the following call: <pre class="prettyprint"><code>./test.py pos --foo -- -native0 -native1 </code></pre> produces the following output: <pre class="prettyprint"><code>usage: [...shortened...] test.py: error: unrecognized arguments: -- -native0 -native1 </code></pre> Putting the optional arguments before the positionals: <pre class="prettyprint"><code>./test.py --foo pos -- -native0 -native1 </code></pre> seems to work, as the following is printed: <pre class="prettyprint"><code>Namespace(bar=False, foo=True, native_arg=['-native0', '-native1'], positional='pos') </code></pre> Even stranger, changing the value of <code>nargs</code> for <code>native_arg</code> to <code>'+'</code> works in all the above situations (with the caveat, of course, that at least one <code>native_arg</code> is expected). Are we doing something wrong in our Python code or is this some kind of argparse bug?

This is a known issue (https://bugs.python.org/issue15112, argparse: nargs='*' positional argument doesn't accept any items if preceded by an option and another positional) The parsing alternates handling positionals and optionals. When dealing with positionals it tries to handle as many as the input strings require. But an <code>?</code> or <code>*</code> positional is satisfied with <code>[]</code>, an empty list of strings. <code>+</code> on the other hand requires at least one string <pre class="prettyprint"><code>./test.py pos --foo -- -native0 -native1 </code></pre> The parser gives 'pos' to <code>positional</code>, and <code>[]</code> to <code>native-arg</code>. Then it gives '--foo' to its optional. There aren't anymore <code>positionals</code> left to hand the remaining strings, so it raises the error. The allocation of input strings is done with a stylized form of <code>regex</code> string matching. Imagine matching a pattern that looks like <code>AA?</code>. To correct this, parser would have to look ahead, and delay handling <code>native-arg</code>. We've suggested patches but they aren't in production. @SethMMorton's suggestion of using <code>parse_known_args</code> is a good one. Earlier parsers (e.g. Optparse) handle all the flagged arguments, but return the rest, the positionals, as a undifferentiated list. It's up to the user to split that list. <code>argparse</code> has added the ability to name and parse <code>positionals</code>, but the algorithm works best with fixed <code>nargs</code>, and gets flaky with too many variable <code>nargs</code>.

'argparse' with optional positional arguments that start with dash

Tags:

python

argparse

We're trying to build a wrapper script over a command line tool we're using. We would like to set some tool arguments based on options in our wrapper scripts. We would also like to have the possibility to pass native arguments to the command line tool directly as they are written on the command line.

Here is what we came up with:

import argparse

parser = argparse.ArgumentParser()

parser.add_argument('positional')
parser.add_argument('-f', '--foo', action='store_true')
parser.add_argument('-b', '--bar', action='store_true')

parser.add_argument('native_arg', nargs='*')

args = parser.parse_args()
print (args)

positional is mandatory. Based on the options -f and -b we would add some extra options to our tool call. Anything that is left afterwards (if anything) should be treated as a native tool argument and given to the tool directly. Calling our script with -h produces the following usage:

usage: test.py [-h] [-f] [-b] positional [native_arg [native_arg ...]]

The trick is that these native arguments are themselves options for the tool and contain leading dashes, for example -native0 and -native1. We already know about the trick with the double dash to stop argparse from looking for more options. The following call:

./test.py pos -- -native0 -native1

produces the expected parsed arguments:

Namespace(bar=False, foo=False, native_arg=['-native0', '-native1'], positional='pos')

Trying to add an option after the first positional argument doesn't work, though. More specifically, the following call:

./test.py pos --foo -- -native0 -native1

produces the following output:

usage: [...shortened...]
test.py: error: unrecognized arguments: -- -native0 -native1

Putting the optional arguments before the positionals:

./test.py --foo pos -- -native0 -native1

seems to work, as the following is printed:

Namespace(bar=False, foo=True, native_arg=['-native0', '-native1'], positional='pos')

Even stranger, changing the value of nargs for native_arg to '+' works in all the above situations (with the caveat, of course, that at least one native_arg is expected).

Are we doing something wrong in our Python code or is this some kind of argparse bug?

259

asked Nov 09 '17 15:11

Tudor Timi

2 Answers

argparse does have a hard time when you mix non-required positional arguments with optional arguments (see https://stackoverflow.com/a/47208725/1399279 for details into the bug report). Rather than suggesting a way to solve this issue, I am going to present an alternative approach.

You should check out the parse_known_args method, which was created for the situation you describe (i.e. passing options to a wrapped tool).

In [1]: import argparse

In [2]: parser = argparse.ArgumentParser()

In [3]: parser.add_argument('positional')

In [4]: parser.add_argument('-f', '--foo', action='store_true')

In [5]: parser.add_argument('-b', '--bar', action='store_true')

In [6]: parser.parse_known_args(['pos', '--foo', '-native0', '-native1'])
Out[6]: (Namespace(bar=False, foo=True, positional='pos'), ['-native0', '-native1'])

Unlike parse_args, the output of parse_known_args is a two-element tuple. The first element is the Namespace instance you would expect to get from parse_args, and it contains all the attributes defined by calls to add_argument. The second element is a list of all the arguments not known to the parser.

I personally prefer this method because the user does not need to remember any tricks about how to call your program, or which option order does not result in errors.

134

answered Nov 03 '22 09:11

SethMMorton

This is a known issue (https://bugs.python.org/issue15112, argparse: nargs='*' positional argument doesn't accept any items if preceded by an option and another positional)

The parsing alternates handling positionals and optionals. When dealing with positionals it tries to handle as many as the input strings require. But an ? or * positional is satisfied with [], an empty list of strings. + on the other hand requires at least one string

./test.py pos --foo -- -native0 -native1

The parser gives 'pos' to positional, and [] to native-arg. Then it gives '--foo' to its optional. There aren't anymore positionals left to hand the remaining strings, so it raises the error.

The allocation of input strings is done with a stylized form of regex string matching. Imagine matching a pattern that looks like AA?.

To correct this, parser would have to look ahead, and delay handling native-arg. We've suggested patches but they aren't in production.

@SethMMorton's suggestion of using parse_known_args is a good one.

Earlier parsers (e.g. Optparse) handle all the flagged arguments, but return the rest, the positionals, as a undifferentiated list. It's up to the user to split that list. argparse has added the ability to name and parse positionals, but the algorithm works best with fixed nargs, and gets flaky with too many variable nargs.

answered Nov 03 '22 10:11

hpaulj

Related questions
                            
                                What is the purpose and result of using INADDR_ANY?
                            
                                Security issues I should be aware of with jupyter notebook?
                            
                                Limit/minimize step size in scipy optimization?
                            
                                How to recover original indices for a flattened Numpy array?
                            
                                Possible to insert row at specific position with python-docx?
                            
                                How to change the linestyle of whiskers in pandas boxplots?
                            
                                pdb: set a breakpoint on file which isn't in sys.path
                            
                                Convert unix timestamp in Python to datetime and make 2 Hours behind
                            
                                How to get max value and name from a Pandas series?
                            
                                cx_Freeze build error?
                            
                                Matplotlib: expand legend vertically
                            
                                Adding silent frame to wav file using python
                            
                                Pandas groupby() on one column and then sum on another
                            
                                Python - pysftp / paramiko - Verify host key using its fingerprint
                            
                                Python SSL Certification Problems in Tensorflow
                            
                                Python to close own CMD shell window on exit
                            
                                Is line-joining unsupported by f-strings?
                            
                                Is there a way to close the file PdfFileReader opens?
                            
                                How to send FIX logon message with Python to GDAX/Coinbase
                            
                                Pandas explode list of dictionaries into rows

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With