Is there a way to know how the user invoked a program from bash?

Tags:

linux

bash

command-line-interface

Here's the problem: I have this script foo.py, and if the user invokes it without the --bar option, I'd like to display the following error message:

Please add the --bar option to your command, like so:
    python foo.py --bar

Now, the tricky part is that there are several ways the user might have invoked the command:

They may have used python foo.py like in the example
They may have used /usr/bin/foo.py
They may have a shell alias frob='python foo.py', and actually ran frob
Maybe it's even a git alias flab=!/usr/bin/foo.py, and they used git flab

In every case, I'd like the message to reflect how the user invoked the command, so that the example I'm providing would make sense.

sys.argv always contains foo.py, and /proc/$$/cmdline doesn't know about aliases. It seems to me that the only possible source for this information would be bash itself, but I don't know how to ask it.

Any ideas?

UPDATE How about if we limit possible scenarios to only those listed above?

UPDATE 2: Plenty of people wrote very good explanations of why this is not possible in the general case, so I would like to limit my question to this:

Under the following assumptions:

The script was started interactively, from bash
The script was started in one of these 3 ways:
1. foo <args> where foo is a symbolic link /usr/bin/foo -> foo.py
2. git foo where alias.foo=!/usr/bin/foo in ~/.gitconfig
3. git baz where alias.baz=!/usr/bin/foo in ~/.gitconfig

Is there a way to distinguish between 1 and (2,3) from within the script? Is there a way to distinguish between 2 and 3 from within the script?

I know this is a long shot, so I'm accepting Charles Duffy's answer for now.

UPDATE 3: So far, the most promising angle was suggested by Charles Duffy in the comments below. If I can get my users to have

trap 'export LAST_BASH_COMMAND=$(history 1)' DEBUG

in their .bashrc, then I can use something like this in my code:

import os

like_so = None
cmd = os.environ.get('LAST_BASH_COMMAND')  # None if the trap is not installed
if cmd is not None:
    cmd = cmd[8:]  # Remove the history counter
    if cmd.startswith("foo "):
        like_so = "foo --bar " + cmd[4:]
    elif cmd.startswith(r"git foo "):
        like_so = "git foo --bar " + cmd[8:]
    elif cmd.startswith(r"git baz "):
        like_so = "git baz --bar " + cmd[8:]
if like_so is not None:
    print("Please add the --bar option to your command, like so:")
    print("    " + like_so)
else:
    print("Please add the --bar option to your command.")

This way, I show the general message if I don't manage to get their invocation method. Of course, if I'm going to rely on changing my users' environment I might as well ensure that the various aliases export their own environment variables that I can look at, but at least this way allows me to use the same technique for any other script I might add later.
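A slightly more defensive way to read that variable, as a minimal sketch only: it assumes the trap above is installed and that `history 1` prints the usual counter in front of the command. The helper name `last_bash_command` is made up for illustration, and the regular expression is a guess at the layout rather than a documented format.

```python
import os
import re

def last_bash_command(environ=None):
    """Return the last command typed in bash, or None if it is unavailable."""
    environ = os.environ if environ is None else environ
    raw = environ.get("LAST_BASH_COMMAND")  # absent unless the DEBUG trap is installed
    if raw is None:
        return None
    # Assumed layout of `history 1`: optional spaces, the counter,
    # an optional "*" marker, whitespace, then the command itself.
    match = re.match(r"\s*\d+\*?\s+(.*)", raw, re.DOTALL)
    return match.group(1) if match else raw.strip()

print(last_bash_command({"LAST_BASH_COMMAND": "  501  git baz --verbose"}))
```

Matching the counter with a pattern avoids depending on a fixed column width, and a missing variable simply falls back to the general message.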

371

asked Jul 12 '18 09:07

itsadok

2 Answers

No, there is no way to see the original text (before aliases/functions/etc).

Starting a program in UNIX is done as follows at the underlying syscall level:

int execve(const char *path, char *const argv[], char *const envp[]);

Notably, there are three arguments:

The path to the executable
An argv array (the first item of which -- argv[0] or $0 -- is passed to that executable to reflect the name under which it was started)
A list of environment variables

Nowhere in here is there a string that provides the original user-entered shell command from which the new process's invocation was requested. This is particularly true since not all programs are started from a shell at all; consider the case where your program is started from another Python script with shell=False.

It's completely conventional on UNIX to assume that your program was started through whatever name is given in `argv[0]`; this works for symlinks.
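As a rough sketch of that convention applied to the message from the question (the function name `missing_bar_hint` and the `foo.py` fallback are assumptions for illustration, not part of any existing script):

```python
import os
import sys

def missing_bar_hint(argv=None):
    """Build the error message using the name the program was started under."""
    argv = sys.argv if argv is None else argv
    # argv[0] is whatever the caller passed; for a symlink it is the link's name.
    prog = os.path.basename(argv[0]) if argv and argv[0] else "foo.py"
    return ("Please add the --bar option to your command, like so:\n"
            "    {} --bar".format(prog))

print(missing_bar_hint(["/usr/bin/foo"]))
```

This covers the symlink case, but not shell or git aliases, because those never reach `argv[0]`.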

You can even see standard UNIX tools doing this:

$ ls '*.txt'         # sample command to generate an error message; note "ls:" at the front
ls: *.txt: No such file or directory
$ (exec -a foobar ls '*.txt')   # again, but tell it that its name is "foobar"
foobar: *.txt: No such file or directory
$ alias somesuch=ls             # this **doesn't** happen with an alias
$ somesuch '*.txt'              # ...the program still sees its real name, not the alias!
ls: *.txt: No such file or directory
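The `exec -a` trick can be reproduced from Python, which shows that the name is simply whatever the parent chooses to pass. This is a minimal sketch assuming Python 3.7+ and a POSIX `/bin/sh`; the helper name `name_seen_by_child` is hypothetical.

```python
import subprocess

def name_seen_by_child(argv0):
    """Start /bin/sh with a caller-chosen argv[0] and return what it reports as $0."""
    result = subprocess.run(
        [argv0, "-c", 'printf "%s" "$0"'],  # args[0] becomes the child's argv[0]
        executable="/bin/sh",               # the file that is actually executed
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout

print(name_seen_by_child("foobar"))
```

The child has no way to tell that `foobar` is not its real name, and equally no way to learn what the user typed to get there.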

If you do want to generate a UNIX command line, use `pipes.quote()` (Python 2) or `shlex.quote()` (Python 3) to do it safely.

try:
    from shlex import quote # Python 3
except ImportError:
    from pipes import quote # Python 2

cmd = ' '.join(quote(s) for s in open('/proc/self/cmdline', 'r').read().split('\0')[:-1])
print("We were called as: {}".format(cmd))

Again, this won't "un-expand" aliases, revert to the code that was invoked to call a function that invoked your command, etc; there is no un-ringing that bell.

The same /proc data can, however, be used to look for a git instance in your parent process tree and discover its argument list:

import re  # quote() comes from the import in the snippet above

def find_cmdline(pid):
    return open('/proc/%d/cmdline' % (pid,), 'r').read().split('\0')[:-1]

def find_ppid(pid):
    stat_data = open('/proc/%d/stat' % (pid,), 'r').read()
    stat_data_sanitized = re.sub('[(]([^)]+)[)]', '_', stat_data)
    return int(stat_data_sanitized.split(' ')[3])

def all_parent_cmdlines(pid):
    while pid > 0:
        yield find_cmdline(pid)
        pid = find_ppid(pid)

def find_git_parent(pid):
    for cmdline in all_parent_cmdlines(pid):
        if cmdline[0] == 'git':
            return ' '.join(quote(s) for s in cmdline)
    return None
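Once a git ancestor has been found, its argument list is what tells cases 2 and 3 from the question apart. The following is only a sketch: it takes the argument list as a Python list (the shape `find_cmdline` returns), the name `hint_from_git_cmdline` is invented for illustration, and it assumes the alias name is the first argument after `git`, which would not hold if global git options come first.

```python
import os

try:
    from shlex import quote  # Python 3
except ImportError:
    from pipes import quote  # Python 2

def hint_from_git_cmdline(cmdline, option="--bar"):
    """Turn e.g. ['git', 'baz', 'x'] into 'git baz --bar x'; None if not a git call."""
    if len(cmdline) < 2 or os.path.basename(cmdline[0]) != "git":
        return None
    # Assumption: cmdline[1] is the alias name (no global git options before it).
    parts = [cmdline[0], cmdline[1], option] + list(cmdline[2:])
    return " ".join(quote(s) for s in parts)

print(hint_from_git_cmdline(["git", "baz", "some file"]))
```

Comparing on the basename also accepts a git that was started through a full path such as `/usr/bin/git`.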

147

answered Oct 23 '22 06:10

Charles Duffy

See the Note at the bottom regarding the originally proposed wrapper script.

A newer, more flexible approach is for the Python script to accept an additional command-line option that lets users specify the string they would prefer to see in error messages.

For example, if a user prefers to call the python script 'myPyScript.py' via an alias, they can change the alias definition from this:

  alias myAlias='myPyScript.py'

to this:

  alias myAlias='myPyScript.py --caller=myAlias'

If they prefer to call the python script from a shell script, it can use the additional command line option like so:

  #!/bin/bash
  exec myPyScript.py "$@" --caller="${0##*/}"

Other possible applications of this approach:

  bash -c 'myPyScript.py --caller="bash -c myPyScript.py"'

  myPyScript.py --caller=myPyScript.py
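On the Python side, handling such an option could look roughly like the sketch below. It is an illustration under assumptions, not the actual script: `missing_bar_message` is a made-up name, and the fallback to the basename of `argv[0]` is one possible choice when `--caller` is not given.

```python
import argparse
import os

def missing_bar_message(argv):
    """Return the error text if --bar is absent, else None."""
    parser = argparse.ArgumentParser(add_help=False)
    parser.add_argument("--bar", action="store_true")
    parser.add_argument("--caller", default=None)  # string to show in messages
    # parse_known_args leaves any other arguments alone instead of failing.
    args, _unused = parser.parse_known_args(argv[1:])
    if args.bar:
        return None
    shown = args.caller or os.path.basename(argv[0])
    return ("Please add the --bar option to your command, like so:\n"
            "    {} --bar".format(shown))

print(missing_bar_message(["myPyScript.py", "--caller=myAlias"]))
```

The script never has to guess how it was invoked; whoever defines the alias or wrapper states the name to display.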

For listing expanded command lines, here's a script 'pytest.py', based on feedback by @CharlesDuffy, that lists the cmdline of the running Python script as well as that of the parent process that spawned it. If the new --caller argument is used, it will appear in the command line, although aliases will already have been expanded.

#!/usr/bin/env python

import os

def commandLine(pid):
  # /proc/<pid>/cmdline holds the process's argv, NUL-separated and NUL-terminated
  with open("/proc/%d/cmdline" % pid, "r") as myfile:
    return myfile.read().split('\x00')[:-1]

pid_cmdline = commandLine(os.getpid())
ppid_cmdline = commandLine(os.getppid())

# print() with a single argument works on both Python 2 and Python 3
print("%r" % pid_cmdline)
print("%r" % ppid_cmdline)

After saving this to a file named 'pytest.py', and then calling it from a bash script called 'pytest.sh' with various arguments, here's the output:

$ ./pytest.sh a b "c d" e
['python', './pytest.py']
['/bin/bash', './pytest.sh', 'a', 'b', 'c d', 'e']

NOTE: The criticisms of the original wrapper script aliasTest.sh were valid. Although the existence of a pre-defined alias is part of the specification of the question, and may be presumed to exist in the user's environment, the proposal defined the alias itself (creating the misleading impression that it was part of the recommendation rather than a given part of the user's environment), and it didn't show how the wrapper would communicate with the called Python script. In practice, the user would have to either source the wrapper or define the alias within the wrapper, the Python script would have to delegate the printing of error messages to multiple custom calling scripts (where the calling information resides), and clients would have to call the wrapper scripts. Solving those problems led to a simpler approach that extends to any number of additional use cases.

Here's a less confusing version of the original script, for reference:

#!/bin/bash
shopt -s expand_aliases
alias myAlias='myPyScript.py'

# called like this:
set -o history
myAlias "$@"
_EXITCODE=$?
CALL_HISTORY=( $(history) )
_CALLING_MODE=${CALL_HISTORY[1]}

case "$_EXITCODE" in
0) # no error message required
  ;;
1)
  echo "customized error message #1 [$_CALLING_MODE]" 1>&2
  ;;
2)
  echo "customized error message #2 [$_CALLING_MODE]" 1>&2
  ;;
esac

Here's the output:

$ aliasTest.sh 1 2 3
['./myPyScript.py', '1', '2', '3']
customized error message #2 [myAlias]

answered Oct 23 '22 08:10

philwalk
