What would be an elegant way of preventing snakemake from failing upon shell/R error?

Tags:

snakemake

I would like to be able to have my snakemake workflows continue running even when certain rules fail.

For example, I'm using a variety of tools in order to perform peak-calling of ChIP-seq data. However, certain programs issue an error when they are not able to identify peaks. I would prefer to create an empty output file in such cases, and not having snakemake fail (like some peak-callers already do).

Is there a snakemake-like way of handling such cases, using the "shell" and "run" keywords?

Thanks

813

asked Aug 10 '17 12:08

rioualen

1 Answers

For shell commands, you can always take advantage conditional "or", ||:

rule some_rule:
    output:
        "outfile"
    shell:
        """
        command_that_errors || true
        """

# or...

rule some_rule:
    output:
        "outfile"
    run:
        shell("command_that_errors || true")

Usually an exit code of zero (0) means success, and anything non-zero indicates failure. Including || true ensures a successful exit when the command exits with a non-zero exit code (true always returns 0).

If you need to allow a specific non-zero exit code, you can use shell or Python to check the code. For Python, it would be something like the following. The shlex.split() module is used so shell commands do not need to passed as arrays of arguments.

import shlex

rule some_rule:
    output:
        "outfile"
    run:
        try:
           proc_output = subprocess.check_output(shlex.split("command_that_errors {output}"), shell=True)                       
        # an exception is raised by check_output() for non-zero exit codes (usually returned to indicate failure)
        except subprocess.CalledProcessError as exc: 
            if exc.returncode == 2: # 2 is an allowed exit code
                # this exit code is OK
                pass
            else:
                # for all others, re-raise the exception
                raise

In shell script:

rule some_rule:
    output:
        "outfile"
    run:
        shell("command_that_errors {output} || rc=$?; if [[ $rc == 2 ]]; then exit 0; else exit $?; fi")

186

answered Sep 25 '22 23:09

tomkinsc

Related questions
                            
                                Snakemake: I keep getting The 'conda' command is not available in $PATH. when running on SGE cluster
                            
                                Using wildcards in params
                            
                                R Draws Plots with Rectangles Instead of Text
                            
                                Snakemake wants to run job although output file already exists
                            
                                Symlink (auto-generated) directories via Snakemake
                            
                                How to get the basename of the wildcard values in the snakemake output rule?
                            
                                Snakemake how to execute downstream rules when an upstream rule fails
                            
                                Multiple inputs and outputs in a single rule Snakemake file
                            
                                Snakemake + docker example, how to use volumes
                            
                                snakemake - output one only file from multiple input files in one rule
                            
                                Restrict number of jobs by a rule in snakemake
                            
                                snakemake define folder as output
                            
                                Snakemake: 'Missing input files' due to wrong wildcard expansion
                            
                                How to properly use wildcards in input and output

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With