Restricting Python's syntax to execute user code safely. Is this a safe approach?

Tags:

Original question:

Executing mathematical user code on a python web server, what is the simplest secure way?

I want to be able to run user submitted code on a python webserver. The code will be simple and mathematical in nature.

As such a small subset of Python is required, my current approach is to whitelist allowable syntax by traversing Python's abstract syntax tree. Functions and names get special treatment; only explicitly whitelisted functions are allowed, and only unused names.

import ast

allowed_functions = set([
    #math library
    'acos', 'acosh', 'asin', 'asinh', 'atan', 'atan2', 'atanh',
    'ceil', 'copysign', 'cos', 'cosh', 'degrees', 'e', 'erf',
    'erfc', 'exp', 'expm1', 'fabs', 'factorial', 'floor', 'fmod',
    'frexp', 'fsum', 'gamma', 'hypot', 'isinf', 'isnan', 'ldexp',
    'lgamma', 'log', 'log10', 'log1p', 'modf', 'pi', 'pow', 'radians',
    'sin', 'sinh', 'sqrt', 'tan', 'tanh', 'trunc',
    #builtins
    'abs', 'max', 'min', 'range', 'xrange'
    ])

allowed_node_types = set([
    #Meta
    'Module', 'Assign', 'Expr',
    #Control
    'For', 'If', 'Else',
    #Data
    'Store', 'Load', 'AugAssign', 'Subscript',
    #Datatypes
    'Num', 'Tuple', 'List',
    #Operations
    'BinOp', 'Add', 'Sub', 'Mult', 'Div', 'Mod', 'Compare'
    ])

safe_names = set([
    'True', 'False', 'None'
    ])


class SyntaxChecker(ast.NodeVisitor):

    def check(self, syntax):
        tree = ast.parse(syntax)
        self.visit(tree)

    def visit_Call(self, node):
        if node.func.id not in allowed_functions:
            raise SyntaxError("%s is not an allowed function!"%node.func.id)
        else:
            ast.NodeVisitor.generic_visit(self, node)

    def visit_Name(self, node):
        try:
            eval(node.id)
        except NameError:
            ast.NodeVisitor.generic_visit(self, node)
        else:
            if node.id not in safe_names and node.id not in allowed_functions:
                raise SyntaxError("%s is a reserved name!"%node.id)
            else:
                ast.NodeVisitor.generic_visit(self, node)

    def generic_visit(self, node):
        if type(node).__name__ not in allowed_node_types:
            raise SyntaxError("%s is not allowed!"%type(node).__name__)
        else:
            ast.NodeVisitor.generic_visit(self, node)

if __name__ == '__main__':
    x = SyntaxChecker()
    while True:
        try:
            x.check(raw_input())
        except Exception as e:
            print e

This seems to accept the required syntax, but I am reasonably new to programming and could be missing any number of gaping security holes.

So my questions are: Is this secure, is there a better approach, and are there any other precautions I should be taking?

908

asked May 18 '12 23:05

SudoNhim

2 Answers

Have you looked at pypy's sandboxing features? It is reputedly much safer than any CPython sandboxing efforts. You can even limit the heap size and cpu execution time to prevent denial of service.

answered Oct 16 '22 10:10

Andrew Gorcester

Two points I noticed that you could still improve:

You should always escape any output that can be generated from some form of user input. In your example, the unallowed identifiers get mirrored unmodified back to the output. This could potentially be exploited, one example being Cross Site Scripting. Therefore I would additionally escape any such error message to prevent this.

Another thing you need to be aware of is Denial-of-Service attacks. Imagine someone whips up an Ackermann function and a script to submit it a couple of thousand times to your server... To prevent this, you should timebox the execution time of any code being submitted. This is essential, because this type of "attack" often happens unintentionally - someone managed to produce an infinite loop.

Edit:

Finally, I would also recommend to update your Python version to prevent a "hashDoS" attack.

answered Oct 16 '22 08:10

emboss

Related questions
                            
                                TypedChoiceField or ChoiceField in Django
                            
                                mask a 2D numpy array based on values in one column
                            
                                python - add cookie to cookiejar
                            
                                I don't understand Jinja2 Call Blocks
                            
                                Generating a audio waveform graphic within Python
                            
                                Is there a simple way to use Python libraries from Common Lisp?
                            
                                What does this error mean: invalid ELF header
                            
                                PyObjC on Xcode 4
                            
                                "sorted 1-d iterator" based on "2-d iterator" (Cartesian product of iterators)
                            
                                TeX in matplotlib on Mac OS X and TeX Live
                            
                                How would you create a comma-delimited string from a pyodbc result row?
                            
                                How to retrieve from python dict where key is only partially known?
                            
                                Accessing bitfields while reading/writing binary data structures
                            
                                Default constructor parameters in pyyaml
                            
                                How to iterate over Unicode characters in Python 3?
                            
                                NLTK Chunking and walking the results tree
                            
                                wrapping a numpy array in python
                            
                                Personalizing Online Assignments for a Statistics Class [closed]
                            
                                i th order statistic in Python
                            
                                Creating LaTeX math macros within Sphinx

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Restricting Python's syntax to execute user code safely. Is this a safe approach?

Tags:

python

security

abstract-syntax-tree

SudoNhim

People also ask

2 Answers

Andrew Gorcester

emboss

Recent Activity

Donate For Us