I would like to deploy a Python function I've written to AWS Lambda; the function depends on a bunch of Python libraries that I have already collected in a conda environment.
To set this up on Lambda, I'm supposed to zip this environment up, but the Lambda docs only give instructions for how to do this using pip/VirtualEnv. Does anyone have experience with this?
One option is to build your own AWS Lambda layer containing the Python packages your function depends on: use docker-lambda to run pip install so that all required dependencies land in a folder named python, then zip that folder and publish it as a layer.
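As a minimal sketch of that approach (the Python 3.8 runtime, the lambci/lambda build image tag, and the layer name my-deps are my own placeholders, not anything prescribed by Lambda):

# Install the dependencies into a folder named python/ using the docker-lambda build image.
mkdir -p python
docker run --rm -v "$PWD":/var/task lambci/lambda:build-python3.8 \
    pip install -r requirements.txt -t python/

# Zip the folder and publish it as a layer.
zip -r my-layer.zip python
aws lambda publish-layer-version \
    --layer-name my-deps \
    --zip-file fileb://my-layer.zip \
    --compatible-runtimes python3.8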
You should use the serverless framework in combination with the serverless-python-requirements plugin. You just need a requirements.txt, and the plugin automatically packages your code and the dependencies into a zip file, uploads everything to S3, and deploys your function. Bonus: since it can do this in a Docker container, it can also help you with packages that need binary dependencies.
Have a look here (https://serverless.com/blog/serverless-python-packaging/) for a how-to.
From experience I strongly recommend you look into that. Every bit of manual labour for deployment and such is something that keeps you from developing your logic.
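To make this concrete, here is a minimal sketch of such a project (the service name, runtime, and handler below are placeholders I chose for illustration; dockerizePip is the plugin option that enables the dockerized build mentioned above):

# Install the plugin into your serverless project (requires Node.js/npm).
npm install --save-dev serverless-python-requirements

# A minimal serverless.yml next to handler.py and requirements.txt.
cat > serverless.yml <<'EOF'
service: my-conda-function          # placeholder name
provider:
  name: aws
  runtime: python3.8                # pick the runtime your code targets
functions:
  main:
    handler: handler.lambda_handler # placeholder module.function
plugins:
  - serverless-python-requirements
custom:
  pythonRequirements:
    dockerizePip: true              # build binary dependencies in a Lambda-like container
EOF

# Package and deploy.
serverless deploy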
Edit 2017-12-17:
Your comment makes sense, @eelco-hoogendoorn.
However, in my mind a conda environment is just an encapsulated place where a bunch of Python packages live. So, if you put all these dependencies (from your conda env) into a requirements.txt (and use serverless + plugin), that would solve your problem, no? IMHO it would essentially be the same as zipping all the packages you installed in your env into your deployment package. That being said, here is a snippet which does essentially this:
conda env export --name Name_of_your_Conda_env | yq -r '.dependencies[] | .. | select(type == "string")' | sed -E "s/(^[^=]*)(=+)([0-9.]+)(=.*|$)/\1==\3/" > requirements.txt
Unfortunately, conda env export only exports the environment in YAML format; the --json flag doesn't work right now, but is supposed to be fixed in the next release. That is why I had to use yq instead of jq. You can install yq using pip install yq; it is just a wrapper around jq that also works with YAML files. The sed expression then converts conda-style pins into pip-style pins: for example, an entry such as numpy=1.13.3=py36_0 would become numpy==1.13.3.
KEEP IN MIND
A Lambda deployment package can only be 50 MB (zipped), so your environment shouldn't be too big.
I have not tried deploying a Lambda with serverless + serverless-python-requirements and a requirements.txt created like that, so I don't know if it will work.
The main reason why I use conda is the option not to compile different binary packages myself (like numpy, matplotlib, pyqt, etc.), or at least to compile them less frequently. When you do need to compile something yourself for a specific version of Python (like uwsgi), you should compile the binaries with the same gcc version that the Python in your conda environment was compiled with. Most probably that is not the same gcc your OS is using, since conda now ships recent gcc versions, which you are supposed to install with conda install gxx_linux-64.
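A quick way to check which compiler your conda Python was built with, and to pull in conda's own toolchain (a sketch; the version in the sample output is only illustrative):

# The version banner printed by Python includes the GCC it was compiled with.
python -c "import sys; print(sys.version)"
# e.g. 3.6.5 | packaged by conda-forge | ... [GCC 7.2.0]

# Install conda's C and C++ compilers into the environment so that your own
# extensions are built with a matching toolchain.
conda install gcc_linux-64 gxx_linux-64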
This leads us to two situations:
All your dependencies are pure Python. In that case you can simply save a list of them using pip freeze and bundle them as described in the Lambda docs for virtualenv.
You have some binary extensions. In that case, the binaries from your conda environment will not work with the Python used by AWS Lambda. Unfortunately, you will need to visit the page describing the execution environment (AMI: amzn-ami-hvm-2017.03.1.20170812-x86_64-gp2), set up that environment, build the binaries for its specific built-in Python version in a separate directory (together with the pure-Python packages), and then bundle everything into a zip archive; a sketch of this packaging step is shown below.
This is a general answer to your question, but the main idea is that you cannot reuse your binary packages, only a list of them.
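To illustrate the second situation, here is a minimal sketch of that build-and-bundle step. It uses the lambci/lambda build container as a stand-in for the AMI described above, which is my own shortcut rather than what this answer prescribes; the Python 3.6 runtime, the build/ directory, and handler.py are placeholders:

# Save the list of dependencies from your conda environment.
pip freeze > requirements.txt

# Build everything (including binary extensions) against a Lambda-compatible
# environment and install it into a separate directory.
docker run --rm -v "$PWD":/var/task lambci/lambda:build-python3.6 \
    pip install -r requirements.txt -t build/

# Bundle the dependencies and your handler code into the deployment zip.
cd build && zip -r ../lambda-deployment.zip . && cd ..
zip -g lambda-deployment.zip handler.py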