I have a web-crawling Python script that takes hours to complete and is infeasible to run in its entirety on my local machine. Is there a convenient way to deploy this to a simple web server? The script basically downloads webpages into text files. How would this be best accomplished? Thanks!
Since you said that performance is a problem and you are doing web scraping, the first thing to try is the Scrapy framework: a very fast and easy-to-use web-scraping framework. The scrapyd tool then lets you distribute the crawling - you can run multiple scrapyd services on different servers and split the load between them. See the scrapyd documentation for the deployment details.
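Here is a minimal sketch of what such a spider could look like for your use case (dumping each page into a text file). The spider name, seed URL, allowed domain, and output directory are placeholders, not taken from your script:

```python
import os
from urllib.parse import urlparse

import scrapy


class PageSaverSpider(scrapy.Spider):
    name = "pagesaver"
    allowed_domains = ["example.com"]        # placeholder: keeps the crawl contained
    start_urls = ["https://example.com/"]    # placeholder seed URL

    def parse(self, response):
        # Build a filename from the URL and dump the raw page body to disk.
        parsed = urlparse(response.url)
        filename = (parsed.netloc + parsed.path).strip("/").replace("/", "_") or "index"
        os.makedirs("pages", exist_ok=True)
        with open(os.path.join("pages", filename + ".txt"), "wb") as f:
            f.write(response.body)

        # Follow links on the page so the crawl keeps going.
        for href in response.css("a::attr(href)").getall():
            yield response.follow(href, callback=self.parse)
```

You can try it locally with `scrapy runspider pagesaver_spider.py` before moving it into a full Scrapy project for scrapyd.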
There is also a Scrapy Cloud service out there:
Scrapy Cloud bridges the highly efficient Scrapy development environment with a robust, fully-featured production environment to deploy and run your crawls. It's like a Heroku for Scrapy, although other technologies will be supported in the near future. It runs on top of the Scrapinghub platform, which means your project can scale on demand, as needed.
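If you go the self-hosted scrapyd route instead, then once the project is deployed (for example with scrapyd-client's `scrapyd-deploy` command) you can start crawls remotely through scrapyd's JSON API. A small sketch, where the host, project name, and spider name are assumptions to replace with your own:

```python
import requests

SCRAPYD = "http://my-server.example.com:6800"  # assumed scrapyd host

# Schedule a run of the spider on the remote scrapyd service.
resp = requests.post(
    f"{SCRAPYD}/schedule.json",
    data={"project": "mycrawler", "spider": "pagesaver"},
)
print(resp.json())  # e.g. {"status": "ok", "jobid": "..."} on success
```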
As an alternative to the solutions already given, I would suggest Heroku. You can not only easily deploy a website there, but also scripts and bots that run in the background.
The basic account is free and pretty flexible.
This blog entry, this one, and this video contain practical examples of how to make it work.
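On Heroku, a long-running crawler is usually declared as a worker process in the Procfile (e.g. `worker: python crawl.py`). Below is a minimal sketch of that kind of standalone script, assuming a hypothetical hard-coded URL list and local output directory:

```python
import os

import requests

# Placeholder URL list -- in practice this would come from your crawl frontier.
URLS = [
    "https://example.com/",
    "https://example.org/",
]

os.makedirs("pages", exist_ok=True)

for i, url in enumerate(URLS):
    try:
        resp = requests.get(url, timeout=30)
        resp.raise_for_status()
    except requests.RequestException as exc:
        print(f"skipping {url}: {exc}")
        continue
    # Save the page text to a numbered file.
    with open(os.path.join("pages", f"page_{i}.txt"), "w", encoding="utf-8") as f:
        f.write(resp.text)
    print(f"saved {url}")
```

Keep in mind that a Heroku dyno's filesystem is ephemeral, so for a real crawl you would push the resulting text files to external storage (e.g. S3) rather than keeping them on the dyno.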