What I need: to run a Scrapy spider repeatedly, once every 60 seconds, from a script.

I tried this:
from scrapy.crawler import CrawlerProcess
from scrapy.utils.project import get_project_settings
from time import sleep

while True:
    process = CrawlerProcess(get_project_settings())
    process.crawl('spider_name')
    process.start()
    sleep(60)
But I get this error:
twisted.internet.error.ReactorNotRestartable
Please help me do this correctly.
Python 3.6
Scrapy 1.3.2
Linux
To force a spider to close you can raise a CloseSpider exception, as described in the Scrapy docs. Just be sure to return/yield your items before you raise the exception.
Try ctrl+c twice to terminate and ctrl+z+Enter to exit.
Basic script

The key to running Scrapy in a Python script is the CrawlerProcess class, which lives in the scrapy.crawler module. It provides the engine to run Scrapy within a Python script; internally, CrawlerProcess imports Python's Twisted framework.
The project settings module is the standard configuration file for your Scrapy project; it's where most of your custom settings will be populated. For a standard Scrapy project, this means you'll be adding or changing the settings in the settings.py file created for your project.
I think I found the solution. The trick is to use CrawlerRunner instead of CrawlerProcess, with a single long-running reactor: the reactor is started exactly once and never restarted, so ReactorNotRestartable never comes up.

from scrapy.utils.project import get_project_settings
from scrapy.crawler import CrawlerRunner
from twisted.internet import reactor, task

timeout = 60  # seconds between runs

def run_spider():
    # stop the loop while the spider runs, so runs never overlap
    l.stop()
    runner = CrawlerRunner(get_project_settings())
    d = runner.crawl('spider_name')
    # when the crawl finishes (success or failure), restart the loop;
    # now=False means the next run happens after `timeout` seconds
    d.addBoth(lambda _: l.start(timeout, False))

l = task.LoopingCall(run_spider)
l.start(timeout)  # fires immediately, then every `timeout` seconds

reactor.run()