Stop Scrapy after N items scraped

Tags:

1 Answers

My problem was trying to apply close spider in the wrong place. It's a variable that needs to be set in the settings.py file. When I set it manually in there, or set it as a argument in the command line, it worked (Stopping within 10-20 of N for what it's worth).

settings.py:

BOT_NAME = 'internal_links'
SPIDER_MODULES = ['internal_links.spiders']
NEWSPIDER_MODULE = 'internal_links.spiders'
CLOSESPIDER_PAGECOUNT = 1000
ITEM_PIPELINES = ['internal_links.pipelines.CsvWriterPipeline']
# Crawl responsibly by identifying yourself (and your website) on the user-agent
USER_AGENT = 'yo mama'
LOG_LEVEL = 'DEBUG'

answered Sep 27 '22 23:09

Josh Usre

Related questions
                            
                                Plotting a function of three variables in python
                            
                                python Time a try except
                            
                                Why can't you reference modules that appear to be automatically loaded by the interpreter without an additional `import` statement?
                            
                                How do I plot GFS grib2 data with Python?
                            
                                How to use MySQL view in django?
                            
                                Insert xml node in specific location
                            
                                How to create polygons with arcs in shapely (or a better library)
                            
                                Flask-login current_user returns only ID
                            
                                PIL - The submitted file is empty Test Case
                            
                                How to move a Python script from one subpackage to another directory/package, maintaining backwards compatibility
                            
                                Print custom string upon unittest failure
                            
                                Python timezone offset wrong? [duplicate]
                            
                                Import SAS data file into python data frame
                            
                                Remove all characters which cannot be decoded in Python
                            
                                python BeautifulSoup find all input for specific form
                            
                                Python - getting duration of a video with ffprobe
                            
                                Installing Modules for SPARK on worker nodes
                            
                                What is func_dict?
                            
                                Enumerate sentences in python
                            
                                Abstract base class is not enforcing function implementation

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Stop Scrapy after N items scraped

Tags:

python

scrapy

Josh Usre

People also ask

1 Answers

Josh Usre

Recent Activity

Donate For Us