How can I collect stats from within a spider callback?
Example
from scrapy import Spider

class MySpider(Spider):
    name = "myspider"
    start_urls = ["http://example.com"]

    def parse(self, response):
        stats.set_value('foo', 'bar')  # NameError: 'stats' is not defined here
Not sure what to import, or how to make stats available in general.
The stats can be accessed through the spider_stats attribute, which is a dict keyed by spider name. This is the behaviour of MemoryStatsCollector, the default Stats Collector used in Scrapy.
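A minimal sketch of reading those stats after a run, assuming the default MemoryStatsCollector and the MySpider class from the question:

from scrapy.crawler import CrawlerProcess

process = CrawlerProcess()
crawler = process.create_crawler(MySpider)
process.crawl(crawler)
process.start()  # blocks until the crawl finishes

# spider_stats is specific to MemoryStatsCollector: the stats of every
# closed spider, keyed by spider name
print(crawler.stats.spider_stats["myspider"])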
Using FormRequest. You can use the FormRequest.from_response() method for this job. Here's the start of an example spider which uses it:

import scrapy

def authentication_failed(response):
    # TODO: Check the contents of the response and return True if it failed
    # or False if it succeeded.
    ...
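A minimal sketch of how such a spider typically continues, in the style of the Scrapy docs' login example; the URL, the username/password form field names, and the after_login callback are all placeholders:

class LoginSpider(scrapy.Spider):
    name = "login_example"
    start_urls = ["http://www.example.com/users/login.php"]

    def parse(self, response):
        # from_response() pre-fills the form found on the login page
        return scrapy.FormRequest.from_response(
            response,
            formdata={"username": "john", "password": "secret"},
            callback=self.after_login,
        )

    def after_login(self, response):
        if authentication_failed(response):
            self.logger.error("Login failed")
            return
        # continue scraping with the authenticated session ...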
The callback of a request is a function that will be called when the response of that request is downloaded. The callback function will be called with the downloaded Response object as its first argument. Example:

def parse_page1(self, response):
    return scrapy.Request("http://www.example.com/some_page.html",
                          callback=self.parse_page2)
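For completeness, a minimal sketch of the matching callback, defined on the same spider; the log message is illustrative:

def parse_page2(self, response):
    # this runs once the request above has been downloaded
    self.logger.info("Visited %s", response.url)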
start_urls contains the URLs from which the spider starts crawling. If you want to crawl recursively, you should use CrawlSpider and define rules for it, as in the sketch below.
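A minimal sketch of a recursive crawl, assuming a hypothetical example.com site and an /articles/ URL pattern; parse_item is a placeholder callback:

from scrapy.spiders import CrawlSpider, Rule
from scrapy.linkextractors import LinkExtractor

class RecursiveSpider(CrawlSpider):
    name = "recursive"
    allowed_domains = ["example.com"]
    start_urls = ["http://example.com"]

    rules = (
        # follow every link matching the pattern and pass each page to parse_item
        Rule(LinkExtractor(allow=r"/articles/"), callback="parse_item", follow=True),
    )

    def parse_item(self, response):
        yield {"url": response.url, "title": response.css("title::text").get()}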
Check out the stats page from the Scrapy documentation. The documentation states that the Stats Collector is always available, but it may be necessary to add

from scrapy.stats import stats

to your spider code to be able to do stuff with it.
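If that import works in your (older) Scrapy version, usage would look like this; note this is the legacy module-level API, superseded by the approach in EDIT2 below:

from scrapy.stats import stats

stats.set_value('foo', 'bar')
stats.inc_value('failed_url_count')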
EDIT: At the risk of blowing my own trumpet, if you were after a concrete example, I posted an answer about how to collect failed URLs.
EDIT2: After a lot of googling, apparently no imports are necessary. Just use self.crawler.stats.set_value(), as in the sketch below.
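A minimal sketch of the question's spider using that approach; the stat names are illustrative:

import scrapy

class MySpider(scrapy.Spider):
    name = "myspider"
    start_urls = ["http://example.com"]

    def parse(self, response):
        # self.crawler.stats is the running Stats Collector; nothing to import
        self.crawler.stats.set_value('foo', 'bar')
        self.crawler.stats.inc_value('pages_crawled')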