How to access scrapy stats from a pipeline

Question

From the Scrapy API I know that a Crawler object has a stats attribute, but how can I access it from a custom pipeline?

class MyPipeline(object):

    def __init__(self): 
        self.stats = ???

jbahamon · Accepted Answer

Your pipeline can get at the stats the same way an extension does. If the class defines a from_crawler(cls, crawler) class method, Scrapy calls it to build the pipeline and passes in the Crawler object, whose stats attribute is the stats collector.

All in all, you should do something like

class MyPipeline(object):

    def __init__(self, stats):
        self.stats = stats

    @classmethod
    def from_crawler(cls, crawler):
        # Scrapy calls this to build the pipeline and passes in the Crawler.
        return cls(crawler.stats)

http://scrapy.readthedocs.org/en/latest/topics/stats.html#topics-stats
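
To show how the stored stats object might then be used, here is a minimal sketch of a complete pipeline built on the from_crawler approach. The stat key "my_pipeline/items_seen" and the log message are made up for illustration; inc_value and get_value are methods of Scrapy's stats collector, and process_item and close_spider are the usual pipeline hooks.

```python
class MyPipeline:
    """Item pipeline that keeps a reference to the crawler's stats collector."""

    def __init__(self, stats):
        self.stats = stats

    @classmethod
    def from_crawler(cls, crawler):
        # Scrapy calls this hook (when it is defined) to build the pipeline.
        return cls(crawler.stats)

    def process_item(self, item, spider):
        # "my_pipeline/items_seen" is a hypothetical key chosen for this sketch.
        self.stats.inc_value("my_pipeline/items_seen")
        return item

    def close_spider(self, spider):
        # Read the counter back when the spider finishes; default to 0 if unset.
        seen = self.stats.get_value("my_pipeline/items_seen", 0)
        spider.logger.info("MyPipeline saw %d items", seen)
```

The pipeline still has to be enabled in the project's ITEM_PIPELINES setting like any other pipeline; nothing extra is needed for from_crawler to be picked up.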

slavugan · Answer

The stats are also available through spider.crawler, for example (v1.1.0):

class ObjPipeline(object):
    def process_item(self, item, spider):
        spider.crawler.stats.inc_value('scraped_items')
        ...
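
A slightly fuller sketch of the same idea, in case it helps: the pipeline below needs no __init__ or from_crawler and reaches the stats through the spider on every call. It assumes dict-like items; the field name "price" and the key "items_missing/price" are hypothetical, chosen only for the example.

```python
class MissingFieldPipeline:
    """Counts scraped items and items that lack a required field."""

    REQUIRED_FIELD = "price"  # hypothetical field name for this sketch

    def process_item(self, item, spider):
        # The spider carries a reference to its crawler, and so to the stats.
        stats = spider.crawler.stats
        stats.inc_value("scraped_items")
        if not item.get(self.REQUIRED_FIELD):
            # Hypothetical key; any string can serve as a stat name.
            stats.inc_value("items_missing/price")
        return item
```

Compared with from_crawler, this saves some boilerplate, at the cost of relying on the spider having its crawler attribute set, which is the case when Scrapy itself creates the spider.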

Tags: python, scrapy

gusridd
