Scrapy request+response+download time

Tags:

scrapy

UPD: Not closing the question, because I don't think my approach is as clean as it should be.

Is it possible to get the current request + response + download time and save it to an Item?

In "plain" Python I do:

from time import time
import urllib2

start_time = time()
urllib2.urlopen('http://example.com').read()
print time() - start_time

But how can I do this with Scrapy?

UPD:

This solution is good enough for me, but I'm not sure about the quality of the results. If you have many connections that hit timeout errors, the measured download time may be wrong (even as high as DOWNLOAD_TIMEOUT * 3).

In settings.py:

DOWNLOADER_MIDDLEWARES = {
    'myscraper.middlewares.DownloadTimer': 0,
}

In middlewares.py:

from time import time
from scrapy.http import Response


class DownloadTimer(object):
    def process_request(self, request, spider):
        request.meta['__start_time'] = time()
        # returning None lets processing continue through the middlewares
        # with a higher order number than this one
        return None

    def process_response(self, request, response, spider):
        request.meta['__end_time'] = time()
        return response  # the response must be returned so it keeps flowing

    def process_exception(self, request, exception, spider):
        request.meta['__end_time'] = time()
        # build a dummy response; status 110 (ETIMEDOUT) marks the failure
        return Response(
            url=request.url,
            status=110,
            request=request)

Inside spider.py, in def parse(...):

from scrapy import log

log.msg('Download time: %.2f - %.2f = %.2f' % (
    response.meta['__end_time'], response.meta['__start_time'],
    response.meta['__end_time'] - response.meta['__start_time']
), level=log.DEBUG)
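To actually save the time to an Item, as the question asks, the two meta timestamps can be combined in the callback. A minimal sketch (a plain dict stands in for a scrapy.Item, which supports the same key access; the field names here are illustrative assumptions, while the meta keys come from the DownloadTimer middleware above):

```python
# Example timestamp values as the DownloadTimer middleware would set them.
meta = {'__start_time': 100.0, '__end_time': 100.75}

# A dict used in place of a scrapy.Item; 'url' and 'download_time'
# are hypothetical field names.
item = {
    'url': 'http://example.com',
    'download_time': meta['__end_time'] - meta['__start_time'],
}
print('Download time: %.2f' % item['download_time'])  # → Download time: 0.75
```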
asked Apr 05 '13 by b1_

People also ask

What is download delay in Scrapy?

The amount of time (in secs) that the downloader should wait before downloading consecutive pages from the same website. This can be used to throttle the crawling speed to avoid hitting servers too hard. DOWNLOAD_DELAY = 0.25 # 250 ms of delay.

What does Scrapy request return?

Scrapy uses Request and Response objects for crawling web sites. Typically, Request objects are generated in the spiders and pass across the system until they reach the Downloader, which executes the request and returns a Response object which travels back to the spider that issued the request.

How do you get cookie response from Scrapy?

self.log(cook1)
self.log("end cookie2")
return Request("http://something.net/some/sa/" + response.headers.getlist('Location')[0],
               cookies={cook1[0]: cook1[1]},
               callback=self.check_login_response)
. . .

What are Middlewares in Scrapy?

The spider middleware is a framework of hooks into Scrapy's spider processing mechanism where you can plug custom functionality to process the responses that are sent to Spiders for processing and to process the requests and items that are generated from spiders.


2 Answers

You could write a Downloader Middleware which would time each request. It would add a start time to the request before it's made and then a finish time when it's finished. Typically, arbitrary data such as this is stored in the Request.meta attribute. This timing information could later be read by your spider and added to your item.
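The idea described here can be shown with a dependency-free sketch (a stand-in class replaces scrapy.Request so the logic is runnable on its own; the meta key names are assumptions):

```python
from time import time

# Stand-in for scrapy.Request, just enough to carry a meta dict.
class FakeRequest:
    def __init__(self, url):
        self.url = url
        self.meta = {}

# Sketch of the timing middleware: stash a start timestamp on the way
# out, compute the elapsed time when the response comes back.
class TimerMiddleware:
    def process_request(self, request, spider=None):
        request.meta['__start_time'] = time()
        return None

    def process_response(self, request, response, spider=None):
        request.meta['__download_time'] = time() - request.meta['__start_time']
        return response

req = FakeRequest('http://example.com')
mw = TimerMiddleware()
mw.process_request(req)
mw.process_response(req, response=None)
print(req.meta['__download_time'] >= 0)  # → True
```

In a real project the spider would read the same meta key from response.meta in its callback and copy it onto the item.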

This downloader middleware sounds like it could be useful on many projects.

answered Sep 21 '22 by Shane Evans

Not sure you need a middleware here. Scrapy has a response.meta which you can query and yield. For the download latency, simply yield

download_latency=response.meta.get('download_latency'),

The amount of time spent to fetch the response, since the request has been started, i.e. HTTP message sent over the network. This meta key only becomes available when the response has been downloaded. While most other meta keys are used to control Scrapy behavior, this one is supposed to be read-only.
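A minimal sketch of reading that key in a callback (a plain dict stands in for response.meta, and the item field names are hypothetical):

```python
# Build an item carrying Scrapy's built-in download_latency meta key.
# meta.get() returns None if the response was never downloaded.
def build_item(meta, url):
    return {
        'url': url,
        'download_latency': meta.get('download_latency'),
    }

# Example value as Scrapy would set it once the response is downloaded.
item = build_item({'download_latency': 0.42}, 'http://example.com')
print(item['download_latency'])  # → 0.42
```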

answered Sep 20 '22 by Sam