Scrapy get request url in parse

Tags:

How can I get the request url in Scrapy's parse() function? I have a lot of urls in start_urls and some of them redirect my spider to homepage and as result I have an empty item. So I need something like item['start_url'] = request.url to store these urls. I'm using the BaseSpider.

484

asked Nov 19 '13 20:11

Goran

2 Answers

The 'response' variable that's passed to parse() has the info you want. You shouldn't need to override anything.

eg. (EDITED)

def parse(self, response):     print "URL: " + response.request.url

128

answered Sep 24 '22 00:09

Jagu

The request object is accessible from the response object, therefore you can do the following:

def parse(self, response):     item['start_url'] = response.request.url

answered Sep 23 '22 00:09

gusridd

Related questions
                            
                                Python OrderedDict not keeping element order [duplicate]
                            
                                pip install gives error: Unable to find vcvarsall.bat
                            
                                How to extract dictionary single key-value pair in variables
                            
                                Installing h5py on an Ubuntu server
                            
                                matplotlib: RuntimeError: Python is not installed as a framework
                            
                                Create 3D array using Python
                            
                                Why is the size of 2⁶³ 36 bytes, but 2⁶³-1 is only 24 bytes?
                            
                                Convert XLSX to CSV correctly using python [closed]
                            
                                When calling super() in a derived class, can I pass in self.__class__? [duplicate]
                            
                                How to perform arithmetic operation on a date in Python?
                            
                                Append an empty row in dataframe using pandas
                            
                                global variable warning in python [duplicate]
                            
                                How do I restart airflow webserver?
                            
                                ImportError: Could not import the Python Imaging Library (PIL) required to load image files on tensorflow
                            
                                Use a loop to plot n charts Python
                            
                                Python: how to capture image from webcam on click using OpenCV
                            
                                Unknown error: Chrome failed to start: exited abnormally
                            
                                Python 2.7 installing opencv via pip (virtual environment)
                            
                                In python selenium, how does one find the visibility of an element?
                            
                                Spell Checker for Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Scrapy get request url in parse

Tags:

python-2.7

scrapy

scrapyd

Goran

People also ask

2 Answers

Jagu

gusridd

Recent Activity

Donate For Us