Scrapy: Follow link to get additional Item data?

Tags:

I don't have a specific code issue I'm just not sure how to approach the following problem logistically with the Scrapy framework:

The structure of the data I want to scrape is typically a table row for each item. Straightforward enough, right?

Ultimately I want to scrape the Title, Due Date, and Details for each row. Title and Due Date are immediately available on the page...

BUT the Details themselves aren't in the table -- but rather, a link to the page containing the details (if that doesn't make sense here's a table):

|-------------------------------------------------| |             Title              |    Due Date    | |-------------------------------------------------| | Job Title (Clickable Link)     |    1/1/2012    | | Other Job (Link)               |    3/2/2012    | |--------------------------------|----------------|

I'm afraid I still don't know how to logistically pass the item around with callbacks and requests, even after reading through the CrawlSpider section of the Scrapy documentation.

842

asked Feb 17 '12 19:02

dru

1 Answers

Please, first read the docs to understand what i say.

The answer:

To scrape additional fields which are on other pages, in a parse method extract URL of the page with additional info, create and return from that parse method a Request object with that URL and pass already extracted data via its meta parameter.

how do i merge results from target page to current page in scrapy?

197

answered Oct 09 '22 09:10

warvariuc

Related questions
                            
                                How to use markdown in telegram? I want to send message with hyperlink
                            
                                JQuery Autocomplete Where the Results are Links
                            
                                CSS - Link not clickable when using absolute position
                            
                                Why does adding float:left to my css make my link unclickable?
                            
                                Change clickable TextView's color on focus and click?
                            
                                Create links in HTML canvas
                            
                                Create URL hyperlink in R Shiny?
                            
                                How to open a link new tab with print command?
                            
                                CSS - style a link based on its "rel" attribute?
                            
                                Linking from a web page to a specific section (anchor) in PDF document
                            
                                Obtain a link to a specific email in GMail
                            
                                Add mime type to HTML link
                            
                                How to make a section of an image a clickable link
                            
                                How to navigate to a section of a page
                            
                                How do you include hashtags within Twitter share link text?
                            
                                css link color styles best practice
                            
                                how to make a cell of table hyperlink
                            
                                Google Apps Script to open a URL
                            
                                RAILS link_to external site, url is attribute of user table, like: @users.website
                            
                                What is the RTF syntax for a hyperlink?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Scrapy: Follow link to get additional Item data?

Tags:

hyperlink

callback

scrapy

dru

People also ask

1 Answers

warvariuc

Recent Activity

Donate For Us