
How to scrape multiple pages with Import.io

I am trying to scrape a list of events from a site, http://www.cityoflondon.gov.uk/events/, but when scraping it with import.io I am able to extract only the first page.

How could I extract all pages at once?

Huander asked Oct 19 '22 06:10



1 Answer

You can extract data from this site with either a Crawler or Bulk Extract. The website uses a very simple form of pagination:

http://www.cityoflondon.gov.uk/events/Pages/default.aspx
http://www.cityoflondon.gov.uk/events/Pages/default.aspx?start1=13
http://www.cityoflondon.gov.uk/events/Pages/default.aspx?start1=25
http://www.cityoflondon.gov.uk/events/Pages/default.aspx?start1=37
http://www.cityoflondon.gov.uk/events/Pages/default.aspx?start1=49
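
The `start1` values in the URLs above (13, 25, 37, 49) differ by 12, so the list can be generated rather than typed by hand. The following is only a rough sketch under that assumption: the function name `build_page_urls` and the `page_size` parameter are made up for illustration, and the step of 12 is inferred from the URLs listed here, not confirmed by the site.

```python
BASE_URL = "http://www.cityoflondon.gov.uk/events/Pages/default.aspx"


def build_page_urls(num_pages, page_size=12):
    """Return the URLs for the first `num_pages` result pages.

    Assumes (from the URLs listed above) that the first page has no
    query string and each later page uses start1 = page_index * page_size + 1.
    """
    urls = [BASE_URL]  # the first page has no start1 parameter
    for page_index in range(1, num_pages):
        start = page_index * page_size + 1  # 13, 25, 37, 49, ...
        urls.append(f"{BASE_URL}?start1={start}")
    return urls


if __name__ == "__main__":
    # Print one URL per line, e.g. to copy into a list of pages to extract.
    for url in build_page_urls(5):
        print(url)
```

Calling `build_page_urls(5)` produces the same five URLs as listed above; the printed list could then be supplied to Bulk Extract. If the site changes how many events it shows per page, the `page_size` value would need adjusting.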

Here is a Data Set that I created for the above URLs that should contain all the relevant information.

319aebad-88ea-4053-a649-2087011ce041

If you have further questions about an individual website, please contact [email protected]

Thanks! Meg

Meg Ainsley answered Jan 04 '23 05:01
