Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Websites explicitly designed for testing Web Scraping applications [closed]

I'd like to test functionality that's based upon rvest. Are there any websites that are explicitly designed for testing Web Scaping apps, i.e. websites whose structure does not change?

like image 432
Rappster Avatar asked Dec 15 '14 09:12

Rappster


People also ask

Which website is best for web scraping?

OctoParse, Webhose.io, Common Crawl, Mozenda, Content Grabber are a few of the best web scraping tools available for free.

Can websites block web scraping?

Many websites on the web do not have any anti-scraping mechanism but some of the websites do block scrapers because they do not believe in open data access. But if you are building web scrapers for your project or a company then you must follow these 10 tips before even starting to scrape any website.

Can web scraping be used to pull data off of websites?

Web scraping refers to the process of extracting content and data from websites using software. For example, most price comparison services use web scrapers to read price information from several online stores. Another example is Google, which routinely scrapes or “crawls” the web to index websites.

How do websites stop scraping?

To prevent you from scraping their websites, companies are using various strategies. IP rate limiting, also called requests throttling, is a commonly used anti-scraping method. A good practice of web scraping is to respect the website and scrape it slowly.


1 Answers

This question is a bit off-topic really, but I'll answer anyway. I just googled for a few things, found this:

http://scraping.pro/web-scraper-test-drive/

which has its test pages here:

http://testing-ground.scraping.pro/

although I think this would make a nice project containing the test cases and the correct results in a form that could be used in any language's test framework...

I'm sure there's other things beyond the first google hit, which you should have done anyway.

like image 196
Spacedman Avatar answered Nov 15 '22 04:11

Spacedman