
Amazon Scraping returns 503

I am using BeautifulSoup and a UserAgent when making requests. Amazon is blocking my calls even though I added a sleep between requests to avoid this. Is there any way to deal with it? I know there's an API available, but I doubt it would give me what I am looking for.

What I want is to get product details based on an ASIN, with all price offers from the different sellers that use the Amazon Prime shipping option; an example URL is given here.
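For reference, the kind of setup described above can be sketched roughly as follows. This is not the asker's actual code: it assumes the standard library's `urllib` rather than BeautifulSoup, and the `USER_AGENT` string, `DELAY_SECONDS` value, and the `fetch` / `fetch_all` names are hypothetical. The sketch only shows where the 503 surfaces and stops at the first one instead of retrying.

```python
import time
import urllib.error
import urllib.request

# Hypothetical values for illustration; neither appears in the original post.
USER_AGENT = "Mozilla/5.0 (X11; Linux x86_64)"
DELAY_SECONDS = 5.0


def fetch(url, opener=urllib.request.urlopen):
    """Return (status_code, body_bytes); the body is b"" on an HTTP error status."""
    request = urllib.request.Request(url, headers={"User-Agent": USER_AGENT})
    try:
        with opener(request) as response:
            return response.status, response.read()
    except urllib.error.HTTPError as err:
        # urlopen raises HTTPError for 4xx/5xx responses, including 503.
        return err.code, b""


def fetch_all(urls, opener=urllib.request.urlopen, sleep=time.sleep):
    """Fetch each URL with a pause in between, stopping at the first 503."""
    statuses = {}
    for index, url in enumerate(urls):
        if index:
            sleep(DELAY_SECONDS)  # pause before every request except the first
        status, _body = fetch(url, opener)
        statuses[url] = status
        if status == 503:
            break  # the server is refusing service; stop instead of retrying
    return statuses
```

The `opener` and `sleep` parameters exist only so the functions can be exercised without network access; called with their defaults, `fetch_all` performs real requests.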

Volatil3 asked Jan 30 '14 06:01



1 Answer

Is there any way to deal with it?

Yes... you comply with their acceptable use policy.

If it's not available from an API, you're not authorized to scrape it.

Even if you successfully scrape it, you're still not authorized to use it.

This license does not include any resale or commercial use of any Amazon Service, or its contents; any collection and use of any product listings, descriptions, or prices; any derivative use of any Amazon Service or its contents; any downloading, copying, or other use of account information for the benefit of any third party; or any use of data mining, robots, or similar data gathering and extraction tools.

https://www.amazon.com/gp/help/customer/display.html/ref=ap_frn_condition_of_use?ie=UTF8&nodeId=508088

Michael - sqlbot answered Oct 23 '22 19:10