What free/paid search APIs allow for programmatic querying and caching/storage of the resulting data?

Tags: search, api, screen-scraping, data-mining

If you've done any serious research into search APIs, you know that most of them come with a slew of TOS/TOU restrictions that make them nearly impossible to use in anything but the most inane applications.

Bing's 2.0 API, Yahoo Search BOSS, Google Places, Google AJAX Search (dead), et al. are far too restrictive for us. I need to run a finite and relatively small number of queries (perhaps 500k) one time only, storing specific data from the results for use within our application.

For example, we need to match up business names with their websites (we have written an algorithm to make a 'best guess' from a set of results if necessary; we just need a vanilla result set). We also need to match an address to the company in question.
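To make the 'best guess' step concrete, here is a minimal sketch of one way such a matcher could look. It is not the algorithm mentioned above: the function names (`normalize`, `domain_label`, `best_guess`) and the string-similarity heuristic are assumptions chosen purely for illustration.

```python
import re
from difflib import SequenceMatcher
from urllib.parse import urlparse


def normalize(text):
    """Lowercase and drop everything except letters and digits."""
    return re.sub(r"[^a-z0-9]", "", text.lower())


def domain_label(url):
    """Return the first label of the host name, ignoring a leading 'www.'.

    This is a naive heuristic: for 'en.wikipedia.org' it yields 'en'.
    """
    host = urlparse(url).hostname or ""
    if host.startswith("www."):
        host = host[4:]
    return host.split(".")[0]


def best_guess(business_name, result_urls):
    """Pick the result URL whose domain label most resembles the name.

    Returns None when the result set is empty.
    """
    target = normalize(business_name)

    def score(url):
        return SequenceMatcher(None, target, normalize(domain_label(url))).ratio()

    return max(result_urls, key=score, default=None)
```

Given a result set for "Acme Widgets, Inc." that contains a directory listing, the company's own domain, and an encyclopedia page, this heuristic would favor the URL whose host most closely spells out the business name. A real matcher would likely also weigh page titles and the address data.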

Unfortunately, I can find ZERO search APIs that will allow us to fire off queries in a programmatic, non-user-initiated manner.

We're even quite eager to give someone cold, hard cash for access to this kind of data; Google, Bing, Yahoo, and others simply seem to not want our money (as evidenced by their TOSes)...

Any thoughts?

846

asked Aug 31 '11 23:08

rinogo

1 Answer

Take a look at Common Crawl: a freely accessible index of 5 billion web pages, their page rank, their link graphs and other metadata, hosted on Amazon's cloud (the data lives in S3 and can be processed from EC2).

http://commoncrawl.org/

Their Terms of Service (or TOU) are pretty reasonable and unrestrictive too:

http://commoncrawl.org/about/terms-of-use/
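For a rough idea of what programmatic access can look like, here is a minimal sketch that queries Common Crawl's public URL index at index.commoncrawl.org. The crawl ID `CC-MAIN-2023-50` and the helper names are only examples (the list of available crawls is published at https://index.commoncrawl.org/collinfo.json). Note that this index is keyed by URL rather than by keyword, so it is better suited to checking candidate sites than to replacing a search engine outright.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

INDEX_HOST = "https://index.commoncrawl.org"


def build_index_url(crawl_id, url_pattern):
    """Build a query URL for one crawl's index, asking for JSON output."""
    query = urlencode({"url": url_pattern, "output": "json"})
    return f"{INDEX_HOST}/{crawl_id}-index?{query}"


def parse_index_lines(body):
    """Parse the response body: one JSON object per non-empty line."""
    return [json.loads(line) for line in body.splitlines() if line.strip()]


def lookup(crawl_id, url_pattern):
    """Fetch and parse index records for a URL pattern (needs network access)."""
    with urlopen(build_index_url(crawl_id, url_pattern), timeout=30) as resp:
        return parse_index_lines(resp.read().decode("utf-8"))
```

Calling `lookup("CC-MAIN-2023-50", "example.com/*")` should return a list of dicts describing captured pages under that domain. Since the results can be stored locally, a one-time batch of lookups fits this approach, though it is worth throttling requests to the shared index server.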

187

answered Jan 04 '23 03:01

seanieb
