Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Nutch versus Solr

Tags:

solr

nutch

Currently collecting information where I should use Nutch with Solr (domain - vertical web search).

Could you suggest me?

like image 509
Jeriho Avatar asked May 12 '10 11:05

Jeriho


1 Answers

Nutch is a framework to build web crawler and search engines. Nutch can do the whole process from collecting the web pages to building the inverted index. It can also push those indexes to Solr.

Solr is mainly a search engine with support for faceted searches and many other neat features. But Solr doesn't fetch the data, you have to feed it.

So maybe the first thing you have to ask in order to choose between the two is whether or not you have the data to be indexed already available (in XML, in a CMS or a database.). In that case, you should probably just use Solr and feed it that data. On the other hand, if you have to fetch the data from the web, you are probably better of with Nutch.

like image 182
Pascal Dimassimo Avatar answered Nov 16 '22 03:11

Pascal Dimassimo