Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Semantic analysis using Solr

I'm considering about adding semantic analysis to my Solr installation, but I don't exactly know where to start.

Basically, I'd like Solr to be able to find "similar" words (taken from the body of the indexed documents). For example, if I search for "music", I should be able to query the semantic engine and obtain "rock", "pop", etc. (of course if these words appeared near to music in some of the indexed documents).

I found this project, but I don't know if it is the correct place to start: http://code.google.com/p/semanticvectors/

like image 683
Enrico Detoma Avatar asked Jul 21 '09 08:07

Enrico Detoma


2 Answers

Semantic indexing is a good place to start. However, in my experience, these kind of technologies don't work that well in practice. You often end up with very bizarre results. Also, because of Google, people have a certain expectation of how keyword search should behave - i.e. your search term should appear in the matching document.

like image 110
bajafresh4life Avatar answered Oct 16 '22 00:10

bajafresh4life


You may use the Lucene Wordnet contrib package to look for synonyms.

Optimizing Findability in Lucene and Solr gives other ways to expand queries.

like image 41
Yuval F Avatar answered Oct 15 '22 23:10

Yuval F