
Performance comparison between Zend Lucene and Java Lucene

Zend Lucene and Java Lucene are written in PHP and Java respectively, and PHP is a higher-level language than Java.

I'm wondering how big the performance difference between the two is, with regard to index building and searching.

Would it be much more efficient to let Java create and rebuild the index, and let PHP only search it?

Capitaine asked May 23 '10 16:05



2 Answers

This is a quote from a Zend Certified Engineer.

Against my better judgment, the company I work for migrated our previous search solution to Zend_Search_Lucene. On pretty heavy-duty hardware, indexing a million documents took several hours, and searches were relatively slow. The indexing process consumed vast amounts of memory, and the indexes frequently became corrupted (using 1.5.2). A single wild card search literally brought the web server to its knees, so we disabled that feature. Memory usage was very high for searches, and as a result requests per second necessarily declined heavily as we had to reduce the number of Apache child processes.

We have since moved to Solr (a Lucene-based Java search server) and the difference is dramatic. Indexing now takes around 10 minutes and searches are lightning fast. What a difference a language makes.

Original Article

In this case, Java.
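If the index moves to Solr as in the quote, a PHP front end (or any other client) would reach it over HTTP instead of reading Lucene index files directly. A minimal sketch of building such a search request URL follows. The host, port, core name `articles`, and field `title` are assumptions for illustration, not details from the quoted article, and the `/solr/<core>/select` path reflects current Solr layouts and may differ on older installations.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

public class SolrQueryUrl {

    // Builds the URL for a Solr "select" request.
    // baseUrl and core are hypothetical deployment details.
    static String buildSelectUrl(String baseUrl, String core, String query, int rows) {
        // URL-encode the query so characters such as ':' and spaces are safe.
        String q = URLEncoder.encode(query, StandardCharsets.UTF_8);
        return baseUrl + "/solr/" + core + "/select?q=" + q
                + "&rows=" + rows + "&wt=json";
    }

    public static void main(String[] args) {
        // Assumed local Solr instance with a core named "articles".
        System.out.println(
                buildSelectUrl("http://localhost:8983", "articles", "title:lucene", 10));
    }
}
```

The same URL can be fetched from PHP with any HTTP client, which keeps indexing and searching on the Java side.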

Jarrod Nettles answered Oct 07 '22 12:10



When I asked this question of a Zend Evangelist and one of their hired guns, I was told that Java Lucene would provide substantially better performance. Their main points dealt with the handling of UTF-8 characters and the speed of indexing (if I remember correctly). Apparently Java Lucene is far better at both.

Does Zend Lucene need Java Lucene?

allnightgrocery answered Oct 07 '22 12:10
