Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to incorporate multiple fields in QueryParser?

Dim qp1 As New QueryParser("filename", New StandardAnalyzer()) Dim qp2 As New QueryParser("filetext", New StandardAnalyzer()) . . 

I am using the 'Lucene.Net' library and have the following question.

Instead of creating two separate QueryParser objects and using them to obtain two Hits objects, is it possible perform a search on both fields using a single QueryParser object, so that I have only one Hits object which gives me the overall score of each Document?

like image 717
user57175 Avatar asked Jan 22 '09 08:01

user57175


2 Answers

There are 3 ways to do this.

The first way is to construct a query manually, this is what QueryParser is doing internally. This is the most powerful way to do it, and means that you don't have to parse the user input if you want to prevent access to some of the more exotic features of QueryParser:

IndexReader reader = IndexReader.Open("<lucene dir>"); Searcher searcher = new IndexSearcher(reader);  BooleanQuery booleanQuery = new BooleanQuery(); Query query1 = new TermQuery(new Term("filename", "<text>")); Query query2 = new TermQuery(new Term("filetext", "<text>")); booleanQuery.add(query1, BooleanClause.Occur.SHOULD); booleanQuery.add(query2, BooleanClause.Occur.SHOULD); // Use BooleanClause.Occur.MUST instead of BooleanClause.Occur.SHOULD // for AND queries Hits hits = searcher.Search(booleanQuery); 

The second way is to use MultiFieldQueryParser, this behaves like QueryParser, allowing access to all the power that it has, except that it will search over multiple fields.

IndexReader reader = IndexReader.Open("<lucene dir>"); Searcher searcher = new IndexSearcher(reader);  Analyzer analyzer = new StandardAnalyzer(); MultiFieldQueryParser queryParser = new MultiFieldQueryParser(                                         new string[] {"filename", "filetext"},                                         analyzer);  Hits hits = searcher.Search(queryParser.parse("<text>")); 

The final way is to use the special syntax of QueryParser see here.

IndexReader reader = IndexReader.Open("<lucene dir>"); Searcher searcher = new IndexSearcher(reader);      Analyzer analyzer = new StandardAnalyzer(); QueryParser queryParser = new QueryParser("<default field>", analyzer); // <default field> is the field that QueryParser will search if you don't  // prefix it with a field. string special = "filename:" + text + " OR filetext:" + text;  Hits hits = searcher.Search(queryParser.parse(special)); 

Your other option is to create new field when you index your content called filenameandtext, into which you can place the contents of both filename and filetext, then you only have to search one field.

like image 142
Sam Doshi Avatar answered Oct 17 '22 05:10

Sam Doshi


Just build a query string with each term:

"filename:searchText OR filetext:searchText" 

It doesn't matter what you pass as the initial field in QueryParser's constructor. Just make sure you call .Parse() on your query string to get a Query object back to execute.

If you want to use an "and" search:

"+filename:searchText +filetext:searchText" 
like image 42
Bob King Avatar answered Oct 17 '22 06:10

Bob King