Is it possible to iterate through documents stored in Lucene Index?

I have some documents stored in a Lucene index, each with a docId field. I want to get all the docIds stored in the index. There is one complication: the index holds about 300,000 documents, so I would prefer to fetch the docIds in chunks of 500. Is it possible to do so?

694

asked Feb 22 '10 15:02

Eugeniu Torica

2 Answers

    IndexReader reader = // create IndexReader
    for (int i=0; i<reader.maxDoc(); i++) {
        if (reader.isDeleted(i))
            continue;

        Document doc = reader.document(i);
        String docId = doc.get("docId");

        // do something with docId here...
    }

136

answered Sep 22 '22 07:09

bajafresh4life

Lucene 4

    Bits liveDocs = MultiFields.getLiveDocs(reader);
    for (int i=0; i<reader.maxDoc(); i++) {
        if (liveDocs != null && !liveDocs.get(i))
            continue;

        Document doc = reader.document(i);
    }

See LUCENE-2600 on this page for details: https://lucene.apache.org/core/4_0_0/MIGRATE.html
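Neither loop above deals with the chunks of 500 mentioned in the question. Here is a minimal sketch of one way to batch the docIds while scanning document numbers 0 to maxDoc-1. It is plain Java so that it runs without Lucene on the classpath: `DocIdChunker` and `forEachChunk` are hypothetical names, not Lucene API, and the two callbacks are assumed stand-ins for the deleted-document check and for `reader.document(i).get("docId")`.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;
import java.util.function.IntFunction;
import java.util.function.IntPredicate;

// Hypothetical helper (not part of Lucene): groups the docIds found during a
// 0..maxDoc-1 scan into fixed-size chunks and hands each chunk to a callback.
final class DocIdChunker {

    static void forEachChunk(int maxDoc,
                             IntPredicate isLive,            // stand-in for the deleted-document check
                             IntFunction<String> loadDocId,  // stand-in for reading the docId field
                             int chunkSize,
                             Consumer<List<String>> handler) {
        if (chunkSize <= 0) {
            throw new IllegalArgumentException("chunkSize must be positive");
        }
        List<String> chunk = new ArrayList<>(chunkSize);
        for (int i = 0; i < maxDoc; i++) {
            if (!isLive.test(i)) {
                continue; // skip deleted documents, as in the loops above
            }
            chunk.add(loadDocId.apply(i));
            if (chunk.size() == chunkSize) {
                handler.accept(chunk);
                // Start a fresh list so the handler may keep the previous one.
                chunk = new ArrayList<>(chunkSize);
            }
        }
        if (!chunk.isEmpty()) {
            handler.accept(chunk); // final chunk, possibly shorter than chunkSize
        }
    }

    public static void main(String[] args) {
        // Simulated index: 7 document slots, slot 3 marked as deleted.
        forEachChunk(7, i -> i != 3, i -> "doc-" + i, 3,
                chunk -> System.out.println(chunk));
    }
}
```

Against a real index the call would pass `reader.maxDoc()`, a predicate built from `reader.isDeleted(i)` or `liveDocs.get(i)`, and a chunk size of 500. Note that `reader.document(i)` throws a checked `IOException`, so the lambda that wraps it has to catch or rethrow it.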

answered Sep 22 '22 07:09

bcoughlan

Tags: lucene, lucene.net
