I could not understand the difference between two ways of indexing: ANALYZED
and ANALYZED_NO_NORMS
. I read the Lucene Javadoc but did not understand the difference.
Can someone tell me more about NORMS? What are the benefits or limitations that they bring to indexing?
ANALYZED
Index the tokens produced by running the field's value through an Analyzer. This is useful for common text. An analyzer might be something like a Snowball Stemmer Analyzer:
ANALYZED_NO_NORMS
Uses an analyzer, however it doesn't create norms for fields.
Norms are created for quick scoring of documents at query time. These norms are usually all loaded into memory so that when you run a query analyzer over an index it can quickly score the search results.
No norms means that index-time field and document boosting and field length normalization are disabled. The benefit is less memory usage as norms take up one byte of RAM per indexed field for every document in the index, during searching.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With