Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in information-retrieval

How to create more complex Lucene query strings?

Storing an inverted index

Precision at k when fewer than k documents are retrieved

information-retrieval

Compute word n-grams on original text or after lemma/stemming process?

How does a permuterm index works?

information-retrieval

Understanding Recall and Precision

Caluculating IDF(Inverse Document Frequency) for document categorization

What is the correct version of Average precision?

How to detect duplicates among text documents and return the duplicates' similarity?

How to use MultiFieldQueryParser from Lucene?

How to perform a faceted search?

Text summarization: how to choose the right n-gram size

Calculating IDF (as in TF-IDF) when testing?

Online clustering of news articles

What's the difference: ConcurrentUpdateSolrServer vs HttpSolrServer vs CommonsHttpSolrServer?

Efficiently extract WikiData entities from text

unsupervised Named entity recognition (NER) with custom controlled vocabulary for crosslink-suggestions in Java

Algorithm for search in inverted index

Representation and a good similarity measure between Tweets for topic detection