
New posts in information-retrieval

How does a permuterm index work?

Understanding Recall and Precision

Calculating IDF (Inverse Document Frequency) for document categorization

What is the correct version of Average precision?

How to detect duplicates among text documents and return the duplicates' similarity?

How to use MultiFieldQueryParser from Lucene?

How to perform a faceted search?

Text summarization: how to choose the right n-gram size

Calculating IDF (as in TF-IDF) when testing?

Online clustering of news articles

What's the difference: ConcurrentUpdateSolrServer vs HttpSolrServer vs CommonsHttpSolrServer?

Efficiently extract WikiData entities from text

Unsupervised named entity recognition (NER) with custom controlled vocabulary for crosslink suggestions in Java

Algorithm for search in inverted index

Representation and a good similarity measure between Tweets for topic detection

Fast in-memory inverted index

Constructing a tree using Python

Boost fresh documents with Lucene

Is it OK if the false positive rate in a ROC curve does not end in 1.0?