
New posts in information-retrieval

What is the 11pt average precision metric?

Kindly review the Python code to boost its performance

How should I save BM25Okapi object value to file?

Search for (Very) Approximate Substrings in a Large Database

Problem understanding chi-squared feature selection

Dynamic regex for date-time formats

Is NDCG (normalized discounted cumulative gain) flawed? I have calculated a few alternative ranking quality measures, and I can't make heads or tails of it

Which database can be used to store processed data from an NLP engine?

How to create more complex Lucene query strings?

Storing an inverted index

Precision at k when fewer than k documents are retrieved

Compute word n-grams on original text or after lemma/stemming process?

How does a permuterm index work?

Understanding Recall and Precision

Calculating IDF (Inverse Document Frequency) for document categorization

What is the correct version of Average precision?

How to detect duplicates among text documents and return the duplicates' similarity?

Computing similarity between two lists

What is the default list of stopwords used in Lucene's StopFilter?

How to calculate TF*IDF for a single new document to be classified?