Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

is there is any stemmer available for indian language [closed]

is there is any implementation of stemmers for indian languages like(hindi,telugu) are available ....

like image 615
rajesh Avatar asked Oct 24 '10 08:10

rajesh


People also ask

Which libraries are useful for processing Indian languages?

“The Indic NLP Library is built to support most of the common text processing and NLP capabilities for Indian languages. Indian languages share a commonality in terms of script, phonology, language syntax, etc.

What is IndicNLP?

The goal of the Indic NLP Library is to build Python based libraries for common text processing and Natural Language Processing in Indian languages. Indian languages share a lot of similarity in terms of script, phonology, language syntax, etc.

What is Indic NLP?

The IndicNLP corpus is a large-scale, general-domain corpus containing 2.7 billion words for 10 Indian languages from two language families. Source: [https://arxiv.org/abs/2005.00085](https://arxiv.org/abs/2005.00085) AI4Bharat-IndicNLP Corpus: Monolingual Corpora and Word Embeddings for Indic Languages. ---


2 Answers

Hindi Analyzer, with stemmer, is available in Lucene. It is based on this algorithm(pdf).

like image 75
Shashikant Kore Avatar answered Oct 03 '22 06:10

Shashikant Kore


hindi_stemmer is a Python implementation of the Hindi stemmer described in "A Lightweight Stemmer for Hindi" by Ananthakrishnan Ramanathan and Durgesh D Rao.

like image 45
Luís Gomes Avatar answered Oct 03 '22 08:10

Luís Gomes