is there is any implementation of stemmers for indian languages like(hindi,telugu) are available ....
“The Indic NLP Library is built to support most of the common text processing and NLP capabilities for Indian languages. Indian languages share a commonality in terms of script, phonology, language syntax, etc.
The goal of the Indic NLP Library is to build Python based libraries for common text processing and Natural Language Processing in Indian languages. Indian languages share a lot of similarity in terms of script, phonology, language syntax, etc.
The IndicNLP corpus is a large-scale, general-domain corpus containing 2.7 billion words for 10 Indian languages from two language families. Source: [https://arxiv.org/abs/2005.00085](https://arxiv.org/abs/2005.00085) AI4Bharat-IndicNLP Corpus: Monolingual Corpora and Word Embeddings for Indic Languages. ---
Hindi Analyzer, with stemmer, is available in Lucene. It is based on this algorithm(pdf).
hindi_stemmer is a Python implementation of the Hindi stemmer described in "A Lightweight Stemmer for Hindi" by Ananthakrishnan Ramanathan and Durgesh D Rao.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With