An unsupervised stemmer for Natural Language Processing Tasks on Hinglish Language ( Hindi + English words )
- Gensim
- NLTK
Usage:
import stemmer
myStemmer = stemmer.Stemmer()
output = myStemmer.stemWord("ladkaa")
output : 'ladka'
output = myStemmer.stemListOfWords(["ladkii", "ladkaaaa", "firaaangii"])
output: ['ladki', 'ladka', 'firangi']
output = myStemmer.stem2dListOfWords([["merii","merraa"], ["terii", "terraaa", "aaajjjaa"]])
output: [['meri', 'mera'], ['teri', 'tera', 'aja']]
Ashish Gupta Github: www.github.com/ashishgupta1350 You are free to use and distribute this in anyway you like.