Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some words return with "2" suffix #77

Open
ghost opened this issue Mar 28, 2016 · 1 comment
Open

Some words return with "2" suffix #77

ghost opened this issue Mar 28, 2016 · 1 comment

Comments

@ghost
Copy link

ghost commented Mar 28, 2016

I just came across this:

original word: kağıttan
nuve result: kâğıt2/ISIM IC_HAL_AYRILMA_DAn

What is that "2" stands for?

Also is it possible prevent that diacritical letters (â)?

@hrzafer
Copy link
Collaborator

hrzafer commented Mar 28, 2016

kağıt and kâğıt are two possible forms for the root. So nüve can recognize both kağıttan and kâğıttan. To be able to recognize a word's both diacritical form and regular form. Two roots are defined. 2 stands for this. For a while I'm considering another solution to get rid of both 2 the â in the root. I'm leaving this issue open to take an action about this as soon as I have some time.

Meanwhile, you can use regex.replace to get rid of those. Or you can edit those lines in the root.txt as follows and compile the project.

kağıt kağıd kâğıt2 ISIM isim, şapkasız YUMUSAMA_td, SAPKA_DUSUR_A

Just replace kâğıt2 with kağıt and remove the SAPKA_DUSUR_A rule.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant