This repository provides a partial part-of-speech (POS) tagged lexicon for Kurdish (Sorani). The dataset is updated progressively.
For more detail, please refer to the following article, and cite it when you use this dataset:
Harvard:
Hassani, H., 2022. Part of Speech Tagging (POST) of a Low-resource Language using another Language (Developing a POS-Tagged Lexicon for Kurdish (Sorani) using a Tagged Persian (Farsi) Corpus). arXiv preprint arXiv:2201.12793.
BibTeX:
@article{hassani2022post,
title={{Part of Speech Tagging (POST) of a Low-resource Language using another Language (Developing a POS-Tagged Lexicon for Kurdish (Sorani) using a Tagged Persian (Farsi) Corpus)}},
author={Hossein Hassani},
year={2022},
eprint={2201.12793},
archivePrefix={arXiv},
primaryClass={cs.CL}
}