Skip to content

Morpological variants of Sinhala words. Extracted from FastText 300 si

License

Notifications You must be signed in to change notification settings

brainsharks-fyp17/morphdb-si

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 

Repository files navigation

morphdb-si

Morpological variants of Sinhala words.
Extacted from a FastText-300 model released by Facebook https://fasttext.cc/.
Contains a .json
Ex:

  "යටත්විජිතයන්හි": [
    "යටත්විජිතයන්",
    "යටත්විජිතයන්ගෙද",
    "යටත්විජිතයේ",
    "යටත්විජිතයට",
    "යටත්විජිතයෙහි",
    "යටත්විජිතමය",
    "යටත්විජිතය",
    "යටත්විජිතයෙන්",
    "යටත්විජිතකරණයේ",
    "යටත්විජිතයක්වූ"
  ],
  "නිවසයෙන්": [
    "නිවසවල",
    "නිවසේමය"
  ],
  "වැස්සකට": [
    "වැස්සකදි",
    "වැස්සකදී",
    "වැස්සකටත්",
    "වැස්සක",
    "වැස්සකි",
    "වැස්සක්ද",
    "වැස්සක්ව",
    "වැස්සක්ම",
    "වැස්සකුත්"
  ],

This dataset contains 312,397 keys and their morphological forms

About

Morpological variants of Sinhala words. Extracted from FastText 300 si

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published