Skip to content

Release 0.14.0

Compare
Choose a tag to compare
@aajanki aajanki released this 14 Oct 12:00
· 32 commits to master since this release
  • Compatible with spaCy 3.7
  • The noun chunker includes chains of flats and nmods: e.g. "maaliskuun 7. päivänä"
  • The parser doesn't try to detect nsubj:outer, dislocated and goeswith
    dependencies anymore. There's not enough training data to learn those.
  • Tokenize "-kampanja" as ["-", "kampanja"]
  • Tokenize "maa-" as ["maa", "-"]
  • Tokenize "/kk" as ["/", "kk"]
  • Other tokenizer improvements

Evaluation scores:

TAG 96.62
POS 96.45
MORPH 92.26
LEMMA 94.01
UAS 87.14
LAS 82.90
NER P 83.04
NER R 81.56
NER F 82.29