Release 0.14.0
- Compatible with spaCy 3.7
- The noun chunker now includes chains of flat and nmod dependencies, e.g. "maaliskuun 7. päivänä" ("on the 7th of March")
- The parser doesn't try to detect nsubj:outer, dislocated and goeswith
dependencies anymore. There's not enough training data to learn those.
- Tokenize "-kampanja" as ["-", "kampanja"]
- Tokenize "maa-" as ["maa", "-"]
- Tokenize "/kk" as ["/", "kk"]
- Other tokenizer improvements
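
The three tokenization examples above can be illustrated with a small standalone sketch. This is not the project's tokenizer code: the regular expressions and the function name `split_affix_punct` are hypothetical, chosen only to reproduce the splits listed in these notes (a leading "-" or "/" and a trailing "-" become separate tokens, while a word-internal hyphen is left alone).

```python
import re

# Hypothetical patterns for illustration only; they are not taken from
# the project's actual tokenizer rules.
LEADING = re.compile(r"^([-/])(?=\w)")   # "-" or "/" directly before a word
TRAILING = re.compile(r"(?<=\w)(-)$")    # "-" directly after a word


def split_affix_punct(token: str) -> list[str]:
    """Split a leading "-" or "/" and a trailing "-" off a token."""
    parts = []
    match = LEADING.match(token)
    if match:
        parts.append(match.group(1))
        token = token[match.end():]
    match = TRAILING.search(token)
    if match:
        parts.extend([token[:match.start()], match.group(1)])
    else:
        parts.append(token)
    return parts


for example in ["-kampanja", "maa-", "/kk", "vaali-ilta"]:
    print(example, split_affix_punct(example))
```

In this sketch a hyphen inside a word, as in "vaali-ilta", stays in one token; whether the real tokenizer treats every such case the same way is not stated in these notes.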
Evaluation scores:
TAG 96.62
POS 96.45
MORPH 92.26
LEMMA 94.01
UAS 87.14
LAS 82.90
NER P 83.04
NER R 81.56
NER F 82.29
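
As a sanity check on the NER figures, the F score can be recomputed from the reported precision and recall. The sketch below assumes that NER F is the balanced F1 score, i.e. the harmonic mean of P and R; the helper name `f_score` is hypothetical. An evaluation tool would normally compute F from raw counts, so recomputing it from rounded percentages could differ in the last digit.

```python
def f_score(precision: float, recall: float) -> float:
    """Balanced F1: harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values copied from the evaluation scores above (percentages).
ner_f = f_score(83.04, 81.56)
print(f"NER F {ner_f:.2f}")  # prints "NER F 82.29"
```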