Best practice for training and using custom NER model out of spacy blank? #11507
Replies: 2 comments
-
I think you may have a training data issue. The model is only ever given one word observations that have the whole token as the entity. The data you are evaluating the model on is different in that it has more than one word. The model may even be learning to just say the whole input is the entity as doing that will always be correct in the training process. |
Beta Was this translation helpful? Give feedback.
-
As mentioned, your training data is not usable. Training data should be like the input data you expect - in NER that would normally be complete sentences. NER models learn not only from labelled terms, but also from unlabelled data - they need both in order to see what to label or not label. |
Beta Was this translation helpful? Give feedback.
-
I am training my spacy blank "en" model with the following kind of annotations: one word and label:
I am using the following config file:
But when I am loading the model and trying to find the entities given a sentence:
I see that the
ents = 'restart pc'
and doesn't split to each word and attaches the labels it learned from.What I am missing here when building my own custom NER model?
Beta Was this translation helpful? Give feedback.
All reactions