If training a customer NER model, is there a meaningful difference between using SpanCategorizer vs. "regular" NER? #9681
Lolologist
started this conversation in
Help: Best practices
Replies: 1 comment 2 replies
-
See this comment. Basically the SpanCat is more general / gets to make fewer assumptions, so it can be less accurate. Depending on application the difference in accuracy with an NER component may be negligible or it may be more significant. |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I want to train one or more custom entities, and also get scores for those entities. I know that a/the current way to do that is by using SpanCategorizer, and that SpanCategorizer's suggesters (https://spacy.io/api/spancategorizer#suggesters) are different from however NER finds its candidate spans. Also, the documentation seems to at least heavily hint that SpanCategorizer is intended for arbitrary spans that aren't really entities per se.
Is there anything fundamentally different in how they work that would make it a bad idea/needing careful attention to train traditional entities via SpanCategorizer?
Beta Was this translation helpful? Give feedback.
All reactions