What is the fastest way to update the NER model once we get new data #12278
Replies: 7 comments 8 replies
-
Hey, In general, the mechanism you can use in |
Beta Was this translation helpful? Give feedback.
-
The dataset I trained on is noisy hence by having golden data with me now I want to somehow feed it to model. The above suggested approach is fine-tuning on golden data right? |
Beta Was this translation helpful? Give feedback.
-
Yes exactly! |
Beta Was this translation helpful? Give feedback.
-
Ok can you also confirm that does entity ruler only do exact string matching or it works on top of some learning? |
Beta Was this translation helpful? Give feedback.
-
Its true that the |
Beta Was this translation helpful? Give feedback.
-
hey @adypy & @kadarakos, I also have the similar use-case, where I want to fine-tune pre-trained spacy model for custom NER. I was wondering how much time the fine tuning based training would take? Is there any mechanism to estimate based on number of hyper-parameters in model, number of entities, examples & GPU/CPU power or some other relevant factors? I just need to get a rough idea about number of hours & GPU/CPU power needed. |
Beta Was this translation helpful? Give feedback.
-
@kadarakos Is there any way to access embedding of tokens while training NER model? |
Beta Was this translation helpful? Give feedback.
-
I trained NER on a very large dataset and after training I got a golden dataset for few tags that are 100% correct. Now I want this info to flow in my model- I know I can retrain model or correct the older data based on golden data but it will take lot of time and retraining which I can't afford right now.
Does there exist any other shortcuts like entity ruler or something like that to make use of golden data during inference.
Beta Was this translation helpful? Give feedback.
All reactions