What is the best approach to optimize a custom trained NER / SpanCat model ? #11562

rennanvoa2 · 2022-09-30T10:37:19Z

rennanvoa2
Sep 30, 2022

Hello there, I have fine-tuned a couple of custom models using Spacy's and transformer's models and I was wondering if there is a way of optimizing the inference time. I thought that maybe one could parse the model to ONNX and optimize and quantize it, but I searched and found nothing about it.

Is there a way of applying optimization and quantization to Spacy models?

polm · 2022-10-03T06:11:14Z

polm
Oct 3, 2022

We have had a few questions about ONNX before - see #7704 or here - but currently we don't have support for that or any reports of anyone doing it successfully.

If you haven't seen it yet, the speed FAQ may be helpful.

1 reply

polm Oct 4, 2022

Also, looks like you found it before, but #10006 is related.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What is the best approach to optimize a custom trained NER / SpanCat model ? #11562

{{title}}

Replies: 1 comment 1 reply

{{title}}

{{title}}

Select a reply

What is the best approach to optimize a custom trained NER / SpanCat model ? #11562

rennanvoa2 Sep 30, 2022

Replies: 1 comment · 1 reply

polm Oct 3, 2022

polm Oct 4, 2022

rennanvoa2
Sep 30, 2022

Replies: 1 comment 1 reply

polm
Oct 3, 2022