What is the maximum length the annotation of a span can be? #11450
ChinmayR1202
started this conversation in
Help: Best practices
Replies: 1 comment 3 replies
-
Generating n-grams for paragraphs may affect model performance. If you're going to use spancat, how are you planning to suggest the spans? Are they always full paragraphs? Another approach might be to try textcat after splitting them into sections. Of course this also depends on your domain and the type of labels you have. If you're still keen to use spancat (and use the n-gram suggester), you can run your annotated data into debug data to see the number of n-grams you need to limit for the span suggester. |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
I am trying to train a span categorizer but since I'm new to it, I am a bit unsure how to annotate my data. I'm using the spancat as a parser to extract specific sections from a document. Sometimes those sections are one sentence long, sometimes a paragraph long. I'm using prodigy to annotate, and I was wondering when it comes to paragraphs, do I label the entire paragraph as one continuous span, or do I break it down into individual sentences one after the other till the paragraph is covered?
Thanks in advance!
Beta Was this translation helpful? Give feedback.
All reactions