Train on entire abstracts or single sentences? #11355
Replies: 2 comments 1 reply
-
Do you have any cases where you're finding relations between items in different sentences? If so you should keep the whole abstracts, as there's no way to find relations between documents. If that's not the case, then it's hard to say whether it would make a significant difference. Any change like that will definitely have an effect on the training results, but it might not have a major influence on accuracy either way. In situations like this, where you're unsure if approach X or Y is better, it's often better to try both if feasible. |
Beta Was this translation helpful? Give feedback.
-
I've done the test on a test set of 110 abstracts, here are the results: Training on sentences:
Training on abstracts:
To me this seems like no significant difference. however this was with a relatively small dataset so i'll rerun this experiment once more once i have more training data. |
Beta Was this translation helpful? Give feedback.
-
Hey all,
I'm trying to create a pipeline for relationship extraction to extract information out of scientific abstracts. i'm currently annotating entire abstracts at once because it makes the annotation process easier. However i was wondering if this would effect the training results somewhat, in comparison to separating the abstracts into sentences?
Beta Was this translation helpful? Give feedback.
All reactions