The performance of end-to-end fine-tuning #56

Answered by fredzzhang
JacobYuan7 asked this question in Q&A
Hi @JacobYuan7,

We did attempt fine-tuning the whole pipeline end-to-end, but found that it decreased performance. This is most likely due to the data processing in the interaction head, where we filter out detected humans and objects with scores below a certain threshold. Since most images in the dataset contain very few salient objects, there are usually fewer than 5 objects left to feed into the interaction head. As a result, the gradient flowing back to the backbone detector is very noisy.
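
To make the filtering step concrete, here is a minimal sketch in plain Python. It is not the repository's actual code: the `Detection` class, the `filter_detections` helper, the example scores, and the threshold value of 0.2 are all assumptions chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class Detection:
    """Hypothetical container for one detector output."""
    label: str
    score: float


def filter_detections(detections, score_thresh=0.2):
    """Keep only detections whose score is at least score_thresh.

    The default of 0.2 is an assumed value, not the one used in the repo.
    """
    return [d for d in detections if d.score >= score_thresh]


# Made-up detector output for a single image with few salient objects.
detections = [
    Detection("person", 0.95),
    Detection("bicycle", 0.81),
    Detection("person", 0.12),
    Detection("backpack", 0.07),
    Detection("bottle", 0.03),
]

kept = filter_detections(detections)
print(len(kept), [d.label for d in kept])
```

With only a handful of boxes surviving per image, the interaction head sees very few examples at each step, which is one way to picture why the gradient reaching the detector would be noisy.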

As a side note, whether a model is two-stage or one-stage depends on the model architecture, not the training procedure. Our model is two-stage because it first detects hu…

Answer selected by fredzzhang