Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Neurips24 #1970

Merged
merged 30 commits into from
Oct 24, 2024
Merged

Neurips24 #1970

merged 30 commits into from
Oct 24, 2024

Conversation

lenhoanglnh
Copy link
Contributor


Description

Initially the goal was mostly data analysis and experiments for NeurIPS 24 submissions.
However, this has spurred two changes to Solidago source files:

  • Added import of other csv files of Tournesol public dataset.
  • Modified qr_quantile, which was previously flawed. More precisely, qr_quantile had the undesirable property that the addition of a new user with (near) infinite uncertainty decreased the qr_quantile estimate. With the new version of qr_quantile, infinite uncertainty is equivalent to zero voting right.

Checklist

  • I described my changes and my decisions in the PR description
  • I read the development guidelines of the CONTRIBUTING.md
  • The tests pass and have been updated if relevant
  • The code quality check pass

❤️ Thank you for your contribution!

Fixed tiny_tournesol.zip file for testing.
Added data_analysis for dataset submission.
WIP Runtime error on icml24 experiments to be fixed
…than additional term.

This implies that the addition of a new user with huge uncertainties will not affect the quantile much.
Comment on lines +92 to +104
if quantile == 0.5:
return regularization + forces.sum()

left_strength = min(1.0, quantile / (1-quantile))
right_strength = min(1.0, (1-quantile) / quantile)

forces = np.where(
forces < 0,
forces * left_strength,
forces * right_strength,
)

return regularization + quantile_term + forces.sum()
return regularization + forces.sum()
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@lenhoanglnh This change seems to change significantly the behaviour of the "zero shift" on current Tournesol data. Is it expected? Should we adjust the quantile parameter?

On "main", after applying the shift with score_shift_quantile = 0.15, about 13% of the individual scores are negative. On this branch "neurips24", that would be 37%.

As a consequence the distribution of Tournesol would be modified, with fewer videos reaching the recommendability threshold (1238 instead of 3013).

(I used the "legacy2023" pipeline, currently deployed on production. But I expect it would similar with the new pipeline).

image

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is unsatisfactory indeed.
I'm a bit disturbed. It feels like the quantile is now poorly estimated.
Maybe this is because videos with lower scores have higher uncertainty? Or less trust?

Copy link
Contributor Author

@lenhoanglnh lenhoanglnh May 17, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

OK I looked at the data and indeed, the uncertainties for bad videos are smaller than for good videos, which explains why the quantile increased with the new quantile definition. I see two simple fixes:

  • Reduce score_shift_quantile = 0.15 to score_shift_quantile = 0.05.
  • Remove uncertainties in quantile estimation.

The former is much more satisfactory.

@amatissart amatissart merged commit e5c96e5 into main Oct 24, 2024
10 checks passed
@amatissart amatissart deleted the neurips24 branch October 24, 2024 09:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Solidago Tournesol algorithms library
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants