You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Oct 31, 2023. It is now read-only.
Stick an explanation: Shuffle in the DistributedSampler is true(default). If you set shuffle in the DistributedSampler to true, you do not need to set shuffle in the DataLoader that uses the sampler, because the DistributedSampler generates different random seeds for each process in a distributed environment to mess up the data. Therefore, in a distributed environment, it is recommended to set the shuffle only in the DistributedSampler.
Sign up for freeto subscribe to this conversation on GitHub.
Already have an account?
Sign in.
Excellent work!
I see that your training script does not use the
shuffle=True
setting when loading data. I wonder if this setting has any effect for performance?Does using
shuffle=True
have a positive effect? Or negative effects?The text was updated successfully, but these errors were encountered: