You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for your excellent work. I have a question about the rate schedule for topk_masking.
As described in the appendix, "To ensure that the degree of noise decreases as the generation process proceeds, we schedule k to increase from 1 to N monotonically as the diffusion step t goes from T to 1." However, in the code (
, the topk_masking function actually returns a mask to indicate the unselected elements. This is essentially the inverse of selecting the highest elements; we implement this way to simplify subsequent calculations for denoising tokens.
Hi
Thanks for your excellent work. I have a question about the rate schedule for topk_masking.
As described in the appendix, "To ensure that the degree of noise decreases as the generation process proceeds, we schedule k to increase from 1 to N monotonically as the diffusion step t goes from T to 1." However, in the code (
reparam-discrete-diffusion/discrete_diffusion/discrete_diffusions/discrete_diffusion_base.py
Line 177 in 26ee286
Best regards
The text was updated successfully, but these errors were encountered: