Potentially incorrect equation from paper #3

Open
david-stojanovski opened this issue Jan 11, 2023 · 5 comments

Comments

@david-stojanovski

Equation 16 from the paper, which gives the disentangled component, seems to differ from what is actually in the code.

In the paper the equation is given as:

output(image | labelmap) + s * (output(image | labelmap) - output(image | null_label))

However, looking at the code in /guided_diffusion/gaussian_diffusion.py, the p_mean_variance function contains the following:

model_output_zero = model(x, self._scale_timesteps(t), y=th.zeros_like(model_kwargs['y']))
model_output[:, :3] = model_output_zero[:, :3] + model_kwargs['s'] * (model_output[:, :3] - model_output_zero[:, :3])

This seems to be calculating the following instead:

output(image | null_label) + s * (output(image | labelmap) - output(image | null_label)).
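For concreteness, here is a small numeric sketch of the two formulations (the tensors and values below are placeholders for illustration, not taken from the repo):

import torch as th

# Placeholder tensors standing in for the model's predictions; the real shapes and
# values come from the diffusion model at sampling time.
out_label = th.tensor([1.0, 2.0, 3.0])  # output(image | labelmap)
out_null = th.tensor([0.5, 0.5, 0.5])   # output(image | null_label)
s = 2.0                                  # guidance scale

paper_eq16 = out_label + s * (out_label - out_null)  # equation as written in the paper
code_impl = out_null + s * (out_label - out_null)    # what p_mean_variance computes

print(paper_eq16)  # tensor([2., 5., 8.])
print(code_impl)   # tensor([1.5000, 3.5000, 5.5000])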

Am I understanding this correctly or is this a bug in the paper/code?

@valvgab-bh

@WeilunWang thanks a lot for your code, it is really nice work! :)

Coming to the issue, I also find that the implementation differs from what is described in the paper.
If we try to re-arrange the elements in the equation, we get:

model_output = output(image | null_label) + s * (output(image | labelmap) - output(image | null_label))
             = [...] 
             = output(image | labelmap)  - s' * (output(image | labelmap) - output(image | null_label))

where s' = 1 - s.

So the sign in front of the parenthesis has changed. Does this mean that, instead of moving away from the model bias output(image | null_label) by that distance, we are going in the opposite direction? Could you clarify this, please? :)
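A quick numeric check of that rearrangement (illustrative values only, with s > 1 as assumed by the sampling code):

import torch as th

out_label = th.tensor([1.0, 2.0, 3.0])  # output(image | labelmap)
out_null = th.tensor([0.5, 0.5, 0.5])   # output(image | null_label)
s = 2.0
s_prime = 1.0 - s

lhs = out_null + s * (out_label - out_null)         # what the code computes
rhs = out_label - s_prime * (out_label - out_null)  # rearranged form with s' = 1 - s

print(th.allclose(lhs, rhs))  # True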

@obaghirli

I believe it is a bug in the paper, not in the code.

@obaghirli

Figure 3(c) in the paper is in sync with the code.

@HuangChiEn

It seems the author follows the golden rule of programming: "If it works, don't touch it (don't try to understand it)" lol

@LexieYang

Hi, does anyone know the name of the argparse parameter for the guidance scale s? When I debug the code, the following if statement is false:

if 's' in model_kwargs and model_kwargs['s'] > 1.0: # FALSE
            model_output_zero = model(x, self._scale_timesteps(t), y=th.zeros_like(model_kwargs['y']))
            model_output[:, :3] = model_output_zero[:, :3] + model_kwargs['s'] * (model_output[:, :3] - model_output_zero[:, :3])

In this case, the classifier-free guidance is not functional at all!
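For what it's worth, here is a minimal sketch of what would make that branch active. The flag name --s below is hypothetical (I don't know what the repo's sampling script actually calls it); the important part is that whatever gets parsed ends up in model_kwargs under the key 's' with a value greater than 1.0:

import argparse
import torch as th

# Hypothetical flag name; the key point is populating model_kwargs['s'] > 1.0
# before sampling, otherwise the guidance branch in p_mean_variance is skipped.
parser = argparse.ArgumentParser()
parser.add_argument("--s", type=float, default=1.5, help="classifier-free guidance scale")
args = parser.parse_args()

model_kwargs = {"y": th.zeros(1, 35, 256, 256)}  # placeholder label map tensor
model_kwargs["s"] = args.s  # without this key, the if statement above stays False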
