how can i train a diffusion model #60

No360201 · 2022-08-08T11:08:50Z

when i use openai/improved-diffusion train my data to get a diffusion model ,i get three .pt, which one is the diffusion model?
when i load the model in andreas128 RePaint ,i get
Missing key(s) in state_dict: "input_blocks.3.0.in_layers.0.weight", "input_blocks.3.0.in_layers.0.bias", "input_blocks.3.0.in_layers.2.weight", "input_blocks.3.0.in_layers.2.bias", "input_blocks.3.0.emb_layers.1.weight", "input_blocks.3.0.emb_layers.1.bias", "input_blocks.3.0.out_layers.0.weight", "input_blocks.3.0.out_layers.0.bias", "input_blocks.3.0.out_layers.3.weight", "input_blocks.3.0.out_layers.3.bias", "input_blocks.6.0.in_layers.0.weight", "input_blocks.6.0.in_layers.0.bias", "input_blocks.6.0.in_layers.2.weight", "input_blocks.6.0.in_layers.2.bias", "input_blocks.6.0.emb_layers.1.weight", "input_blocks.6.0.emb_layers.1.bias", "input_blocks.6.0.out_layers.0.weight", "input_blocks.6.0.out_layers.0.bias", "input_blocks.6.0.out_layers.3.weight", "input_blocks.6.0.out_layers.3.bias", "input_blocks.9.0.in_layers.0.weight", "input_blocks.9.0.in_layers.0.bias", "input_blocks.9.0.in_layers.2.weight", "input_blocks.9.0.in_layers.2.bias", "input_blocks.9.0.emb_layers.1.weight", "input_blocks.9.0.emb_layers.1.bias", "input_blocks.9.0.out_layers.0.weight", "input_blocks.9.0.out_layers.0.bias", "input_blocks.9.0.out_layers.3.weight", "input_blocks.9.0.out_layers.3.bias", "input_blocks.12.0.in_layers.0.weight", "input_blocks.12.0.in_layers.0.bias", "input_blocks.12.0.in_layers.2.weight", "input_blocks.12.0.in_layers.2.bias", "input_blocks.12.0.emb_layers.1.weight", "input_blocks.12.0.emb_layers.1.bias", "input_blocks.12.0.out_layers.0.weight", "input_blocks.12.0.out_layers.0.bias", "input_blocks.12.0.out_layers.3.weight", "input_blocks.12.0.out_layers.3.bias", "input_blocks.15.0.in_layers.0.weight", "input_blocks.15.0.in_layers.0.bias", "input_blocks.15.0.in_layers.2.weight", "input_blocks.15.0.in_layers.2.bias", "input_blocks.15.0.emb_layers.1.weight", "input_blocks.15.0.emb_layers.1.bias", "input_blocks.15.0.out_layers.0.weight", "input_blocks.15.0.out_layers.0.bias", "input_blocks.15.0.out_layers.3.weight", "input_blocks.15.0.out_layers.3.bias", "output_blocks.2.2.in_layers.0.weight", "output_blocks.2.2.in_layers.0.bias", "output_blocks.2.2.in_layers.2.weight", "output_blocks.2.2.in_layers.2.bias", "output_blocks.2.2.emb_layers.1.weight", "output_blocks.2.2.emb_layers.1.bias", "output_blocks.2.2.out_layers.0.weight", "output_blocks.2.2.out_layers.0.bias", "output_blocks.2.2.out_layers.3.weight", "output_blocks.2.2.out_layers.3.bias", "output_blocks.5.2.in_layers.0.weight", "output_blocks.5.2.in_layers.0.bias", "output_blocks.5.2.in_layers.2.weight", "output_blocks.5.2.in_layers.2.bias", "output_blocks.5.2.emb_layers.1.weight", "output_blocks.5.2.emb_layers.1.bias", "output_blocks.5.2.out_layers.0.weight", "output_blocks.5.2.out_layers.0.bias", "output_blocks.5.2.out_layers.3.weight", "output_blocks.5.2.out_layers.3.bias", "output_blocks.8.2.in_layers.0.weight", "output_blocks.8.2.in_layers.0.bias", "output_blocks.8.2.in_layers.2.weight", "output_blocks.8.2.in_layers.2.bias", "output_blocks.8.2.emb_layers.1.weight", "output_blocks.8.2.emb_layers.1.bias", "output_blocks.8.2.out_layers.0.weight", "output_blocks.8.2.out_layers.0.bias", "output_blocks.8.2.out_layers.3.weight", "output_blocks.8.2.out_layers.3.bias", "output_blocks.11.1.in_layers.0.weight", "output_blocks.11.1.in_layers.0.bias", "output_blocks.11.1.in_layers.2.weight", "output_blocks.11.1.in_layers.2.bias", "output_blocks.11.1.emb_layers.1.weight", "output_blocks.11.1.emb_layers.1.bias", "output_blocks.11.1.out_layers.0.weight", "output_blocks.11.1.out_layers.0.bias", "output_blocks.11.1.out_layers.3.weight", "output_blocks.11.1.out_layers.3.bias", "output_blocks.14.1.in_layers.0.weight", "output_blocks.14.1.in_layers.0.bias", "output_blocks.14.1.in_layers.2.weight", "output_blocks.14.1.in_layers.2.bias", "output_blocks.14.1.emb_layers.1.weight", "output_blocks.14.1.emb_layers.1.bias", "output_blocks.14.1.out_layers.0.weight", "output_blocks.14.1.out_layers.0.bias", "output_blocks.14.1.out_layers.3.weight", "output_blocks.14.1.out_layers.3.bias".
Unexpected key(s) in state_dict: "input_blocks.3.0.op.weight", "input_blocks.3.0.op.bias", "input_blocks.6.0.op.weight", "input_blocks.6.0.op.bias", "input_blocks.9.0.op.weight", "input_blocks.9.0.op.bias", "input_blocks.12.0.op.weight", "input_blocks.12.0.op.bias", "input_blocks.15.0.op.weight", "input_blocks.15.0.op.bias", "output_blocks.2.2.conv.weight", "output_blocks.2.2.conv.bias", "output_blocks.5.2.conv.weight", "output_blocks.5.2.conv.bias", "output_blocks.8.2.conv.weight", "output_blocks.8.2.conv.bias", "output_blocks.11.1.conv.weight", "output_blocks.11.1.conv.bias", "output_blocks.14.1.conv.weight", "output_blocks.14.1.conv.bias".

No360201 · 2022-08-08T11:09:22Z

@adam-openai @aluo-openai

pokameng · 2022-10-09T13:53:53Z

hi
have you solved this problem?
I meet this problem too!!!
@No360201

No360201 · 2022-10-10T06:58:04Z

hi have you solved this problem? I meet this problem too!!! @No360201
i can train now ， do you have wechat

pokameng · 2022-10-10T07:34:27Z

My we chat NLG-wsm
@No360201

pokameng · 2022-10-10T07:41:20Z

We can chat with each other in wechat
and my wechat is NLG-wsm

lin-tianyu · 2023-01-17T16:57:25Z

Hey guys! I am now encountering the same problem. Can you share the solution with me? @pokameng @No360201

FrozenSeas · 2023-02-23T07:42:32Z

I am encountering the same problem. Did you guys find out how to solve this problem?

lin-tianyu · 2023-02-23T07:54:06Z

I trained a diffusion model base on guided-diffusion, rather than 'improved-diffusion', and this problem was solved.
I think this issue might due to the different setting of diffusion model between improved-diffusion and guided-diffusion.

ONobody · 2023-02-28T01:51:28Z

@lin-tianyu Hello, may I add your contact information and ask some training questions?

lin-tianyu · 2023-02-28T02:28:02Z

@ONobody
Of course, you can contact me via my email: [email protected]

zhangbaijin · 2023-03-14T08:57:22Z

The problem is solved, thanks,guys.

octadion · 2023-03-15T12:55:17Z

@zhangbaijin excuse me sir, can you tell me how to solve it, because i seem to be having the same problem

xyz-xdx · 2023-03-28T13:05:30Z

@zhangbaijin @pokameng @lin-tianyu Hi guys! I am encountering the same problem. Can you share the solution with me? Ask about the weight mismatch and NaN problem during model training.

daisybby · 2023-05-07T07:14:25Z

I have solved this problem!I need to train guided diffusion for repaint using my own dataset.But I ignored these hyperparameters .All hyperparameters must be consistent with repaint(stored in the YAML file), and you can preliminarily judge whether they are consistent by the size of the checkpoint. If they are inconsistent, load_state_dict will report an error.

hzy-del · 2024-02-01T05:14:26Z

I have solved this problem!I need to train guided diffusion for repaint using my own dataset.But I ignored these hyperparameters .All hyperparameters must be consistent with repaint(stored in the YAML file), and you can preliminarily judge whether they are consistent by the size of the checkpoint. If they are inconsistent, load_state_dict will report an error.

hello，can you provide the train file and related configs？

Joseph-Mulenga · 2024-02-21T22:10:31Z

@daisybby > I have solved this problem!I need to train guided diffusion for repaint using my own dataset.But I ignored these hyperparameters .All hyperparameters must be consistent with repaint(stored in the YAML file), and you can preliminarily judge whether they are consistent by the size of the checkpoint. If they are inconsistent, load_state_dict will report an error.

Hello can you please help me on how to train guided diffusion for repaint. I'm tryna train with my own data for repaint.

zhangbaijin · 2024-02-22T11:22:51Z

MODEL_FLAGS="--image_size 256 --attention_resolutions 32,16,8 --num_channels 256 --num_head_channels 64 --num_res_blocks 2 --num_heads 4 --resblock_updown true --learn_sigma True --use_scale_shift_norm true --learn_sigma true --timestep_respacing 250 --use_fp16 false --use_kl false " DIFFUSION_FLAGS="--diffusion_steps 1000 --noise_schedule linear --rescale_learned_sigmas False" TRAIN_FLAGS="--lr 1e-4 --microbatch 4 --dropout 0.0"

shidedh mentioned this issue Jul 23, 2023

Train problem,the guided-diffusion;s pretrained does't work andreas128/RePaint#40

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

how can i train a diffusion model #60

how can i train a diffusion model #60

No360201 commented Aug 8, 2022

No360201 commented Aug 8, 2022

pokameng commented Oct 9, 2022

No360201 commented Oct 10, 2022

pokameng commented Oct 10, 2022

pokameng commented Oct 10, 2022

lin-tianyu commented Jan 17, 2023

FrozenSeas commented Feb 23, 2023

lin-tianyu commented Feb 23, 2023

ONobody commented Feb 28, 2023

lin-tianyu commented Feb 28, 2023

zhangbaijin commented Mar 14, 2023

octadion commented Mar 15, 2023

xyz-xdx commented Mar 28, 2023

daisybby commented May 7, 2023

hzy-del commented Feb 1, 2024

Joseph-Mulenga commented Feb 21, 2024 •

edited

Loading

zhangbaijin commented Feb 22, 2024

how can i train a diffusion model #60

how can i train a diffusion model #60

Comments

No360201 commented Aug 8, 2022

No360201 commented Aug 8, 2022

pokameng commented Oct 9, 2022

No360201 commented Oct 10, 2022

pokameng commented Oct 10, 2022

pokameng commented Oct 10, 2022

lin-tianyu commented Jan 17, 2023

FrozenSeas commented Feb 23, 2023

lin-tianyu commented Feb 23, 2023

ONobody commented Feb 28, 2023

lin-tianyu commented Feb 28, 2023

zhangbaijin commented Mar 14, 2023

octadion commented Mar 15, 2023

xyz-xdx commented Mar 28, 2023

daisybby commented May 7, 2023

hzy-del commented Feb 1, 2024

Joseph-Mulenga commented Feb 21, 2024 • edited Loading

zhangbaijin commented Feb 22, 2024

Joseph-Mulenga commented Feb 21, 2024 •

edited

Loading