Allow masking padding tokens in cross attention layers #94
This PR adds a new parameter, `mask_pad_tokens`, to the `stable_diffusion_xl` and `stable_diffusion_2` classes that allows masking out padding tokens in the cross attention layers. The `generate()` function had to get a bit more complicated because of the case where we pass in pre-tokenized inputs: we now want to allow passing the padding mask along with them (and likewise for pre-tokenized negative prompts). Let me know if you think of a better way of handling this :/
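For context, here is a minimal, self-contained sketch of the masking behavior itself (not the code in this PR): the tokenizer's padding mask is applied to the cross attention scores so padded token positions get zero attention weight. All shapes and variable names below are illustrative.

```python
# Illustrative sketch (not this PR's code) of masking padding tokens in a
# cross attention layer: padded text positions get -inf scores, hence zero
# attention weight after the softmax.
import torch
import torch.nn.functional as F

batch, img_tokens, txt_tokens, dim = 2, 16, 8, 32
image_hidden = torch.randn(batch, img_tokens, dim)  # queries: latent image features
text_hidden = torch.randn(batch, txt_tokens, dim)   # keys/values: text embeddings

# 1 for real tokens, 0 for padding (what the tokenizer's attention_mask provides).
pad_mask = torch.tensor([[1, 1, 1, 0, 0, 0, 0, 0],
                         [1, 1, 1, 1, 1, 1, 0, 0]])

scores = image_hidden @ text_hidden.transpose(-1, -2) / dim ** 0.5  # (batch, img, txt)
scores = scores.masked_fill(~pad_mask.bool().unsqueeze(1), float("-inf"))
attn = F.softmax(scores, dim=-1)
out = attn @ text_hidden

assert attn[0, :, 3:].sum() == 0  # no attention weight lands on padding tokens
```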
One small note: this change might be slightly redundant with the `zero_out_negative_prompt` arg (in `generate()`) and `zero_dropped_captions` (in the dataloader) that I added not long ago for zeroing out empty negative prompts and dropped captions. I think `mask_pad_tokens` ought to serve a similar purpose by masking out the empty text embeddings in the cross attention layers; however, `zero_out_negative_prompt`/`zero_dropped_captions` additionally zero out the pooled text embedding (used in SDXL as micro-conditioning), and I think we still want to keep that functionality. The cleanest way would probably be to merge these all into one flag?
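To make the overlap concrete, here is a hedged sketch of the two behaviors being compared; the variable names and the "dropped" check are illustrative, not the repo's actual code. `mask_pad_tokens` only hides padding positions inside cross attention, while the zero-out flags null the per-token embeddings and the pooled embedding used for SDXL micro-conditioning.

```python
# Illustrative sketch only (not the repo's code): contrast between masking
# padding tokens in cross attention and zeroing out dropped/empty captions.
import torch

batch, txt_tokens, dim, pooled_dim = 2, 8, 32, 16
text_embeds = torch.randn(batch, txt_tokens, dim)  # per-token text embeddings
pooled_embeds = torch.randn(batch, pooled_dim)     # pooled embedding (SDXL micro-conditioning)

# Suppose the second caption in the batch was dropped / is an empty negative prompt.
dropped = torch.tensor([False, True])

# mask_pad_tokens: embeddings are left untouched; cross attention simply never
# attends to padding positions (see the attention sketch above).

# zero_out_negative_prompt / zero_dropped_captions: the text embeddings AND the
# pooled embedding are zeroed for dropped captions.
keep = (~dropped).float()
text_embeds_zeroed = text_embeds * keep.view(-1, 1, 1)
pooled_embeds_zeroed = pooled_embeds * keep.view(-1, 1)
```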