Developing a process to build good prompts is the first step every Stable Diffusion user tackles.
A good prompt needs to be detailed and specific. A good process is to look through a list of keyword categories and decide whether you want to use any of them. For example, subject, medium, style, artist, website, resolution, additional details, color and lighting.
The subject is what you want to see in the image.
Medium is the material used to make artwork.
The style refers to the artistic style of the image.
Artist names are strong modifiers. They allow you to dial in the exact style using a particular artist as a reference. It is also common to use multiple artist names to blend their styles.
Using them in a prompt is a sure way to steer the image toward these styles.
Resolution represents how sharp and detailed the image is.
Additional details are sweeteners added to modify an image.
You can control the overall color of the image by adding color keywords.
Lighting keywords can have a huge effect on how the image looks.
Using negative prompts is another great way to steer the image, but instead of putting in what you want, you put in what you don’t want. They don’t need to be objects. They can also be styles and unwanted attributes. (e.g. ugly, deformed).
The negative prompt helped the images to pop out more, making them less flat.
Using negative prompts is a must for v2 models.
Here are some universal nagative prompt:
ugly, tiling, poorly drawn hands, poorly drawn feet, poorly drawn face, out of frame, extra limbs, disfigured, deformed, body out of frame, bad anatomy, watermark, signature, cut off, low contrast, underexposed, overexposed, bad art, beginner, amateur, distorted face, blurry, draft, grainy
You can adjust the weight of a keyword by the syntax (keyword: factor)
. factor
is a value such that less than 1 means less important and larger than 1 means more important.
An equivalent way to adjust keyword strength is to use ()
and []
.
(keyword)
increases the strength of the keyword by a factor of 1.1 and is the same as(keyword:1.1)
.[keyword]
decrease the strength by a factor of 0.9 and is the same as(keyword:0.9)
.
We can use multiple of them, just like in Algebra… The effect is multiplicative.
- (keyword): 1.1 ((keyword)): 1.21 (((keyword))): 1.33
- [keyword]: 0.9 [[keyword]]: 0.81 [[[keyword]]]: 0.73
The proper term is prompt scheduling.
The syntax is [keyword1 : keyword2: factor]
factor
controls at which step keyword1 is switched to keyword2. It is a number between 0 and 1.
In the basic Stable Diffusion v1 model, that limit is 75 tokens.
AUTOMATIC1111 has no token limits. If a prompt contains more than 75 tokens, the limit of the CLIP tokenizer, it will start a new chunk of another 75 tokens, so the new “limit” becomes 150. The process can continue forever or until your computer runs out of memory.
Each chunk of 75 tokens is processed independently, and the resulting representations are concatenated before feeding into Stable Diffusion’s U-Net.
{% embed url="https://stable-diffusion-art.com/prompt-guide/#Keyword_weight" %}