Commit 85827e0

note about dropout
1 parent bbb2a0c commit 85827e0

1 file changed: +1 -1 lines changed

ch05/03_bonus_pretraining_on_gutenberg/pretraining_simple.py

@@ -180,7 +180,7 @@ def train_model_simple(model, optimizer, device, n_epochs,
     "emb_dim": 12, # Embedding dimension
     "n_heads": 2, # Number of attention heads
     "n_layers": 2, # Number of layers
-    "drop_rate": 0.0, # Dropout rate
+    "drop_rate": 0.0, # Dropout rate, deactivated via 0.0 as dropout in LLMs is not recommended anymore
     "qkv_bias": False # Query-key-value bias
 }
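
The new comment says that a `drop_rate` of 0.0 deactivates dropout. A minimal sketch of why that holds, written in plain Python rather than PyTorch so it runs without dependencies: the `dropout` function and the sample activations are hypothetical illustrations, not code from this repository. It assumes the usual "inverted dropout" scheme, where surviving values are scaled by 1 / (1 - drop_rate).

```python
import random


def dropout(values, drop_rate, training=True, rng=None):
    """Inverted dropout over a list of floats (illustrative sketch only)."""
    if not 0.0 <= drop_rate < 1.0:
        raise ValueError("drop_rate must be in [0.0, 1.0)")
    # With drop_rate == 0.0 (or outside training) nothing is zeroed or rescaled.
    if not training or drop_rate == 0.0:
        return list(values)
    if rng is None:
        rng = random.Random()
    keep_prob = 1.0 - drop_rate
    # Keep each value with probability keep_prob and rescale the survivors.
    return [v / keep_prob if rng.random() < keep_prob else 0.0 for v in values]


activations = [0.5, -1.0, 2.0, 0.25]  # hypothetical sample values

# drop_rate = 0.0, as in the config above: the input passes through unchanged.
unchanged = dropout(activations, drop_rate=0.0)

# A nonzero rate zeroes some entries and scales the rest by 1 / keep_prob.
dropped = dropout(activations, drop_rate=0.5, rng=random.Random(0))
```

Under these assumptions, setting the rate to 0.0 in the config has the same effect as removing the dropout layers, while leaving the key in place so the rate can be changed later.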

0 commit comments