Skip to content

Add support of SGD to GRPO trainer#1

Open
beryll1um wants to merge 1 commit intogensyn-ai:mainfrom
beryll1um:feature/sgd-support
Open

Add support of SGD to GRPO trainer#1
beryll1um wants to merge 1 commit intogensyn-ai:mainfrom
beryll1um:feature/sgd-support

Conversation

@beryll1um
Copy link

Adam is memory-intensive, so most users cannot use it on graphics cards with less than 8GB of VRAM. High-momentum SGD provides nearly the required convergence, but requires more training time. Essentially, it converts VRAM usage to GPU usage over time.

Adam is memory-intensive, so most users cannot use it on graphics cards with
less than 8GB of VRAM. High-momentum SGD provides nearly the required
convergence, but requires more training time. Essentially,
it converts VRAM usage to GPU usage over time.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant