How to fix slow training speed #145

Open
B1ueorange opened this issue Mar 10, 2025 · 1 comment

Comments

@B1ueorange

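On 4×A800 with batchsize=4 and num_generations=8, running grpo_rec.py takes about 20 s per step on average. How can I speed this up, or is this a normal speed? Also, is it related to the warning "You are attempting to use Flash Attention 2.0 with a model not initialized on GPU. Make sure to move the model to GPU after initializing it on CPU with model.to('cuda')."? I would appreciate an answer. Thank you very much.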

@hekaijie123

I am using 8 A100 GPUs, and GPU utilization stays below 40% most of the time. Is there still room for optimization?
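
Before tuning anything, it may help to measure where the time in each step goes. The following is a minimal, stdlib-only sketch of a per-phase wall-clock timer. `PhaseTimer` and the phase names ("generation", "reward", "backward") are hypothetical and not part of grpo_rec.py; the `time.sleep` calls only stand in for real work. For GPU code the timings are only meaningful if the device is synchronized (for example with `torch.cuda.synchronize()`) before each clock read.

```python
import time
from collections import defaultdict
from contextlib import contextmanager


class PhaseTimer:
    """Accumulates wall-clock time per named phase (hypothetical helper)."""

    def __init__(self):
        self.totals = defaultdict(float)  # phase name -> total seconds
        self.counts = defaultdict(int)    # phase name -> number of entries

    @contextmanager
    def phase(self, name):
        # For CUDA work, synchronize the device before reading the clock,
        # otherwise asynchronous kernels are attributed to the wrong phase.
        start = time.perf_counter()
        try:
            yield
        finally:
            self.totals[name] += time.perf_counter() - start
            self.counts[name] += 1

    def shares(self):
        """Return each phase's fraction of the total measured time."""
        total = sum(self.totals.values())
        if total == 0:
            return {}
        return {name: t / total for name, t in self.totals.items()}


if __name__ == "__main__":
    timer = PhaseTimer()
    for _ in range(3):  # three dummy "training steps"
        with timer.phase("generation"):
            time.sleep(0.02)   # placeholder for sampling completions
        with timer.phase("reward"):
            time.sleep(0.005)  # placeholder for reward computation
        with timer.phase("backward"):
            time.sleep(0.01)   # placeholder for loss and optimizer step
    for name, share in sorted(timer.shares().items(), key=lambda kv: -kv[1]):
        print(f"{name}: {timer.totals[name]:.3f}s ({share:.0%})")
```

If one phase dominates the step time, that phase is the natural first place to look for a speedup.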
