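I am running grpo_rec.py on 4*A800 with batchsize=4 and num_generations=8, and one step takes about 20s on average. How can I speed this up, or is this the normal speed? Also, is it related to the following warning?
"You are attempting to use Flash Attention 2.0 with a model not initialized on GPU. Make sure to move the model to GPU after initializing it on CPU with model.to('cuda')."
I would appreciate any help. Thank you very much.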
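To tell whether the 20s is spent mostly in generation or in the backward pass, it may help to time each phase of a step separately. Below is a minimal sketch using only the Python standard library. `PhaseTimer` and the phase names "generate" and "backward" are hypothetical and not part of grpo_rec.py; the `time.sleep` calls are stand-ins for the real work.

```python
import time
from collections import defaultdict
from contextlib import contextmanager


class PhaseTimer:
    """Accumulates wall-clock time per named phase (hypothetical helper)."""

    def __init__(self):
        self.totals = defaultdict(float)  # total seconds per phase
        self.counts = defaultdict(int)    # number of timed runs per phase

    @contextmanager
    def phase(self, name):
        # Note: CUDA kernels run asynchronously, so for GPU work call
        # torch.cuda.synchronize() inside the block before it exits,
        # otherwise the measured time can be too short.
        start = time.perf_counter()
        try:
            yield
        finally:
            self.totals[name] += time.perf_counter() - start
            self.counts[name] += 1

    def average(self, name):
        """Mean seconds per run of the given phase."""
        return self.totals[name] / self.counts[name]

    def share(self, name):
        """Fraction of all measured time spent in the given phase."""
        total = sum(self.totals.values())
        return self.totals[name] / total if total else 0.0


if __name__ == "__main__":
    timer = PhaseTimer()
    for _ in range(3):
        with timer.phase("generate"):   # stand-in for sampling completions
            time.sleep(0.02)
        with timer.phase("backward"):   # stand-in for loss + optimizer step
            time.sleep(0.01)
    for name in ("generate", "backward"):
        print(f"{name}: {timer.average(name):.3f}s avg, "
              f"{timer.share(name):.0%} of step")
```

If generation dominates, the step time is probably bounded by sampling num_generations completions per prompt rather than by the Flash Attention warning itself.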