Spawn failed #186

Open
JeremyGohBNP opened this issue Jul 20, 2022 · 6 comments
Labels
bug Something isn't working

Comments

@JeremyGohBNP

image

I am having trouble starting a server. It seems no node is available, and every attempt ends with "Spawn failed". Is there an ongoing issue?

@JeremyGohBNP JeremyGohBNP added the bug Something isn't working label Jul 20, 2022
@JeremyGohBNP
Author

Here is an additional log

image

@JeremyGohBNP
Author

This was solved by setting GPU to 0 (the default was 1).

@JeremyGohBNP
Author

Is the issue going to be fixed though? Because with only CPU and no GPU, processing times are unfortunately going through the roof. Thanks!

@JeremyGohBNP
Author

@MichaelTiemann @erikerlandson for visibility

@MichaelTiemannOSC
Contributor

I relinquished my claim on one of the GPUs this morning (after seeing the message). Can you see if that freed up what you need? I agree we need better dashboards to show resources (and hoarders).

@Shreyanand please suggest who else might work on providing dashboard/status displays to help us understand resource availability on the cluster. Thanks!

@Shreyanand
Member

@HumairAK @redmikhail Is it possible to accommodate this request for dashboards? In general it would be useful to see how the GPUs are being used.

4 participants