Add support for distributed Metrics Visualization #12

Open
gauglertodd opened this issue May 22, 2024 · 0 comments
Labels
good first issue Good issue for first-time contributors

Comments

gauglertodd commented May 22, 2024

The README includes a sample visualization that shows live loss metrics for a single PyTorch worker on Ray.

Implement a more general interface for publishing PyTorch metrics live to raydar, so that users can follow the quality and progress of their distributed ML workflows on Ray in real time. A wishlist of supported features might include:

  • Real-time loss metrics while distributed training is running
  • Real-time custom metrics, per model, while distributed training is running
  • Support from within the PyTorch API on Ray
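
As a starting point for discussion, here is a minimal sketch of what such an interface could look like. Everything in it is an assumption made for illustration: `MetricRecord`, `MetricsCollector`, `report`, `latest`, and `history` are hypothetical names, not part of raydar's current API, and the collector here lives in a single process. How it would be shared across Ray workers (for example, by hosting it in a Ray actor) is deliberately left open.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class MetricRecord:
    """One metric observation from one training worker (hypothetical schema)."""
    worker_id: str
    model: str
    name: str   # "loss" or any custom metric name
    step: int
    value: float


class MetricsCollector:
    """Hypothetical in-memory collector; not part of raydar's current API."""

    def __init__(self) -> None:
        # One time series per (worker_id, model, metric name).
        self._series: Dict[Tuple[str, str, str], List[MetricRecord]] = defaultdict(list)

    def report(self, worker_id: str, model: str, step: int, **metrics: float) -> None:
        """Record any number of named metrics for one worker at one step."""
        for name, value in metrics.items():
            record = MetricRecord(worker_id, model, name, step, float(value))
            self._series[(worker_id, model, name)].append(record)

    def latest(self) -> Dict[Tuple[str, str, str], float]:
        """Return the most recent value of every series."""
        return {key: records[-1].value for key, records in self._series.items()}

    def history(self, worker_id: str, model: str, name: str) -> List[Tuple[int, float]]:
        """Return (step, value) pairs for one series, in reporting order."""
        records = self._series.get((worker_id, model, name), [])
        return [(r.step, r.value) for r in records]


# Example: two workers reporting loss, one also reporting a custom metric.
collector = MetricsCollector()
collector.report("worker-0", "resnet", step=1, loss=0.9)
collector.report("worker-0", "resnet", step=2, loss=0.7, accuracy=0.55)
collector.report("worker-1", "resnet", step=1, loss=1.1)
```

Passing metrics as keyword arguments lets loss and custom per-model metrics go through the same call, and keying each series by worker, model, and metric name gives a live view enough information to draw one line per series.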
@gauglertodd gauglertodd added the good first issue Good issue for first-time contributors label May 22, 2024