Skip to content

feat: add v100 training configs, systemd services and operational scripts#22

Open
ogulcanaydogan wants to merge 3 commits into
mainfrom
chore/v100-training-configs
Open

feat: add v100 training configs, systemd services and operational scripts#22
ogulcanaydogan wants to merge 3 commits into
mainfrom
chore/v100-training-configs

Conversation

@ogulcanaydogan
Copy link
Copy Markdown
Owner

Summary

  • Add Turkcell 7B v100 model configs (v2 stable, v3 ultrastable, fallbacks)
  • Add vLLM v100 merged serving config
  • Add systemd training service, watchdog service and timer
  • Add training monitoring, watchdog and completion scripts

Test plan

  • Configs validated against existing schema
  • Systemd services tested on v100 host
  • Watchdog timer verified running

…ipts

- Add Turkcell 7B v100 model configs (v2 stable, v3 ultrastable, fallbacks)
- Add vLLM v100 merged serving config
- Add systemd training service, watchdog service and timer
- Add training monitoring, watchdog and completion scripts
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant