@grimoire grimoire commented Jun 20, 2025

  • Enable by setting LMDEPLOY_TRITON_CUSTOM_CACHE_MGR_ENABLE=1 (disabled by default)
  • Compiled kernels are shared across all TP ranks
  • Remove unused code
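
A minimal sketch of the two behaviours described above, for illustration only. The environment variable name comes from this PR. Everything else is an assumption: the helper names `custom_cache_mgr_enabled` and `kernel_cache_dir` are hypothetical, the exact way the flag is parsed is a guess, and the per-rank directory layout used for the non-shared case is invented for contrast rather than taken from the actual code.

```python
import hashlib
import os
from pathlib import Path

# Variable name taken from the PR description.
ENV_FLAG = "LMDEPLOY_TRITON_CUSTOM_CACHE_MGR_ENABLE"


def custom_cache_mgr_enabled(environ=None) -> bool:
    """Hypothetical helper: True only when the flag is "1"; unset means disabled."""
    env = os.environ if environ is None else environ
    return env.get(ENV_FLAG, "0") == "1"


def kernel_cache_dir(root, kernel_key: str, rank: int, shared: bool) -> Path:
    """Hypothetical helper: choose a cache directory for a compiled kernel.

    With shared=True the path does not depend on the rank, so every TP rank
    resolves to the same directory and can reuse one compiled artifact.
    The per-rank layout below is an assumed contrast case, not the real one.
    """
    digest = hashlib.sha256(kernel_key.encode()).hexdigest()[:16]
    base = Path(root)
    if shared:
        return base / digest
    return base / f"rank{rank}" / digest


if __name__ == "__main__":
    shared = custom_cache_mgr_enabled()
    for rank in range(2):
        print(rank, kernel_cache_dir("/tmp/kernel_cache", "example_kernel", rank, shared))
```

With the flag set to 1, both ranks in the demo print the same directory; with it unset, each rank gets its own.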

@grimoire grimoire closed this Aug 19, 2025