Add allowed CUDA versions parameter to endpoint creation #375

Open
wants to merge 2 commits into
base: main

nielsrolf

This commit introduces a new parameter allowed_cuda_versions to the create_endpoint function and the
corresponding GraphQL mutation generator generate_endpoint_mutation. The parameter takes a
comma-separated list of allowed CUDA versions for the endpoint. It is added to the GraphQL mutation as
a new field, giving callers control over which CUDA versions an endpoint may run on.

  • The allowed_cuda_versions parameter is added with a default value of "12.1,12.2,12.3,12.4,12.5".
  • The parameter is also included in the GraphQL mutation string.
  • Redundant documentation and example code in create_endpoint have been removed to streamline the function's docstring.
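
To illustrate the change described above, here is a minimal sketch of how a comma-separated string parameter could be appended to a GraphQL mutation string. It is not the code in this PR: the function name sketch_endpoint_mutation, the mutation name saveEndpoint, and the field names templateId and allowedCudaVersions are assumptions made for illustration. Only the default value is taken from the description.

```python
# Default value as stated in the PR description.
DEFAULT_CUDA_VERSIONS = "12.1,12.2,12.3,12.4,12.5"


def sketch_endpoint_mutation(
    name: str,
    template_id: str,
    allowed_cuda_versions: str = DEFAULT_CUDA_VERSIONS,
) -> str:
    """Hypothetical stand-in for generate_endpoint_mutation (illustration only)."""
    input_fields = [
        f'name: "{name}"',
        f'templateId: "{template_id}"',  # assumed field name
    ]
    # Emit the new field only when a non-empty value is supplied.
    if allowed_cuda_versions:
        # `allowedCudaVersions` is an assumed GraphQL field name.
        input_fields.append(f'allowedCudaVersions: "{allowed_cuda_versions}"')
    joined = ", ".join(input_fields)
    # `saveEndpoint` is an assumed mutation name.
    return f"mutation {{ saveEndpoint(input: {{ {joined} }}) {{ id name }} }}"


if __name__ == "__main__":
    print(sketch_endpoint_mutation("demo", "tpl-123"))
```

Passing an empty string drops the field from the mutation in this sketch. Whether the real generator behaves the same way is not stated in the PR.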

 This commit introduces the `gpu_count` parameter to the `create_endpoint` function in `ctl_commands.py` and to the
 `generate_endpoint_mutation` function in `mutations/endpoints.py`. The `gpu_count` parameter specifies the number of
 GPUs an endpoint uses. Both function signatures and the construction of the GraphQL mutation string are updated to
 support the new parameter.
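
As a rough sketch of the second commit, an integer parameter such as `gpu_count` would typically be rendered into the mutation string unquoted, unlike the string-valued CUDA list. The helper name `render_gpu_count_field`, the GraphQL field name `gpuCount`, the default of 1, and the validation rules are all assumptions for illustration and are not taken from this PR.

```python
def render_gpu_count_field(gpu_count: int = 1) -> str:
    """Hypothetical helper: render gpu_count as an unquoted GraphQL integer field."""
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(gpu_count, bool) or not isinstance(gpu_count, int):
        raise TypeError("gpu_count must be an int")
    if gpu_count < 1:
        raise ValueError("gpu_count must be at least 1")
    # `gpuCount` is an assumed field name; integers carry no quotes in GraphQL.
    return f"gpuCount: {gpu_count}"


if __name__ == "__main__":
    print(render_gpu_count_field(2))
```

Validating before building the string keeps a malformed value from reaching the API, though the PR does not say whether the real code validates its input.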