[FEATURE] KServe gRPC frontend expose full ModelConfig specification

The current `ModelConfig` endpoint in KServe gRPC frontend populate the `ModelConfig` via only `TensorModelConfig` and other fields are set to the default values. However, some Triton Inference Server deployments uses ModelConfig and allowing a Dynamo worker to provide full `ModelConfig` specification will help the migration.

For this feature request, I may add a new "extra" field in `TensorModelConfig` so that Dynamo still treat "tensor based model" as generic as possible, and yet some specialized information can be passed around and be interpreted by parts that understand them.<br>So for ModelConfig, which I categorize it to be specialized Triton Inference Server metadata, you will pass as `{..., "extra" : {"triton_model_config": {...} # JSON representation of the ModelConfig}}` . Then the gRPC frontend will deserialize `triton_model_config` if present, otherwise only populate field from base fields in `TensorModelConfig`

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[FEATURE] KServe gRPC frontend expose full ModelConfig specification #3438

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[FEATURE] KServe gRPC frontend expose full ModelConfig specification #3438

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions