Skip to content

Support multiple backend refs when ref is an InferencePool #4192

@salonichf5

Description

@salonichf5

Is your enhancement request related to a problem? Please describe.
Yes, we have new conformance tests GatewayWeightedAcrossTwoInferencePools that fails in the pipeline since we do not allow multiple backend refs when Inference Pool is the backend type.

What would you like to be added:
Support multiple backend refs for Inference Pools

Why this is needed:
To be gateway API inference extension conformant

Additional context
Add any other context or screenshots about the enhancement request here.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Projects

    Status

    🆕 New

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions