Skip to content

[nvbugs/5297821] Fix llama4 disaggregated serving accuracy tests #4743

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
May 29, 2025

Conversation

Tabrizian
Copy link
Member

@Tabrizian Tabrizian commented May 28, 2025

Fix llama4 disaggregated serving accuracy tests

Llama4 is not supported in TRT-Backend. Need to make sure to specify backend=pytorch to make sure it works properly.

@Tabrizian Tabrizian requested a review from a team as a code owner May 28, 2025 22:41
@Tabrizian Tabrizian force-pushed the user/imant/llama4fix branch from c426d81 to 71f0941 Compare May 28, 2025 22:41
@Tabrizian
Copy link
Member Author

/bot run --stage-list "DGX_H200-8_GPUs-PyTorch-[Post-Merge]"

@tensorrt-cicd
Copy link
Collaborator

PR_Github #6816 [ run ] triggered by Bot

@tensorrt-cicd
Copy link
Collaborator

PR_Github #6816 [ run ] completed with state SUCCESS
/LLM/release-0.20/L0_MergeRequest_PR pipeline #110 (Partly Tested) completed with status: 'SUCCESS'

@Tabrizian Tabrizian force-pushed the user/imant/llama4fix branch from 71f0941 to f345ecf Compare May 29, 2025 19:15
@Tabrizian
Copy link
Member Author

/bot reuse-pipeline

@tensorrt-cicd
Copy link
Collaborator

PR_Github #6951 [ reuse-pipeline ] triggered by Bot

@Tabrizian Tabrizian enabled auto-merge (squash) May 29, 2025 19:24
@tensorrt-cicd
Copy link
Collaborator

PR_Github #6951 [ reuse-pipeline ] completed with state SUCCESS
Reusing PR_Github #6816 (Partly Tested) for commit f345ecf

@Tabrizian Tabrizian merged commit de0613b into NVIDIA:release/0.20 May 29, 2025
3 checks passed
@schetlur-nv
Copy link
Collaborator

@Tabrizian can you please add a description for this MR?

@Tabrizian
Copy link
Member Author

Added description.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants