[None][fix] InputProcessor config naming convention fix #8705
base: main
Conversation
📝 Walkthrough

Systematic refactoring across nine model input processor files and the input registry to rename the input processors' `model_config` attribute to `config`.

Changes
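For orientation, a schematic sketch of the rename pattern described above; the classes and attribute usages below are illustrative only and are not taken from the diff:

```python
# Hypothetical "before" shape of a multimodal input processor.
class ExampleInputProcessorBefore:
    def __init__(self, model_path, config, tokenizer, trust_remote_code=True):
        self.model_config = config                 # old attribute name
        self.dtype = self.model_config.torch_dtype

# "After": the same HF config object is stored as `config`, matching the
# naming convention used on the LLM side.
class ExampleInputProcessorAfter:
    def __init__(self, model_path, config, tokenizer, trust_remote_code=True):
        self.config = config                       # new attribute name
        self.dtype = self.config.torch_dtype
```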
Estimated code review effort: 🎯 3 (Moderate) | ⏱️ ~25 minutes

Areas requiring extra attention:
Pre-merge checks and finishing touches
❌ Failed checks (2 warnings)
 ✅ Passed checks (1 passed)
 ✨ Finishing touches
 🧪 Generate unit tests (beta)
Actionable comments posted: 2
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️  Outside diff range comments (10)
tensorrt_llm/_torch/models/modeling_hyperclovax.py (4)
679-706: Fix tensor index usage from nonzero; Python slicing requires ints

mask.nonzero() yields a 2D tensor; token_len becomes a Tensor and breaks slices. Use as_tuple=True and ints.
Apply:
```diff
-        mask = (sample == self.config.img_start_id)
-        img_start_ids = mask.nonzero()
+        mask = (sample == self.config.img_start_id)
+        # 1D positions as Python ints
+        img_start_ids = torch.nonzero(mask, as_tuple=True)[0].tolist()
@@
-        for multi_img_idx, img_start_idx in enumerate(img_start_ids):
-            token_len = img_start_idx - temp_start
+        for multi_img_idx, img_start_idx in enumerate(img_start_ids):
+            token_len = int(img_start_idx) - temp_start
```
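For reference, a standalone snippet (not part of the PR) illustrating the behavior difference this comment relies on:

```python
import torch

sample = torch.tensor([5, 7, 9, 7, 11])
mask = sample == 7

# Default nonzero() returns a 2-D tensor of shape (num_matches, 1); iterating
# over it yields Tensors, so values derived from them (e.g. token_len) stay
# Tensors instead of plain Python ints.
print(mask.nonzero())          # tensor([[1], [3]])

# as_tuple=True returns a tuple of 1-D index tensors; .tolist() converts the
# positions to plain ints that are safe to use in slicing arithmetic.
positions = torch.nonzero(mask, as_tuple=True)[0].tolist()
print(positions)               # [1, 3]
print(sample[:positions[0]])   # tensor([5])
```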
726-729: Use tokenizer(...) instead of encode(...) with return_tensors

encode() doesn’t accept return_tensors; this will error at runtime.
```diff
-        input_ids = self.tokenizer.encode(text_prompt,
-                                          add_special_tokens=False,
-                                          return_tensors="pt")
+        encoded = self.tokenizer(
+            text_prompt, add_special_tokens=False, return_tensors="pt"
+        )
+        input_ids = encoded["input_ids"]
```
650-655: Correct type hints for Python 3.8 and actual types
- Use Dict/Any for 3.8.
- text_prompt is a str, not dict.

```diff
-    def _post_process(self,
-                      input_ids: torch.Tensor,
-                      preprocessed_image: dict[str, any] = None):
+    def _post_process(self,
+                      input_ids: torch.Tensor,
+                      preprocessed_image: Optional[Dict[str, Any]] = None):
@@
-    def _preprocess(self, text_prompt: dict[str, any], images: List[Any],
-                    mm_processor_kwargs: Dict[str, Any]):
+    def _preprocess(self, text_prompt: str, images: Optional[List[Any]],
+                    mm_processor_kwargs: Dict[str, Any]):
```
As per coding guidelines.
Also applies to: 710-714
1-1: Add 2025 NVIDIA Apache-2.0 header

File lacks the required current-year header.
Please prepend the standard NVIDIA Apache-2.0 header with year 2025. As per coding guidelines.
tensorrt_llm/_torch/models/modeling_phi4mm.py (1)
1-1: Add 2025 NVIDIA Apache-2.0 header

Required license header is missing.
Please add the standard header with year 2025. As per coding guidelines.
tensorrt_llm/_torch/models/modeling_mistral.py (1)
1-1: Add 2025 NVIDIA Apache-2.0 header

Header is required by guidelines.
Please add the standard header with year 2025. As per coding guidelines.
tensorrt_llm/_torch/models/modeling_gemma3vl.py (2)
63-69: Pass device via BatchFeature.to(), not as processor kwarg

AutoProcessor generally doesn’t accept device=; use .to(device=..., dtype=...).
```diff
-        processor_output = self.processor(
+        processor_output = self.processor(
             text=text_prompt,
             images=images,
             do_rescale=do_rescale,
             return_tensors="pt",
-            device=self.device).to(dtype=self.dtype)
+        ).to(device=self.device, dtype=self.dtype)
```
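As a side note, a minimal standalone sketch of the suggested pattern, assuming a recent transformers release where BatchFeature.to accepts device and dtype keywords; the tensors below are stand-ins for real processor output:

```python
import torch
from transformers import BatchFeature

# Stand-in for what an AutoProcessor call would return.
features = BatchFeature(data={
    "input_ids": torch.tensor([[1, 2, 3]]),
    "pixel_values": torch.rand(1, 3, 224, 224),
})

# Move/cast after processing instead of passing device= into the processor.
# BatchFeature.to only casts floating-point tensors when a dtype is given,
# so integer fields such as input_ids keep their dtype.
features = features.to(device="cpu", dtype=torch.float16)
print(features["pixel_values"].dtype)  # torch.float16
print(features["input_ids"].dtype)     # torch.int64
```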
1-1: Add 2025 NVIDIA Apache-2.0 header

Missing required header.
Add standard header with year 2025. As per coding guidelines.
tensorrt_llm/_torch/models/modeling_llava_next.py (1)
1-1: Add 2025 NVIDIA Apache-2.0 header

Please prepend the required header.
As per coding guidelines.
tensorrt_llm/_torch/models/modeling_vila.py (1)
1-18: Update header year to 2025

Header present but year is 2024; guidelines require current year.
Please update the copyright year to 2025.
🧹 Nitpick comments (9)
tensorrt_llm/_torch/models/modeling_hyperclovax.py (1)
668-669: Guard decoder_max_length when absent

Avoid TypeError if config.decoder_max_length is None/missing.
```diff
-        len_inputs_embeds = min(self.config.decoder_max_length,
-                                len_inputs_embeds)
+        if getattr(self.config, "decoder_max_length", None) is not None:
+            len_inputs_embeds = min(self.config.decoder_max_length,
+                                    len_inputs_embeds)
```

tensorrt_llm/_torch/models/modeling_phi4mm.py (1)
793-794: Fallback dtype if config.torch_dtype is None

Avoid downstream dtype issues when configs omit torch_dtype.
```diff
-        self.dtype = self.config.torch_dtype
+        self.dtype = getattr(self.config, "torch_dtype", None) or torch.bfloat16
```

As per coding guidelines.
tensorrt_llm/_torch/models/modeling_mistral.py (1)
239-246: Silence unused parameter warning

sampling_params is unused but required by interface; mark as intentionally unused.
```diff
     def __call__(
-        self, inputs: TextPrompt, sampling_params: SamplingParams
+        self, inputs: TextPrompt, sampling_params: SamplingParams  # unused
     ) -> Tuple[List[int], Optional[ExtraProcessedInputs]]:
```

Or rename to _sampling_params. Based on static analysis hints.
tensorrt_llm/_torch/models/modeling_gemma3vl.py (1)
49-51: dtype source OK, consider fallback

If config.torch_dtype can be None, add a fallback to torch.bfloat16 for stability.
```diff
-        self.dtype = self.config.torch_dtype
+        self.dtype = getattr(self.config, "torch_dtype", None) or torch.bfloat16
```

tensorrt_llm/_torch/models/modeling_vila.py (1)
73-76: Fix typo in error message

Minor text nit: “Unsupportede” → “Unsupported”.
```diff
-        raise ValueError(f"Unsupportede dtype for VILA: {dtype}")
+        raise ValueError(f"Unsupported dtype for VILA: {dtype}")
```

tensorrt_llm/_torch/models/modeling_llama.py (2)
1047-1065: Expose get_vocab_size for hashing paths.

Some infra (multimodal hashing) calls get_vocab_size on the processor. Add a small override to avoid relying on Base fallback to self.model_config.
```diff
 class Llama4InputProcessor(InputProcessor):
@@
     def __init__(self,
                  model_path: str,
-                 config: PretrainedConfig,
+                 config: PretrainedConfig,
                  tokenizer: AutoTokenizer,
                  trust_remote_code: bool = True):
@@
         self.image_token_end_index = self.config.eoi_token_index
+
+    def get_vocab_size(self) -> int:
+        # Prefer model config, fall back to tokenizer for robustness.
+        return int(self.config.text_config.vocab_size
+                   if getattr(self.config, "text_config", None)
+                   and getattr(self.config.text_config, "vocab_size", None) is not None
+                   else getattr(self.tokenizer, "vocab_size"))
```

Based on learnings.
1159-1167: Avoid decode in the inner loop (token perf/nit).

Use tokenizer.convert_ids_to_tokens([token_id])[0] to get the string token without running full decode.
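A small illustration of the difference (independent of the PR; the checkpoint name is only an example):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
token_id = tokenizer.encode("Hello world")[0]

# decode() runs the full detokenization path for a single id...
text = tokenizer.decode([token_id])

# ...while convert_ids_to_tokens() is a direct vocabulary lookup, which is
# cheaper inside a per-token loop. Note it returns the raw vocabulary token
# (possibly with BPE/whitespace markers), not cleaned-up text.
token = tokenizer.convert_ids_to_tokens([token_id])[0]
print(repr(text), repr(token))
```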
tensorrt_llm/_torch/models/modeling_qwen2vl.py (1)
287-294: get_dummy_text: guard when vocab_size is None.

Use same fallback as above to avoid ValueError in np.random.randint.
```diff
-        ids = np.random.randint(
-            low=0,
-            high=int(self.config.vocab_size),  # high is exclusive in NumPy
+        vocab = (self.config.vocab_size
+                 if getattr(self.config, "vocab_size", None) is not None
+                 else getattr(self.tokenizer, "vocab_size"))
+        ids = np.random.randint(
+            low=0,
+            high=int(vocab),  # high is exclusive in NumPy
             size=input_seq_len,
         ).tolist()
```
470-499: Registry now passes HF config; also update vocab-size resolution to check self.config.

Many processors now store self.config only. Extend BaseMultimodalInputProcessor.get_vocab_size to look there before falling back to tokenizer.
```diff
 class BaseMultimodalInputProcessor:
@@
-    def get_vocab_size(self) -> Optional[int]:
+    def get_vocab_size(self) -> Optional[int]:
@@
-        Resolution order:
-        1) self.model_config.vocab_size
-        2) self.tokenizer.vocab_size
+        Resolution order:
+        1) self.config.vocab_size (if present)
+        2) self.model_config.vocab_size (legacy processors)
+        3) self.tokenizer.vocab_size
@@
-        # 1) Model config
-        if hasattr(self, 'model_config') and getattr(
-                self.model_config, 'vocab_size', None) is not None:
-            return int(self.model_config.vocab_size)
+        # 1) HF config on processor
+        if hasattr(self, 'config') and getattr(self.config, 'vocab_size',
+                                               None) is not None:
+            return int(self.config.vocab_size)
+        # 2) Legacy model_config on processor
+        if hasattr(self, 'model_config') and getattr(self, 'model_config', None) is not None \
+                and getattr(self.model_config, 'vocab_size', None) is not None:
+            return int(self.model_config.vocab_size)
-        # 2) Direct tokenizer on self
+        # 3) Direct tokenizer on self
         if hasattr(self, 'tokenizer') and getattr(self.tokenizer,
                                                   'vocab_size',
                                                   None) is not None:
             return int(self.tokenizer.vocab_size)
```
📜 Review details
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (10)
- tensorrt_llm/_torch/models/modeling_gemma3vl.py (3 hunks)
- tensorrt_llm/_torch/models/modeling_hyperclovax.py (4 hunks)
- tensorrt_llm/_torch/models/modeling_llama.py (3 hunks)
- tensorrt_llm/_torch/models/modeling_llava_next.py (3 hunks)
- tensorrt_llm/_torch/models/modeling_mistral.py (3 hunks)
- tensorrt_llm/_torch/models/modeling_nanov2vlm.py (2 hunks)
- tensorrt_llm/_torch/models/modeling_phi4mm.py (2 hunks)
- tensorrt_llm/_torch/models/modeling_qwen2vl.py (11 hunks)
- tensorrt_llm/_torch/models/modeling_vila.py (5 hunks)
- tensorrt_llm/inputs/registry.py (1 hunks)
🧰 Additional context used
📓 Path-based instructions (3)
**/*.{h,hpp,hh,hxx,cpp,cxx,cc,cu,cuh,py}
📄 CodeRabbit inference engine (CODING_GUIDELINES.md)
Use only spaces, no tabs; indent with 4 spaces.
Files:
- tensorrt_llm/_torch/models/modeling_mistral.py
- tensorrt_llm/_torch/models/modeling_vila.py
- tensorrt_llm/_torch/models/modeling_hyperclovax.py
- tensorrt_llm/_torch/models/modeling_llava_next.py
- tensorrt_llm/_torch/models/modeling_llama.py
- tensorrt_llm/_torch/models/modeling_phi4mm.py
- tensorrt_llm/inputs/registry.py
- tensorrt_llm/_torch/models/modeling_qwen2vl.py
- tensorrt_llm/_torch/models/modeling_gemma3vl.py
- tensorrt_llm/_torch/models/modeling_nanov2vlm.py
**/*.py
📄 CodeRabbit inference engine (CODING_GUIDELINES.md)
**/*.py: Python code must target Python 3.8+.
Indent Python code with 4 spaces; do not use tabs.
Maintain module namespace when importing; prefer 'from package.subpackage import foo' then 'foo.SomeClass()' instead of importing the class directly.
Python filenames should be snake_case (e.g., some_file.py).
Python classes use PascalCase names.
Functions and methods use snake_case names.
Local variables use snake_case; prefix 'k' for variables that start with a number (e.g., k_99th_percentile).
Global variables use upper SNAKE_CASE prefixed with 'G' (e.g., G_MY_GLOBAL).
Constants use upper SNAKE_CASE (e.g., MY_CONSTANT).
Avoid shadowing variables from an outer scope.
Initialize all externally visible members of a class in the constructor.
Prefer docstrings for interfaces that may be used outside a file; comments for in-function or file-local interfaces.
Use Google-style docstrings for classes and functions (Sphinx-parsable).
Document attributes and variables inline so they render under the class/function docstring.
Avoid reflection when a simpler, explicit approach suffices (e.g., avoid dict(**locals()) patterns).
In try/except, catch the most specific exceptions possible.
For duck-typing try/except, keep the try body minimal and use else for the main logic.
Files:
- tensorrt_llm/_torch/models/modeling_mistral.py
- tensorrt_llm/_torch/models/modeling_vila.py
- tensorrt_llm/_torch/models/modeling_hyperclovax.py
- tensorrt_llm/_torch/models/modeling_llava_next.py
- tensorrt_llm/_torch/models/modeling_llama.py
- tensorrt_llm/_torch/models/modeling_phi4mm.py
- tensorrt_llm/inputs/registry.py
- tensorrt_llm/_torch/models/modeling_qwen2vl.py
- tensorrt_llm/_torch/models/modeling_gemma3vl.py
- tensorrt_llm/_torch/models/modeling_nanov2vlm.py
**/*.{cpp,cxx,cc,h,hpp,hh,hxx,cu,cuh,py}
📄 CodeRabbit inference engine (CODING_GUIDELINES.md)
Prepend the NVIDIA Apache-2.0 copyright header with current year to the top of all source files (e.g., .cpp, .h, .cu, .py).
Files:
- tensorrt_llm/_torch/models/modeling_mistral.py
- tensorrt_llm/_torch/models/modeling_vila.py
- tensorrt_llm/_torch/models/modeling_hyperclovax.py
- tensorrt_llm/_torch/models/modeling_llava_next.py
- tensorrt_llm/_torch/models/modeling_llama.py
- tensorrt_llm/_torch/models/modeling_phi4mm.py
- tensorrt_llm/inputs/registry.py
- tensorrt_llm/_torch/models/modeling_qwen2vl.py
- tensorrt_llm/_torch/models/modeling_gemma3vl.py
- tensorrt_llm/_torch/models/modeling_nanov2vlm.py
🧬 Code graph analysis (10)
tensorrt_llm/_torch/models/modeling_mistral.py (4)
  tensorrt_llm/_torch/models/modeling_bert.py (1)
    config (266-285)
  tensorrt_llm/models/modeling_utils.py (1)
    PretrainedConfig (369-570)
  tensorrt_llm/runtime/multimodal_model_runner.py (1)
    processor (680-683)
  tensorrt_llm/inputs/registry.py (2)
    get_mm_token_ids (96-110)
    get_mm_special_token_ids (112-123)
tensorrt_llm/_torch/models/modeling_vila.py (1)
  tensorrt_llm/models/modeling_utils.py (1)
    PretrainedConfig (369-570)
tensorrt_llm/_torch/models/modeling_hyperclovax.py (4)
  tensorrt_llm/models/modeling_utils.py (1)
    PretrainedConfig (369-570)
  tensorrt_llm/_torch/models/modeling_mistral.py (1)
    get_vocab_size (281-285)
  tensorrt_llm/_torch/models/modeling_nanov2vlm.py (1)
    get_vocab_size (298-299)
  tensorrt_llm/inputs/registry.py (1)
    get_vocab_size (74-94)
tensorrt_llm/_torch/models/modeling_llava_next.py (2)
  tensorrt_llm/_torch/models/modeling_utils.py (1)
    config (522-523)
  tensorrt_llm/models/modeling_utils.py (1)
    PretrainedConfig (369-570)
tensorrt_llm/_torch/models/modeling_llama.py (3)
  tensorrt_llm/models/modeling_utils.py (1)
    PretrainedConfig (369-570)
  tensorrt_llm/llmapi/llm.py (2)
    tokenizer (743-747)
    tokenizer (750-751)
  tensorrt_llm/_torch/model_config.py (1)
    from_pretrained (425-516)
tensorrt_llm/_torch/models/modeling_phi4mm.py (2)
  tensorrt_llm/models/modeling_utils.py (1)
    PretrainedConfig (369-570)
  tensorrt_llm/_torch/model_config.py (1)
    torch_dtype (217-222)
tensorrt_llm/inputs/registry.py (1)
  tensorrt_llm/_torch/models/modeling_utils.py (1)
    get_model_architecture (711-723)
tensorrt_llm/_torch/models/modeling_qwen2vl.py (2)
  tensorrt_llm/_torch/models/modeling_utils.py (1)
    config (522-523)
  tensorrt_llm/models/modeling_utils.py (1)
    PretrainedConfig (369-570)
tensorrt_llm/_torch/models/modeling_gemma3vl.py (4)
  tensorrt_llm/models/modeling_utils.py (1)
    PretrainedConfig (369-570)
  tensorrt_llm/llmapi/llm.py (2)
    tokenizer (743-747)
    tokenizer (750-751)
  tensorrt_llm/runtime/multimodal_model_runner.py (1)
    processor (680-683)
  tensorrt_llm/_torch/model_config.py (1)
    torch_dtype (217-222)
tensorrt_llm/_torch/models/modeling_nanov2vlm.py (5)
  tensorrt_llm/models/modeling_utils.py (1)
    PretrainedConfig (369-570)
  tensorrt_llm/_torch/model_config.py (1)
    torch_dtype (217-222)
  tensorrt_llm/_torch/models/modeling_hyperclovax.py (1)
    get_vocab_size (592-593)
  tensorrt_llm/_torch/models/modeling_mistral.py (1)
    get_vocab_size (281-285)
  tensorrt_llm/inputs/registry.py (1)
    get_vocab_size (74-94)
🪛 Ruff (0.14.1)
tensorrt_llm/_torch/models/modeling_mistral.py
240-240: Unused method argument: sampling_params
(ARG002)
tensorrt_llm/_torch/models/modeling_vila.py
870-870: Unused method argument: model_path
(ARG002)
873-873: Unused method argument: trust_remote_code
(ARG002)
tensorrt_llm/_torch/models/modeling_phi4mm.py
766-766: Avoid specifying long messages outside the exception class
(TRY003)
tensorrt_llm/_torch/models/modeling_nanov2vlm.py
265-265: Avoid specifying long messages outside the exception class
(TRY003)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)
- GitHub Check: Pre-commit Check
🔇 Additional comments (6)
tensorrt_llm/_torch/models/modeling_mistral.py (1)
281-286: Config lookup for vocab size is correct

Using self.config.text_config.vocab_size aligns with Mistral3’s config layout.
tensorrt_llm/_torch/models/modeling_llava_next.py (1)
65-69: Hidden size/vocab lookups correctly moved to config

Using self.config.text_config.hidden_size and self.config.image_token_index is consistent with the model’s config structure.
Ensure all call sites constructing LlavaNextInputProcessor pass a PretrainedConfig with text_config/image_token_index populated.
tensorrt_llm/_torch/models/modeling_llama.py (2)
8-9: LGTM on import adjustments.

Types align with constructor changes; no issues.
1124-1129: LGTM on hidden-size validation.

Clear error on mismatch; the safeguard is correct.
tensorrt_llm/_torch/models/modeling_nanov2vlm.py (1)
259-273: Constructor rename and config-sourced fields look good.

Consistent with repo-wide pattern; dtype/ids sourced from self.config correctly.
Also applies to: 288-293
tensorrt_llm/_torch/models/modeling_qwen2vl.py (1)
399-401: LGTM on get_rope_index call-site.

Signature and usage now match the classmethod.
/bot run
PR_Github #22743 [ run ] triggered by Bot. Commit:
This PR renames model_config to config, giving a consistent naming convention between VLMs and LLMs. Thanks @yechank-nvidia for this fix.
PR_Github #22743 [ run ] completed with state
Approving mistral changes.
438cf23 to 4ca83ea (Compare)
/bot run
PR_Github #22993 [ run ] triggered by Bot. Commit:
PR_Github #22993 [ run ] completed with state
Signed-off-by: yechank <[email protected]>
408ff32 to 49e403f (Compare)
/bot run
PR_Github #23130 [ run ] triggered by Bot. Commit:
PR_Github #23130 [ run ] completed with state
/bot run
PR_Github #23137 [ run ] triggered by Bot. Commit:
PR_Github #23137 [ run ] completed with state
/bot run
PR_Github #23164 [ run ] triggered by Bot. Commit:
PR_Github #23164 [ run ] completed with state
/bot run
PR_Github #23196 [ run ] triggered by Bot. Commit:
PR_Github #23196 [ run ] completed with state
Summary by CodeRabbit
Release Notes