DriveBench inference for InternVL: function "process_pil_image" unused

Hi! I noticed that in [internvl_lmdeploy.py](https://github.com/xiaomi-research/recogdrive/blob/main/vqa_evaluation/DriveBench/inference/internvl_lmdeploy.py), you defined a helper function `process_pil_image` to preprocess image inputs for InternVL models, but it doesn’t seem to be used anywhere in the current inference pipeline.
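One way to double-check that the helper is never referenced is a small static scan of the file. The sketch below is only an illustration and is not taken from the repository: `find_unused_functions` and the `sample` source string are hypothetical stand-ins, and the check only looks inside a single file, so it would miss a call from another module that imports the helper.

```python
import ast


def find_unused_functions(source: str) -> list[str]:
    """Return names of functions defined in `source` that are never referenced in it."""
    tree = ast.parse(source)

    # Collect every function defined anywhere in the file.
    defined = [
        node.name
        for node in ast.walk(tree)
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
    ]

    # Treat every bare name (foo) and attribute access (obj.foo) as a reference.
    referenced = set()
    for node in ast.walk(tree):
        if isinstance(node, ast.Name):
            referenced.add(node.id)
        elif isinstance(node, ast.Attribute):
            referenced.add(node.attr)

    return [name for name in defined if name not in referenced]


# Hypothetical stand-in for the contents of the inference script.
sample = '''
def process_pil_image(img):
    return img

def main():
    print("running inference")

main()
'''

unused = find_unused_functions(sample)
print(unused)
```

To run this against the real script, the `sample` string could be replaced with the text of a local copy of `internvl_lmdeploy.py`.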

Could you clarify whether this preprocessing step is required for correct inference (or for matching the model’s expected input format), and if so, where it should be applied?

Also, could you share the decoding settings used in your experiments, specifically `max_new_tokens`, `top_p`, and `temperature`?

Thanks in advance!

Drivebench inference for internvl: function "process_pil_image" unused #56
