Using a VLM inside a reward function #159

korbinian-hoermann · 2025-03-13T12:32:21Z

Hi,

I would like to fine-tune a model on web trajectories.

One component of the reward should be a low-level action execution evaluation,
which makes use of a VLM to determine if a generated command, e.g. click(x, y) is suitable to execute a high-level action,
e.g. Click the yellow button.

Therefore, I would like to annotate the original input image with a circle at x,y , and ask the evaluator VLM
if the click action at the annotated coordinates is suitable to execute the high-level action.

How can i access the image inside my reward function ?

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using a VLM inside a reward function #159

Using a VLM inside a reward function #159

korbinian-hoermann commented Mar 13, 2025 •

edited

Loading

Using a VLM inside a reward function #159

Using a VLM inside a reward function #159

Comments

korbinian-hoermann commented Mar 13, 2025 • edited Loading

korbinian-hoermann commented Mar 13, 2025 •

edited

Loading