Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Using a VLM inside a reward function #159

Open
korbinian-hoermann opened this issue Mar 13, 2025 · 0 comments
Open

Using a VLM inside a reward function #159

korbinian-hoermann opened this issue Mar 13, 2025 · 0 comments

Comments

@korbinian-hoermann
Copy link

korbinian-hoermann commented Mar 13, 2025

Hi,

I would like to fine-tune a model on web trajectories.

One component of the reward should be a low-level action execution evaluation,
which makes use of a VLM to determine if a generated command, e.g. click(x, y) is suitable to execute a high-level action,
e.g. Click the yellow button.

Therefore, I would like to annotate the original input image with a circle at x,y , and ask the evaluator VLM
if the click action at the annotated coordinates is suitable to execute the high-level action.

How can i access the image inside my reward function ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant