[ Request ] Multimodal model generation #132
Comments
If you use Hugging Face's transformers library, it is possible. Just pass the prefix function to the generate method and it will work.
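For anyone unsure what "the prefix function" does: in transformers, `generate` accepts a `prefix_allowed_tokens_fn` argument, a callable that takes `(batch_id, input_ids)` and returns the list of token ids allowed at the next step. Below is a minimal, library-free sketch of that idea. The toy vocabulary, the `ALLOWED_OUTPUTS` list, the `toy_scores` stand-in for model logits, and the decode loop are all hypothetical and only illustrate the mechanism; this is not the enforcer's actual implementation.

```python
from typing import Callable, List, Sequence

# Hypothetical toy vocabulary: token id -> token text.
VOCAB = ['{', '}', '"answer"', ':', '"yes"', '"no"', 'hello']

# The only outputs we accept, as token-id sequences:
# {"answer":"yes"} or {"answer":"no"}
ALLOWED_OUTPUTS = [
    [0, 2, 3, 4, 1],
    [0, 2, 3, 5, 1],
]


def make_prefix_fn(prompt_len: int) -> Callable[[int, Sequence[int]], List[int]]:
    """Build a callable shaped like transformers' prefix_allowed_tokens_fn."""

    def prefix_fn(batch_id: int, input_ids: Sequence[int]) -> List[int]:
        # Only the tokens generated after the prompt matter for the constraint.
        generated = list(input_ids[prompt_len:])
        allowed = set()
        for seq in ALLOWED_OUTPUTS:
            if len(generated) < len(seq) and seq[:len(generated)] == generated:
                allowed.add(seq[len(generated)])
        return sorted(allowed)

    return prefix_fn


def toy_scores(ids: Sequence[int]) -> List[float]:
    # Stand-in for model logits: the "model" would rather say 'hello'.
    return [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.9]


def constrained_greedy_decode(prompt_ids, score_fn, prefix_fn, max_new_tokens=8):
    """Greedy decoding that only picks among the tokens the prefix function allows."""
    ids = list(prompt_ids)
    for _ in range(max_new_tokens):
        allowed = prefix_fn(0, ids)
        if not allowed:  # nothing left to generate: the output is complete
            break
        scores = score_fn(ids)
        ids.append(max(allowed, key=lambda t: scores[t]))
    return ids[len(prompt_ids):]


def decode(ids: Sequence[int]) -> str:
    return ''.join(VOCAB[t] for t in ids)


prompt = [6, 6]  # two 'hello' tokens standing in for a prompt
result = constrained_greedy_decode(prompt, toy_scores, make_prefix_fn(len(prompt)))
print(decode(result))
```

Although the toy model scores `hello` highest, the mask leaves it only the schema-conforming tokens. With a real model, the equivalent call would pass the enforcer's prefix function as `prefix_allowed_tokens_fn=...` to `model.generate`.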
I am currently working with MiniCPM.
When I pass this prefix function to model.generate, I get errors.
Here is a small notebook I made to show where it goes wrong.
There is a new sample that shows multimodal extraction working.
Currently, it is only possible to pass text to an enforced model. Some tasks, such as image processing, require multimodal models. It would be great if a multimodal model could be constrained to follow a particular JSON schema.