Total Program Failure: Trellis2 DINOv3 extractor cannot load the HuggingFace DINOv3 model #161

@jimpickrellgit

Description

I’m trying to use the DINOv3 extractor in Trellis2, but it fails to load the HuggingFace model (facebook/dinov3-vitl16-pretrain-lvd1689m). There are two mismatches. First, the extractor expects model.layer, but the HF model exposes model.model.layer. Second, the extractor expects RoPE to return two tensors (cos, sin), while the HF model returns a single tensor, which raises ValueError: expected 2, got 1.
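To help narrow down the two mismatches described above, here is a minimal diagnostic sketch. It is not part of Trellis2 or transformers: the helper names resolve_layers and describe_rope_output are hypothetical, and the objects used in the demo are plain stand-ins rather than real models. It only checks which attribute path holds the layer list and whether a RoPE output looks like a (cos, sin) pair; it does not attempt to convert a single tensor into cos and sin, since how the two layouts relate is not established here.

```python
from types import SimpleNamespace


def resolve_layers(model, candidates=("layer", "model.layer")):
    """Return (path, layers) for the first dotted attribute path found on model.

    The candidate paths are the two mentioned in this issue; both the function
    and its defaults are illustrative assumptions, not a library API.
    """
    for path in candidates:
        obj = model
        try:
            for name in path.split("."):
                obj = getattr(obj, name)
        except AttributeError:
            continue  # this path does not exist, try the next candidate
        return path, obj
    raise AttributeError(f"none of {candidates} found on {type(model).__name__}")


def describe_rope_output(out):
    """Classify a RoPE output as a two-element 'pair' or a 'single' object."""
    if isinstance(out, (tuple, list)) and len(out) == 2:
        return "pair"
    return "single"


# Stand-in objects mimicking the two layouts described in the issue.
extractor_style = SimpleNamespace(layer=["block0", "block1"])
hf_style = SimpleNamespace(model=SimpleNamespace(layer=["block0", "block1"]))

print(resolve_layers(extractor_style)[0])  # layer
print(resolve_layers(hf_style)[0])         # model.layer
print(describe_rope_output(("cos", "sin")))  # pair
print(describe_rope_output("one_tensor"))    # single
```

Running the same two helpers against the loaded HF model and the output of its RoPE module would show which layout the installed transformers version actually produces, which may be useful to include when reporting the versions involved.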

Before I assume this is a bug, can you confirm whether the DINOv3 extractor is intended to work with the public HF DINOv3 model, or whether a different model is required? Alternatively, is there a different version of the program I should be using?

Has anyone else been able to get this to work?

This seems like an absolute dealbreaker as far as using this software at all. If this doesn't work, nothing works.
