Skip to content

Weird outputs using 'large' DiT planner #48

@ahmadchalhoub

Description

@ahmadchalhoub

Hello,

I am doing some experiments using the ReCogDrive model with the nuScenes dataset, and I noticed some weird behavior specifically using the 'large' DiT planner on my custom implementation that integrates with nuScenes (when using self.dit_type == "large" here:

cfg = make_recogdrive_config(self.dit_type, action_dim=3, action_horizon=8, grpo=self.grpo, input_embedding_dim=1536,sampling_method=sampling_method)
).

I have done the integration of the ReCogDrive pipeline with the nuScenes dataset with all the needed custom work, and I am getting really good performance when I use the 'small' DiT planner (check the attached image below), with both the 2B and 8B InternVL VLMs. Below is a sample result from the 2B VLM model (with the 'small' DiT planner):

Image Image

However, the 'large' DiT planner generates weird trajectories, shown below (the result below is using the 2B VLM):

Image Image

I was wondering if you have any thoughts about what could be causing this? Amazing work btw!

Thanks,
Ahmad

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions