GeneratorArgs.is_torchtune_model is a misnomer #1273

Open
@Jack-Khuu

Description

🚀 The feature, motivation and pitch

is_torchtune_model is a misnomer and can lead to buggy code. It gates logic for models whose names end with the tune suffix, but not all torchtune models end with this suffix. For example, Flamingo (Llama3.2 11B) is also a torchtune model, yet the flag is not set for it.

This results in code like this:

    if (
        self.is_torchtune_model
        or self.model.config.model_type == ModelType.Flamingo
    ):

Alternatives

No response

Additional context

No response

RFC (Optional)

While the logic around torchtune models itself needs refactoring, the short-term solution is some combination of:

  • Rename the field so that it accurately describes what it checks (the tune suffix)
  • Refactor the logic so that it accounts for torchtune models that do not end in tune
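
One possible shape for the second option is sketched below. It is only an illustration under stated assumptions: the ModelType members other than Flamingo, the TORCHTUNE_MODEL_TYPES set, and the is_torchtune_backed helper are hypothetical names introduced here, not existing torchchat code. The idea is to answer "is this a torchtune model?" in one place, so call sites no longer need to add an extra check for Flamingo.

```python
from enum import Enum


class ModelType(Enum):
    # Hypothetical stand-in for the real enum; only Flamingo is named in this issue.
    TextOnly = "text_only"
    Flamingo = "flamingo"


# Assumption: model types that are always backed by torchtune,
# regardless of how the model name is spelled.
TORCHTUNE_MODEL_TYPES = frozenset({ModelType.Flamingo})


def is_torchtune_backed(model_type: ModelType, model_name: str = "") -> bool:
    """Return True if the model is torchtune-backed.

    A model qualifies either by its type or by the legacy "tune" name suffix.
    """
    has_tune_suffix = model_name.endswith("tune")
    return model_type in TORCHTUNE_MODEL_TYPES or has_tune_suffix


if __name__ == "__main__":
    # Flamingo qualifies by type even though its name has no "tune" suffix.
    print(is_torchtune_backed(ModelType.Flamingo, "llama3.2-11B"))
    # A text-only model qualifies only through the suffix.
    print(is_torchtune_backed(ModelType.TextOnly, "some-model-tune"))
```

With a helper along these lines, the condition shown earlier would reduce to a single call, and the existing field could be renamed to something that reflects the suffix check it actually performs.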

Labels

  • actionable: Items in the backlog waiting for an appropriate impl/fix
  • good first issue: Good for newcomers
  • triaged: This issue has been looked at a team member, and triaged and prioritized into an appropriate module
