Skip to content

Results are only presented with niche LLM of doubao-1.5 #9

@Qingbolan

Description

@Qingbolan

Fix LaaJ LLM model to GPT-4.1-mini, try the following possible candidates as base LLMs:

  • Qwen3: 8b, 14b, 30b-a3b
  • Gemini: 2.0-flash-lite, 2.5-flash-lite
  • repeat time: one time first for all five models, and preserve two base models (one from Qwen and one from Gemini) for repeating three times
  • datasets: first try LM-SYS-100

Metadata

Metadata

Assignees

No one assigned

    Labels

    llm-eval-more-modelsResults are only presented with niche LLM doubao-1.5rebuttlerebuttle of paper submit

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions