-
Notifications
You must be signed in to change notification settings - Fork 1
Open
Labels
llm-eval-more-modelsResults are only presented with niche LLM doubao-1.5Results are only presented with niche LLM doubao-1.5rebuttlerebuttle of paper submitrebuttle of paper submit
Description
Fix LaaJ LLM model to GPT-4.1-mini, try the following possible candidates as base LLMs:
- Qwen3: 8b, 14b, 30b-a3b
- Gemini: 2.0-flash-lite, 2.5-flash-lite
- repeat time: one time first for all five models, and preserve two base models (one from Qwen and one from Gemini) for repeating three times
- datasets: first try LM-SYS-100
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
llm-eval-more-modelsResults are only presented with niche LLM doubao-1.5Results are only presented with niche LLM doubao-1.5rebuttlerebuttle of paper submitrebuttle of paper submit