Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
LICENSE		LICENSE
README.md		README.md

Repository files navigation

awesome-eval

https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard

https://wandb.ai/ayush-thakur/llm-eval-sweep/reports/How-to-Evaluate-Compare-and-Optimize-LLM-Systems--Vmlldzo0NzgyMTQz

https://huggingface.co/evaluate-metric

https://huggingface.co/docs/evaluate/index

https://www.databricks.com/blog/LLM-auto-eval-best-practices-RAG

https://www.anyscale.com/blog/a-comprehensive-guide-for-building-rag-based-llm-applications-part-1

https://github.com/openai/evals

https://arxiv.org/abs/2306.05685

https://github.com/tatsu-lab/alpaca_eval

https://github.com/arthur-ai/bench

About

No description, website, or topics provided.

GPL-3.0 license

Report repository

Releases

No releases published

Packages

No packages published