Evaluation
an archive of posts with this tag
Aug 13, 2024 | Challenges in Evaluating LLMs: A Statistical Analysis of Chatbot Arena Leaderboard |
---|---|
Jul 1, 2024 | On OpenLLM Leaderboard |
an archive of posts with this tag
Aug 13, 2024 | Challenges in Evaluating LLMs: A Statistical Analysis of Chatbot Arena Leaderboard |
---|---|
Jul 1, 2024 | On OpenLLM Leaderboard |