1 points by metadat 3 hours ago | 1 comments
Given a task (e.g. "Build a Spotify but for movies"), compares the results of 2 unidentified LLMs and lets you rank which one is better before revealing which model you preferred.
Check out the global leaderboard to see which model is most beloved:
https://web.lmarena.ai/leaderboard
Large Language Model Battle Arena
(web.lmarena.ai)1 points by metadat 3 hours ago | 1 comments
Comments
Given a task (e.g. "Build a Spotify but for movies"), compares the results of 2 unidentified LLMs and lets you rank which one is better before revealing which model you preferred.
Check out the global leaderboard to see which model is most beloved:
https://web.lmarena.ai/leaderboard