Large Language Model Leaderboard

Compare the performance of large language models across different benchmarks. Higher scores indicate better performance. Click the button below to change the sorting criteria.

Last updated: April 3, 2025

Sort by:

Proprietary

Open