World / Knowledge
Overall ranking

A summary of all our daily evaluations, showing the aggregated performance of every model we test.

Provider | Model | Correct answers | Average response time (ms)
Evaluation over time