Overview
Vellum is an LLM leaderboard tool that allows users to compare the performance of different AI models across various capabilities like reasoning, math, coding, and tool use.
Role in this knowledge base
It is presented as a resource for comparing and evaluating the performance of different AI models.
Key facts
- It provides a leaderboard for comparing models on reasoning, math, coding, and tool use.