The ultimate intelligence benchmark is not games.
It's physical ability.
How good are the leading models at controlling robots?
BOT-AGI-1 is an independent robotics benchmark, with tasks that humans can easily solve.
Full benchmark coming soon
Interested in contributing tasks, evaluations, or model results to BOT-AGI-1?
Get in touch