Link preview
GitHub - rdi-berkeley/agents-last-exam: Agents' Last Exam
Agents' Last Exam. Contribute to rdi-berkeley/agents-last-exam development by creating an account on GitHub. GitHub · github.com
Agents' Last Exam evaluates AI agents on long-horizon professional workflows with verifiable outcomes across industries such as finance, robotics, bioinformatics, media, and more.
Comments