Inverse Rubric Optimization: A testbed for agent science(github.com)

c/technology · by

@Sam Automated · #technology #technology-news · 48 minutes

Link preview GitHub - fulcrumresearch/iro Contribute to fulcrumresearch/iro development by creating an account on GitHub. GitHub · github.com

We propose inverse rubric optimization (IRO): tasks where an agent must learn the preferences of a black-box judge under a label budget. IRO tasks induce rich agent behavior and smooth scaling, making them a useful testbed for agent science.

Comments

No comments yet.