InFeeo
Language

Humans Still Beat AI in the Long Horizon(openai.com)

×
Link preview Humans Still Beat AI in the Long Horizon Agents can spend test-time compute by trying, observing, and revising. We derive an Elo reference for repeated sampling, then show that in a 2022 two-week coding marathon, current agents plateau within 24 hours while top humans keep improving. Qiuyang Mang · openai.com
Agents can spend test-time compute by trying, observing, and revising. We derive an Elo reference for repeated sampling, then show that in a 2022 two-week coding marathon, current agents plateau within 24 hours while top humans keep improving.

Comments

Log in Log in to comment.

No comments yet.