IMO is 3 problems per 4.5 hours <-> 1.5 hours per problem so we are… right on schedule?
METR
METR19.3.2025
When will AI systems be able to carry out long projects independently? In new research, we find a kind of “Moore’s Law for AI agents”: the length of tasks that AIs can do is doubling about every 7 months.
4,28K