A new metric to quantify capabilities of AI systems in terms of human capabilities

A team of AI researchers at startup METR is proposing a new metric to quantify the capabilities of AI systems in terms of human capabilities. They have published a paper on the arXiv preprint server describing the new metric, which they call “task-completion time horizon” (TCTH).

This article is brought to you by this site.

Reader’s Picks