A team of AI researchers at startup METR is proposing a new metric to quantify the capabilities of AI systems in terms of human capabilities. They have published a paper on the arXiv preprint server describing the new metric, which they call “task-completion time horizon” (TCTH).
A new metric to quantify capabilities of AI systems in terms of human capabilities
Reader’s Picks
-
Eventgoers’ live experiences are shaped by media technologies like social media, whether used in the moment or not, and memory [...]
-
Language learners often assume that using rare, complex vocabulary will make their speech sound more fluent. Research suggests that there [...]
-
Lead researchers Nicole Hiekel from the Max Planck Institute for Demographic Research (MPIDR) and Katia Begall from the Radboud Universiteit [...]