Self-trained vision transformers mimic human gaze with surprising precision

Can machines ever see the world as we see it? Researchers have uncovered compelling evidence that vision transformers (ViTs), a type of deep-learning model specialized for image analysis, can spontaneously develop human-like visual attention patterns when trained without labeled data.
