Can machines ever see the world as we see it? Researchers have uncovered compelling evidence that vision transformers (ViTs), a type of deep-learning model that specializes in image analysis, can spontaneously develop human-like visual attention patterns when trained without labeled data.
Self-trained vision transformers mimic human gaze with surprising precision