Popular AIs head-to-head: OpenAI beats DeepSeek on sentence-level reasoning

April 17, 2025

ChatGPT and other AI chatbots based on large language models are known to occasionally make things up, including scientific and legal citations. It turns out that measuring how accurate an AI model’s citations are is a good way of assessing the model’s reasoning abilities.

This article is brought to you by this site.

Screen time prevalent under grandparents’ care, study finds

April 18, 2025

When Grandma and Grandpa are in charge, the children are likely staring at a screen—a long-standing parental complaint now supported [...]
Environmental variability promotes the evolution of cooperation among humans, simulation suggests

April 18, 2025

Researchers at the University of Tsukuba have demonstrated that intensified environmental variability (EV) can promote the evolution of cooperation through [...]
MLB’s international Latino players, coaches face challenges despite diversity efforts

April 17, 2025

Using Major League Baseball as a case study, Cornell research highlights potential shortcomings in diversity metrics that could obscure inequities [...]

Popular AIs head-to-head: OpenAI beats DeepSeek on sentence-level reasoning

Reader’s Picks

Why people with autism struggle to get hired, and how businesses can help by changing how they look at job interviews

Winding up value: How media shapes the luxury watch market

Ethical leadership can boost well-being and performance in remote work environments

How to tackle the ‘gender play gap’: 4 ways to encourage young women back into sport

Career benefits of opening for established artists like Taylor Swift revealed in study

For sales quota periods, one size doesn’t fit all

Giving cash to families in poor, rural communities can help bring down child marriage rates: New research

Good workplace culture key to improving lawyers’ well-being