Popular AIs head-to-head: OpenAI beats DeepSeek on sentence-level reasoning

ChatGPT and other AI chatbots based on large language models are known to occasionally make things up, including scientific and legal citations. It turns out that measuring how accurate an AI model’s citations are is a good way of assessing the model’s reasoning abilities.

This article is brought to you by this site.

Reader’s Picks