PocketPod

AI Models Struggle with Historical Understanding

A new study finds top LLMs perform poorly on advanced historical questions, struggling with nuanced understanding compared to basic facts, as tested by the Hist-LLM benchmark.

-1:20

Sources