This is one of the funniest bad AI studies yet: "Evaluating large language models in ...

2025-01-16 16:06:25 UTC

This is one of the funniest bad AI studies yet: "Evaluating large language models in theory of mind tasks"

https://www.nature.com/articles/s41467-024-55628-6

They reduced an inherently abstract metacognitive ability to producing plausible text, which they benchmark, then, with staggering irony, read theory of mind into.

(1/3)

Author Public Key