Key findings: 45% of all AI answers had at least one significant issue. 31% of ...

2025-10-22 11:51:47 UTC

Key findings:
45% of all AI answers had at least one significant issue.
31% of responses showed serious sourcing problems – missing, misleading, or incorrect attributions.
20% contained major accuracy issues, including hallucinated details and outdated information.
Gemini performed worst with significant issues in 76% of responses, more than double the other assistants, largely due to its poor sourcing performance.

https://www.bbc.co.uk/mediacentre/2025/new-ebu-research-ai-assistants-news-content

Author Public Key

npub1h4exh5tk775lyhh9s46f7jzryqe7wxfayy4ngxwr2khn3w2tpscqrwhnn5

Seen on

wss://relay.momostr.pink

Show more details

Kevin Marks on Nostr: Key findings: 45% of all AI answers had at least one significant issue. 31% of ...