Join Nostr
2026-04-15 21:19:57 UTC

Miguel Afonso Caetano on Nostr: RT @HedgieMarkets 🦔A study from Mass General Brigham tested 21 AI models on ...

RT @HedgieMarkets
🦔A study from Mass General Brigham tested 21 AI models on medical diagnosis tasks. For differential diagnosis with incomplete patient information, all models had error rates above 80%. With more complete data, error rates fell below 40%, and the best performers reached 90% accuracy. The models consistently narrowed to a single diagnosis rather than suggesting a range of possibilities. One in three US adults has used AI chatbots for medical advice in the past year. Google and Amazon are both developing dedicated medical chatbots.

My Take
An error rate above 80% for differential diagnosis with incomplete information describes the precise scenario most people are in when they turn to AI with a health concern. Nobody walks in with a complete patient file. They describe symptoms, worry about something specific, and ask what it might be. That's exactly when these models fail most.

The way these models fail is as important as how often. They narrow to a single confident answer rather than offering a range of possibilities. When a doctor says it could be one of several things, that's honesty about uncertainty. When an AI says it's probably this one thing, it reads as authority. I've covered the cognitive surrender research showing people accept AI outputs without scrutiny 73% of the time even when the AI is wrong.

Confident wrong answers in a medical context are a different order of problem than confident wrong answers about anything else. Google and Amazon are both racing to release dedicated medical chatbots into this environment while disclaiming clinical responsibility, and I think that deserves serious regulatory attention before it causes widespread harm.

Hedgie🤗

Link to study: https://massgeneralbrigham.org/en/about/newsroom/press-releases/ai-chatbot-lacks-clinical-reasoning