cmd on Nostr:
https://youtu.be/CbO2YosyTt4
There's some really great commentary in here about the current state of AI:
* LLMs tend to confuse concepts that have similar names or spellings. Not a big deal for basic tasks, but terrible for scientific and academic work.
* If a person makes such a mistake, you only have to correct them once. However, an LLM will keep making the same mistake until it is retrained.
* These models do not have a real sense of understanding of what they are doing. The LLM will regurgitate text with uncanny accuracy when it comes to language and dialogue. But there is no deeper thinking going on (which the LLMs admit).
* GPT-5 smoked all the other models, even Grok 4 (but I don't think she was using SuperGrok).
Overall a really fascinating no-bs test with a fair conclusion.
Published at
2025-08-28 15:34:02 UTC
Event JSON
{
"id": "8fd604fb455a45ce7549bdebaf7b22d4e1f93306c467dfc6df72c22db737acdd",
"pubkey": "4229c21f0101abc3ba45233e176e975fa9e671bb18a6722bdf7726ba25445ff9",
"created_at": 1756395242,
"kind": 1,
"tags": [],
"content": "https://youtu.be/CbO2YosyTt4\n\nThere's some really great commentary in here about the current state of AI:\n\n* LLMs have trouble with confusing concepts that have similar words or spelling. Not a big deal with basic tasks, but terrible for scientific and academic work.\n\n* If a person makes such a mistake, you only have to correct them once. However an LLM will continue to make this mistake until it is retrained.\n\n* These models do not have a real sense of understanding of what they are doing. The LLM will regurgitate text with uncanny accuracy when it comes to language and dialogue. But there is no deeper thinking going on (which the LLMs admit).\n\n* GPT5 smoked all the other models, even Grok 4 (but I don't think she was using Super Grok).\n\nOverall a really fascinating no-bs test with a fair conclusion. \n\n",
"sig": "f0f3cdfb16e043efd83082aece78b7f84afa62f956777642e943bdde2471abc0598c7e91aad96158fb719dd3df4f7e766e0dfe5c603f0b545cdd7bde20ab4a06"
}