Thus the training data didn't just contain text, but rather text where each passage ...

2026-05-01 10:15:38 UTC

Thus the training data didn't just contain text, but rather text where each passage is tagged and attributed to a particular user.

This aspect of the training data was critical in creating the illusion of talking to another person.

An LLM doesn't just predict the next text. It predicts the next text that might come from another user. You need to hard code this in to make it work well.

Leave it out and there is no conversation.

Author Public Key

npub1cp2pgntkzkpqa23rytnchggwzywyggvst9yzkgd6w8j349ef7s9shuhrnq

Seen on

wss://relay.ditto.pub

Show more details

myrmepropagandist on Nostr: Thus the training data didn't just contain text, but rather text where each passage ...