Join Nostr
2025-03-03 17:59:36 UTC

Bartosz Milewski on Nostr: I had a long conversation with ChatGPT testing my understanding of attention patterns ...

I had a long conversation with ChatGPT testing my understanding of attention patterns in LLM. (I hope it wasn't lying to me.) Getting answers to questions like "how many attention heads does GPT-3 use" was extremely useful.