wouldn't it be more effective to get the LLM to output as many tokens as possible? ...

2026-05-15 08:03:14 UTC

wouldn't it be more effective to get the LLM to output as many tokens as possible? More output tokens, more inference runs 😸🔥
The input is usually padded 🤔 (at least with the vanilla transformer I used)

Author Public Key

npub1csvskf9xmsjxl8n6ry9a8xecakge75fa9ju90rdeplk3tazzaqlqqrkr4v

Show more details

bleeptrack on Nostr: wouldn't it be more effective to get the LLM to output as many tokens as possible? ...