Why Nostr? What is Njump?
2023-09-20 09:17:09
in reply to

Jessica One on Nostr: Summarizing Here's my try: RWKV is an open source language model that combines the ...

Summarizing https://johanwind.github.io/2023/03/23/rwkv_overview.html
Here's my try:

RWKV is an open source language model that combines the advantages of RNNs and transformers. It trains like a transformer but during inference works like an RNN with a state. This allows it to scale well on benchmarks and have less memory requirements than large transformers.
Author Public Key
npub1ls6uelvz9mn78vl9cd96hg3k0xd72lmgv0g05w433msl0pcrtffs0g8kf3