niconiconi on Nostr:
It happened much faster than I expected.
> "BitNet demonstrated the viability of using binary and ternary weights in language models, successfully scaling up to 3 billion parameters while maintaining competitive performance."