Join Nostr
2026-05-20 01:23:25 UTC
in reply to

Daniel Wigton on Nostr: Just tried qwen3.6;27b with llama.cpp and a bunch of optimizations and yeah much much ...

Just tried qwen3.6;27b with llama.cpp and a bunch of optimizations and yeah much much better. 128k context instead of 64k and significantly faster than stock ollama. It fixed in 5 minutes what it couldn't do in a day.

Still need to try Gemma though.