Just tried qwen3.6;27b with llama.cpp and a bunch of optimizations and yeah much much ...

2026-05-20 01:23:25 UTC

Just tried qwen3.6;27b with llama.cpp and a bunch of optimizations and yeah much much better. 128k context instead of 64k and significantly faster than stock ollama. It fixed in 5 minutes what it couldn't do in a day.

Still need to try Gemma though.

Author Public Key