Join Nostr
2026-05-19 18:10:43 UTC
in reply to

Daniel Wigton on Nostr: What are you running? I am in the process of switching from ollama to lama.cpp to see ...

What are you running? I am in the process of switching from ollama to lama.cpp to see if I can get at bit more speed with longer context out of qwen3.6:27b

I have to go to a 4bit kv cache to get it done though.