Join Nostr
2026-05-25 17:03:54 UTC

Roland on Nostr: My first game built entirely with a local LLM: 🐍⏪ Rewind Snake Qwen 3.6 35B A3B: ...

My first game built entirely with a local LLM: 🐍⏪ Rewind Snake
https://rolznz.github.io/rewind-snake

Qwen 3.6 35B A3B: the first time for me that a local model has felt both smart AND fast enough to actually be usable.

![](https://m.stacker.news/142522)

**Win sats!**

* 10,000 sats to the highest score on Normal Mode
* 10,000 sats to the highest score on Enhanced Mode

Play it here: [https://rolznz.github.io/rewind-snake/](https://rolznz.github.io/rewind-snake/)

*To be eligible, enter your lightning address in the high score entry. Winner announced Wednesday morning US.*

I one-shotted a snake game to see if it could... then kept pushing to see how far it'd go.

* 🔥 Wall Breaker mode
* ⏪ a "rewind time" mechanic — pay to undo your death
* 🏆 online high scores + replays
* 📱 mobile support + PWA

Making a game was just a test. What I really have been after is self-sovereign AI - building on a mid-tier laptop is now possible.

**My setup:** Qwen3.6-35B-A3B-UD-IQ3\_XXS on a single NVIDIA RTX 4060 mobile — 8GB VRAM (A mid-tier laptop graphics card)

Using llama.cpp, built from source:

./build/bin/llama-server -m Qwen3.6-35B-A3B-UD-IQ3\_XXS.gguf -ngl 99 -np 1 -fa on -ctk q8\_0 -ctv q8\_0 -c 131072 --host 0.0.0.0 --port 8088 -ncmoe 38 --no-mmap

(Any ideas how I can optimize it more? MTP was not successful for me - 15% faster but much higher memory usage)

I built the app with PI agent. It's great for local-LLM dev because it doesn't waste context.

I also connected PI agent to Alby's builder and payments skills. Now I can build payment apps, and my agent can have budgeted, private access to my wallet.

I also built a simple "second brain" — a place to brainstorm and dump ideas without being spied on.

Looking forward to more self-sovereign AI experiments!

https://stacker.news/items/1496254