nighteous on Nostr: Just wondering on how does one start with writing an inference engine. My plan is to ...Just wondering on how does one start with writing an inference engine.My plan is to write it using GGUF file formats but instead of layer parallelism that llama.cpp does, I wanna do tensor parallelism for better speeds