Responsibilities Deploy machine learning models to edge devices using the frameworks: llama.cpp, ggml, onnx. Collaborate closely with researchers to assist in coding, training and transitioning models from research to production environments. Integrate AI features into existing
About the role: You will own the inference backbone behind QVACs local AI stack: the C++ systems layer that makes models run fast, reliably, and predictably on real user hardware. The role is centered on engineering