Hamr
Local-first coding agent. Built for local models on consumer GPUs.
Quick Start
sh
npm install -g @skaft/hamr
hamrStart your inference server (Relay, llama.cpp, LM Studio, Ollama, vLLM) at its default port, then run hamr. From the TUI: /login → "Use a custom/self-hosted endpoint" to configure.
Why Hamr
- Relay-first — recommended inference gateway: model lifecycle, prefix-cache pre-warming
- Local-first — designed for your GPU, not a cloud bill
- Pi SDK — embed the agent in your own tools
- OpenAI-compatible — works with any
/chat/completionsendpoint