node-llama-cpp, GGUF) and executes the actual code changes,
so routine work costs nothing in API fees.
The idea: premium plans, local executes
A premium model is great at planning. The Forge is great (and free) at doing. Orkestral splits the job so you pay only for the thinking, not the typing.Fast Apply
The Forge pairs with Orkestral’s own Fast Apply, a deterministic engine that merges edits into your files:
Fast Apply only touches the changed lines, it never rewrites the whole file, and it
needs no external service.
$0 by design
Offline and free
Because the model is bundled and runs locally, executing changes uses no API
credits and works with no internet connection. A cost view shows how many runs
were resolved locally vs. escalated to a premium model.
When it escalates
If the Forge can’t apply a change confidently, Orkestral escalates once to a premium model as a fallback, so correctness is never sacrificed for cost.The Forge model ships inside the installer, so it’s ready on first launch with
no extra download.