A full AI coding assistant. On your Mac.
No 5-hour windows. No per-prompt billing. Run 300 prompts back-to-back at 2 AM — same cost as running zero.
No subscription. No API bill. No surprise charges. The compute is your own M-chip — already paid for.
Air-gapped lab, plane wifi, client site with no internet. Your AI works wherever you do.
Nothing is sent to a server. No training pipeline. No one reading your code. Not even us.
Security research, exploit analysis, malware RE — no content policy blocking legitimate work.
Points directly at your codebase. No copy-paste. No truncation. The agent sees what you see.
Real quotes from real developers, week of April 2026.
"I literally JUST got a Pro subscription, and my very first prompt nuked my daily usage limit and apparently 13% of my total weekly limit. Are my expectations just way too high?"
— u/dev_exhausted · 847 upvotes
"By message 30 you're paying 31× what message 1 cost. Once you've burned past about 60% of the context window, the model's output quality starts dropping."
— Token Optimisation 101, DEV Community
"Claude Code on Opus 4.6 with two parallel sessions hit 100% of the 5-hour limit in 20 minutes. That's not a marginal difference — it's a workflow-breaking one."
— Startup Fortune benchmark
"The Pro plan imposes a strict limit of 10–40 prompts every five hours — a constraint that many developers find they exhaust within mere minutes of intensive coding sessions."
— Goose vs Claude Code thread
your-local-agent runs unlimited sessions, at zero per-token cost, with no 5-hour reset window — because the model is on your machine, not theirs.
Pick a scenario and watch a real conversation unfold.
Three industries. Three problems cloud AI literally cannot solve.
From nothing to a running local AI in under five minutes.
Auto-detected at setup. Swap anytime.
| RAM | Model | Size | Speed |
|---|---|---|---|
| 8 GB | Qwen3 4B | 3.2 GB | ~35 tok/s |
| 16 GBrecommended | Qwen3 8B | 5 GB | ~25 tok/s |
| 24 GB | Qwen3 14B | 9 GB | ~20 tok/s |
| 32 GB+ | Qwen3 32B | 19.5 GB | ~12 tok/s |
| Cloud AI | your-local-agent | |
|---|---|---|
| Token limits | Yes — resets every 5h | None. Ever. |
| Works offline | No | Yes — plane, lab, bunker |
| Prompts stored / logged | Yes | No — never leaves device |
| Content filtered | Yes — blocks security work | No filters |
| Monthly cost | $20–$200/mo | $0 forever |
| Reads local files directly | No — you paste manually | Yes — full repo access |