your-local-agent · open source · apple silicon

Unlimited.
Offline.
Yours.

A full AI coding assistant. On your Mac.

$ curl -fsSL https://raw.githubusercontent.com/noelps-git/your-local-agent/main/setup.sh | bash

Works on M1, M2, M3, M4 · 5 minutes · Free forever


What you get

Everything cloud AI withholds.

Unlimited

tokens & sessions

No 5-hour windows. No per-prompt billing. Run 300 prompts back-to-back at 2 AM — same cost as running zero.

$0

per month, forever

No subscription. No API bill. No surprise charges. The compute is your own M-chip — already paid for.

Offline

always available

Air-gapped lab, plane wifi, client site with no internet. Your AI works wherever you do.

Zero logs

prompts stay private

Nothing is sent to a server. No training pipeline. No one reading your code. Not even us.

No filters

unfiltered responses

Security research, exploit analysis, malware RE — no content policy blocking legitimate work.

Reads files

full repo context

Points directly at your codebase. No copy-paste. No truncation. The agent sees what you see.

The problem

You're not out of ideas.
You're out of tokens.

Real quotes from real developers, March and April 2026.

r/ClaudeAI · Apr 2026

"I literally JUST got a Pro subscription, and my very first prompt nuked my daily usage limit and apparently 13% of my total weekly limit. Are my expectations just way too high?"

— u/dev_exhausted · 847 upvotes

dev.to · Mar 2026

"By message 30 you're paying 31× what message 1 cost. Once you've burned past about 60% of the context window, the model's output quality starts dropping."

— Token Optimisation 101, DEV Community

Startup Fortune · Apr 2026

"Claude Code on Opus 4.6 with two parallel sessions hit 100% of the 5-hour limit in 20 minutes. That's not a marginal difference — it's a workflow-breaking one."

— Startup Fortune benchmark

r/LocalLLaMA

"The Pro plan imposes a strict limit of 10–40 prompts every five hours — a constraint that many developers find they exhaust within mere minutes of intensive coding sessions."

— Goose vs Claude Code thread

your-local-agent runs unlimited sessions, at zero per-token cost, with no 5-hour reset window — because the model is on your machine, not theirs.

Deep use cases

Where it actually matters.

Three industries. Three problems cloud AI literally cannot solve.

Security Research

Malware RE in an air-gapped lab

You've captured an unknown loader binary from a client endpoint. You're in an isolated network — ChatGPT is unreachable, and cloud models refuse exploit payloads outright. You need to reverse it now.

What happens → Reads your Ghidra decompile, identifies the RC4 variant, writes a Python config decryptor, extracts C2 domain. Fully offline. No content filter.
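
As a rough idea of what such a decryptor could look like, here is a minimal sketch of standard RC4 in pure Python. It assumes the textbook algorithm; a real loader's variant (modified key schedule, dropped keystream bytes, extra XOR layers) would need matching changes, and the key and blob in the usage note are placeholders, not data from any real sample.

```python
def rc4(key: bytes, data: bytes) -> bytes:
    """Textbook RC4. Encryption and decryption are the same operation."""
    if not key:
        raise ValueError("key must not be empty")

    # Key-scheduling algorithm (KSA): permute the 256-byte state with the key.
    state = list(range(256))
    j = 0
    for i in range(256):
        j = (j + state[i] + key[i % len(key)]) % 256
        state[i], state[j] = state[j], state[i]

    # Pseudo-random generation (PRGA): XOR each input byte with the keystream.
    out = bytearray()
    i = j = 0
    for byte in data:
        i = (i + 1) % 256
        j = (j + state[i]) % 256
        state[i], state[j] = state[j], state[i]
        out.append(byte ^ state[(state[i] + state[j]) % 256])
    return bytes(out)
```

Usage would look like `rc4(key_bytes, encrypted_config)`, where both arguments are pulled out of the decompiled binary by hand or by the agent. Because RC4 is symmetric, running the function twice with the same key returns the original bytes.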

Biotech / Drug Discovery

Proprietary SMILES that can't leave the lab

Your lab has 800 novel compound candidates as SMILES strings. Pasting them into any cloud AI violates your IP agreement. You need to iterate on RDKit scripts at 11 PM before a board meeting.

What happens → Reads your local .csv, fixes the RDKit TypeError, applies Lipinski Ro5 filter, outputs 307 passing candidates. Your compounds never left the machine.
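
For reference, the Lipinski Rule of Five step can be sketched in a few lines. This version assumes the strict reading (all four criteria must hold; some teams allow one violation) and takes precomputed descriptors so it runs without RDKit installed. In an RDKit script the four values would typically come from `Descriptors.MolWt`, `Crippen.MolLogP`, `Lipinski.NumHDonors`, and `Lipinski.NumHAcceptors`. The class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Ro5Descriptors:
    """The four properties the Rule of Five checks (hypothetical container)."""
    mol_weight: float   # molecular weight in daltons
    log_p: float        # octanol-water partition coefficient
    h_donors: int       # hydrogen-bond donors
    h_acceptors: int    # hydrogen-bond acceptors


def passes_ro5(d: Ro5Descriptors) -> bool:
    """Strict Rule of Five: every criterion must be satisfied."""
    return (
        d.mol_weight <= 500
        and d.log_p <= 5
        and d.h_donors <= 5
        and d.h_acceptors <= 10
    )


def filter_candidates(candidates: dict[str, Ro5Descriptors]) -> list[str]:
    """Return the identifiers (for example SMILES strings) that pass."""
    return [name for name, d in candidates.items() if passes_ro5(d)]


# Illustrative values only, not taken from any real dataset.
example = {
    "small-polar": Ro5Descriptors(180.2, 1.3, 1, 3),
    "too-heavy": Ro5Descriptors(812.0, 3.1, 4, 9),
    "too-greasy": Ro5Descriptors(420.5, 6.8, 1, 4),
}
passing = filter_candidates(example)
```

Keeping the rule separate from descriptor calculation makes the threshold logic easy to test on its own, before it touches proprietary compounds.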

Embedded / Firmware

Debugging stripped ARM at 35,000 ft

You're on a flight to a client site. The firmware is stripped — no debug symbols. You need GDB watchpoints and pretty-printers written on the fly. No wifi. No Claude. Deadline in 4 hours.

What happens → Reads your objdump output, infers FreeRTOS 10.x TCB layout, writes debug_init.gdb with watchpoints. No internet. No timer.

Setup

Three steps.

From nothing to a running local AI in under five minutes.

Step 01

Run one command

$ curl -fsSL https://raw.githubusercontent.com/noelps-git/your-local-agent/main/setup.sh | bash

Step 02

Pick your model

$ local-ai-setup

↳ auto-detected for your RAM

Step 03

Start

$ local-ai-start

↳ running at localhost:11434
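
Once the server is up, you can talk to it from a script. The sketch below is an assumption, not documented behavior: port 11434 is Ollama's default, so it guesses that the server exposes an Ollama-compatible `/api/generate` endpoint and that the model is tagged `qwen3:8b`. Check the project documentation for the actual API and model names before relying on it.

```python
import json
import urllib.request

BASE_URL = "http://localhost:11434"  # address reported by local-ai-start


def build_request(prompt: str, model: str = "qwen3:8b") -> urllib.request.Request:
    """Build a POST request for an assumed Ollama-style /api/generate endpoint."""
    payload = json.dumps(
        {"model": model, "prompt": prompt, "stream": False}
    ).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/api/generate",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(prompt: str, model: str = "qwen3:8b") -> str:
    """Send the prompt to the local server and return the generated text."""
    with urllib.request.urlopen(build_request(prompt, model), timeout=120) as resp:
        # Ollama-style servers return the completion under the "response" key.
        return json.loads(resp.read())["response"]
```

With the server running, `ask("Explain this stack trace")` should return the reply as a string. Nothing in the request leaves localhost.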


RAM	Model	Size	Speed
8 GB	Qwen3 4B	3.2 GB	~35 tok/s
16 GB (recommended)	Qwen3 8B	5 GB	~25 tok/s
24 GB	Qwen3 14B	9 GB	~20 tok/s
32 GB+	Qwen3 32B	19.5 GB	~12 tok/s
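
The table reads as a simple lookup: take the largest model whose RAM tier your Mac meets. Here is a minimal sketch of that rule. It only illustrates the table; it is not the actual logic inside `local-ai-setup`, and the function name is hypothetical.

```python
# (minimum RAM in GB, model) pairs from the table, largest tier first.
MODEL_TIERS = [
    (32, "Qwen3 32B"),
    (24, "Qwen3 14B"),
    (16, "Qwen3 8B"),
    (8, "Qwen3 4B"),
]


def pick_model(ram_gb: float) -> str:
    """Return the largest model whose RAM tier fits the given memory."""
    for min_ram_gb, model in MODEL_TIERS:
        if ram_gb >= min_ram_gb:
            return model
    raise ValueError(f"{ram_gb} GB is below the smallest supported tier (8 GB)")
```

A machine that falls between tiers, such as an 18 GB Mac, lands on the tier below it (Qwen3 8B), which leaves headroom for the OS and your editor.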

Comparison

Cloud AI vs.
your-local-agent.

	Cloud AI	your-local-agent
Token limits	Yes — resets every 5h	None. Ever.
Works offline	No	Yes — plane, lab, bunker
Prompts stored / logged	Yes	No — never leaves device
Content filtered	Yes — blocks security work	No filters
Monthly cost	$20–$200/mo	$0 forever
Reads local files directly	No — you paste manually	Yes — full repo access