Use your favorite coding agent with the model you want.

Works with Claude Code Codex Gemini CLI OpenCode Amp Pi Oh My Pi GitHub Copilot Grok CLI

~/project aivo zsh
$ 
Bring your own keys
OpenRouter, Vercel, Fireworks, MiniMax, DeepSeek, Bedrock.
Mix any agent & model
Kimi in Claude Code. Gemini in Codex. Opus in Pi.
Local by design
Keys encrypted on disk. Traffic goes direct.

Install aivo with script or package managers.

curl -fsSL https://getaivo.dev/install.sh | bash
irm https://getaivo.dev/install.ps1 | iex
brew install yuanchuan/tap/aivo

Run a coding agent.

# or codex, gemini, opencode, pi, amp, grok aivo claude

Bring your own keys

Add API keys from any provider you trust. Keys are encrypted and stored locally, so you can manage them and switch anytime.

# OpenRouter, Vercel AI Gateway, # OpenCode Go, DeepSeek, CloudFlare, # Google AI Studio, Kilo Gateway, # Amazon Bedrock, Cursor, # more... aivo keys add

The picker covers most providers. A few work differently:

GitHub Copilot

If you have a GitHub Copilot subscription, you can add it as a provider in aivo. (Be aware of the GitHub Copilot charges by request count, not by token usage)

# choose GitHub Copilot aivo keys add

Codex & Claude Code login

Log in once, keep multiple accounts in one place, and switch without signing in and out. Each subscription only works with its own agent — a Codex login for Codex, a Claude login for Claude Code.

# choose OpenAI Codex, Claude Code aivo keys add

Cursor subscription

Cursor permits programmatic use through the Cursor SDK and CLI, so aivo bridges it to any agent like a BYOK key.

# choose Cursor from the picker aivo keys add # then run a Cursor model in any agent aivo pi -m composer-2.5 aivo claude -m composer-2.5

Ollama

Connect to Ollama for models on your machine or its cloud service. If a model isn't present, aivo asks to pull it before use.

# choose Ollama aivo keys add # chat with an ollama model aivo chat -m ministral-3:3b

Other local servers — LM Studio, llama.cpp, etc. — work the same way: add their OpenAI-compatible endpoint as a provider.

Export & import

Move keys between machines through a single password-encrypted file. OAuth logins and GitHub Copilot are skipped by default — they're machine-bound and won't transfer cleanly.

# prompts for a password aivo keys export ~/keys.aivo # same password on the other machine, or pass a URL aivo keys import ~/keys.aivo aivo keys import https://example.com/keys.aivo # non-interactive (CI / scripts) aivo keys export ~/keys.aivo --password-stdin <<< "my secret"

Other commands

# list all saved keys aivo keys # quickly add a key with options aivo keys add --base-url https://openrouter.ai/api --key sk-xxx # activate a key to use aivo keys use aivo keys use mykey # print saved data of a key aivo keys cat # edit a saved key aivo keys edit # health-check aivo keys ping # for more options aivo keys --help

Run a coding agent

Launch a coding agent with any provider or model you want. All extra arguments are passed through to the underlying tool.

Claude Code Anthropic's official terminal coding agent.
aivo claude

Use a saved API key with -k, or omit the value to open the key picker. Pass -k alone with -m to pick both at once.

aivo claude -k openrouter aivo claude -k copilot aivo claude -k

Pin a model with -m, or omit the value to open the model picker. Once a model is applied, aivo claude remembers it for the next run.

aivo claude -m moonshotai/kimi-k2.5 aivo claude -m

Per-slot models Claude Code runs multiple models in a single session — haiku/sonnet/opus slots plus reasoning and subagent overrides. Pin each slot to a different model, or pass a slot flag without a value to open the picker for that slot.

aivo claude --sonnet-model deepseek-v4-pro --haiku-model deepseek-v4-flash aivo claude --reasoning-model gpt-5.4 --subagent-model claude-haiku-4-5 # open the picker for one slot aivo claude --opus-model
Codex OpenAI's official Codex CLI.
aivo codex

Use a saved API key with -k, or omit the value to open the picker.

aivo codex -k vercel aivo codex -k

Pin a model with -m, or omit the value to open the model picker.

aivo codex -m xiaomi/mimo-v2-pro aivo codex -m

Codex desktop app (macOS only) Launch the native Codex with the same backend. The app's in-app model picker only shows names starting with gpt- or claude-; other models still work, but won't appear there.

aivo codex-app aivo codex-app -k
Gemini CLI Google's official Gemini CLI.
aivo gemini

Use a saved API key with -k, or omit the value to open the picker.

aivo gemini -k google aivo gemini -k

Pin a model with -m, or omit the value to open the model picker.

aivo gemini -m gemini-2.5-pro aivo gemini -m
OpenCode An open-source coding agent.
aivo opencode

Use a saved API key with -k, or omit the value to open the picker.

aivo opencode -k openrouter aivo opencode -k

Pin a model with -m, or omit the value to open the model picker.

aivo opencode -m z-ai/glm-4.7 aivo opencode -m
Amp Sourcegraph's coding agent.

Install the plugin first Amp support ships as a separate plugin. Install it once, then run it like any other agent.

aivo plugins install github:yuanchuan/aivo-amp aivo amp

Use a saved API key with -k, or omit the value to open the picker.

aivo amp -k openrouter aivo amp -k

Pin a model with -m, or omit the value to open the model picker.

aivo amp -m gpt-5.4 aivo amp -m

Per-mode models Amp routes between rush, smart, deep, and large modes. Pin each mode to a different model, or pass a mode flag without a value to open the picker for that mode.

aivo amp --smart-model claude-sonnet-4.6 --rush-model claude-haiku-4-5 aivo amp --deep-model gpt-5.4-thinking --large-model deepseek-v4-pro # pin the starting mode, or omit the value to open the mode picker aivo amp --mode deep aivo amp --mode
Pi A terminal coding agent from the pi-mono toolkit.
aivo pi

Use a saved API key with -k, or omit the value to open the picker.

aivo pi -k cursor aivo pi -k

Pin a model with -m, or omit the value to open the model picker.

aivo pi -m composer-2.5 aivo pi -m

Transparent mode Because Pi supports multiple upstream protocols, by default aivo puts its local router in the middle and normalizes the stream. Use --transparent to hand Pi the URL and the right API type so it talks to the provider directly.

aivo pi --transparent -k openrouter
Oh My Pi A terminal coding agent built on pi-mono.

Install the plugin first omp support ships as a separate plugin. Install it once, then run it like any other agent.

aivo plugins install github:yuanchuan/aivo-omp aivo omp

Use a saved API key with -k, or omit the value to open the picker.

aivo omp -k openrouter aivo omp -k

Pin a model with -m, or omit the value to open the model picker.

aivo omp -m deepseek-v4-pro aivo omp -m
GitHub Copilot GitHub's official terminal coding agent.

Install the plugin first GitHub Copilot CLI support ships as a separate plugin. Install it once, then run it like any other agent.

aivo plugins install github:yuanchuan/aivo-copilot aivo copilot

Use a saved API key with -k, or omit the value to open the picker.

aivo copilot -k openrouter aivo copilot -k

Pin a model with -m, or omit the value to open the model picker.

aivo copilot -m claude-sonnet-4.6 aivo copilot -m
Grok CLI An open-source coding agent for the Grok API.

Install the plugin first Grok CLI support ships as a separate plugin. Install it once, then run it like any other agent.

aivo plugins install github:yuanchuan/aivo-grok aivo grok

Use a saved API key with -k, or omit the value to open the picker.

aivo grok -k openrouter aivo grok -k

Pin a model with -m, or omit the value to open the model picker.

aivo grok -m gpt-5.4 aivo grok -m

Quick start

Without a tool name, aivo run opens an interactive picker of agents and plugins, with your last choice selected by default.

aivo run

Launch aliases

Pin a coding agent and its usual flags under one name, then launch the whole thing with aivo <name> (or the long form, aivo run <name>). Inline flags at run time override the saved ones — handy for locking in the per-slot model setups you'd otherwise retype every session.

# Claude Code with separate models per slot aivo alias work claude -k openrouter \ --opus-model deepseek/deepseek-v4-pro \ --sonnet-model deepseek/deepseek-v4-flash \ --haiku-model deepseek/deepseek-v4-flash \ # Amp with separate models per mode, pinned to deep mode aivo alias task amp -k openrouter \ --smart-model openai/gpt-5.5 \ --deep-model anthropic/claude-opus-4.7 \ --rush-model openai/gpt-5.4 \ --large-model deepseek/deepseek-v4-pro # run them aivo work aivo task

Models

Once you add a provider and set up the key, you can use the models command to see the available models the provider offers. Most providers provide the model list through API, so aivo fetches it on demand and caches it for later use.

# active key's models, or pick one aivo models aivo models -k openrouter # filter by name aivo models -s free aivo models -k openrouter -s claude # force refresh the cached list aivo models --refresh

JSON output

--json prints the raw upstream model list for scripts. The shape depends on the provider — pipe to jq to explore and filter.

aivo models --json aivo models -k openrouter --json | jq

Model aliases

If a model name is too long, you can give it a short alias. Model aliases are accepted anywhere -m/--model works.

aivo alias fast=claude-haiku-4-5 aivo alias mimo xiaomi/mimo-v2-pro # use it aivo claude -m fast aivo chat -k vercel -m mimo # list and remove aivo alias aivo alias rm fast

Hugging Face

Run any open-weight GGUF directly from a Hugging Face repo. The first time you reference a model, aivo downloads the file to ~/.config/aivo/cache/huggingface and serves it through a bundled llama-server. Subsequent runs reuse the cached file.

The hf: prefix and full https://huggingface.co/… URLs are accepted anywhere a model name works — in aivo chat, a coding agent's -m, the bare aivo shortcut, etc.

aivo hf:Qwen/Qwen2.5-0.5B-Instruct-GGUF aivo https://huggingface.co/allenai/Olmo-3-1025-7B

Pin a quant

GGUF repos usually publish several quantizations. Append :<quant> (e.g. Q5_K_M, Q4_K_M) to pick a specific file; if you don't, aivo prompts you to choose the first time it downloads.

aivo chat hf:bartowski/Llama-3.2-3B-Instruct-GGUF:Q5_K_M aivo claude -m hf:bartowski/Llama-3.2-3B-Instruct-GGUF:Q4_K_M

Use in a coding agent

Pass hf: wherever -m/--model is accepted. Combine with the per-slot flags for Claude Code and Amp to mix local and remote models in the same session.

aivo claude -m hf:Qwen/Qwen2.5-0.5B-Instruct-GGUF aivo pi -m hf:Qwen/Qwen2.5-0.5B-Instruct-GGUF # mix: remote opus, local sonnet aivo claude \ --opus-model claude-opus-4.7 \ --sonnet-model hf:Qwen/Qwen2.5-0.5B-Instruct-GGUF

Re-open a cached model

A bare hf: opens a picker over the local cache — useful once you've pulled a few models and don't want to retype the repo path.

aivo chat hf: aivo claude hf:

Manage the local cache

aivo hf lists the cached repos; --verbose expands each repo to show every downloaded quant. The rest are housekeeping commands.

# list cached repos aivo hf aivo hf --verbose # pre-pull a model (handy before going offline) aivo hf pull hf:Qwen/Qwen2.5-0.5B-Instruct-GGUF # delete one quant or a whole repo aivo hf rm <repo> --quant Q5_K_M aivo hf rm <repo> --all -y # wipe the whole cache aivo hf clean -y

Chat

aivo chat is the built-in coding agent in your terminal. It reads and edits files and runs shell commands, each behind a permission prompt. Like the rest of aivo, it works with any model you've added — from a frontier API to a local GGUF.

aivo chat running in a terminal: a prompt, the model's reply, and a status line showing the active model, working directory, and token usage

Full-screen TUI

A streaming, full-screen agent in your terminal. The selected model is remembered per saved key, and the exact session resumes later — transcript, edits, and all.

aivo chat aivo chat -m gpt-5.4 # resume a saved session aivo chat --resume aivo chat --resume last # open key and model pickers if you forget the names aivo chat -k aivo chat -m

Tools & sandbox

The agent reads, writes, and edits files and runs shell commands. Every tool call is gated by a permission prompt, and commands run inside an isolated workspace sandbox (macOS, and Linux via Landlock) that asks before reaching outside it.

Agent profiles

Save a reusable role as a Markdown file in ~/.config/aivo/agents/<name>.md — YAML frontmatter (a description, optional model and tools scope) over a body that becomes its system prompt. Launch chat as one with --agent, switch in-session with /agent, or let the agent delegate a scoped task to one mid-session.

aivo chat --agent reviewer

Attach files

Queue text files or images for the agent to read. Pass --attach as many times as you need, or paste an image straight from the clipboard with Ctrl+V inside chat.

aivo chat --attach README.md aivo chat --attach README.md --attach screenshot.png

One-shot prompts

Send one prompt and exit. The shortcut is a bare quoted string — aivo "..." rewrites to aivo chat -p "...". Use -p/--prompt when you also want flags.

aivo "pro tips for git" aivo -p "pro tips for git" -m gpt-5.4 # type interactively, Ctrl-D to send aivo -p

Shell pipelines

Pipe anything into aivo. It reads stdin, adds your prompt, and sends it to the model.

git diff | aivo -p "Write a commit message" cat error.log | aivo -p "Find the root cause" cat error.log | aivo -p

Serve

Expose your active provider as a local OpenAI-compatible endpoint. Any tool that speaks the OpenAI API can use it — VS Code extensions, Python scripts, anything.

Start the server

Model aliases are resolved at the server — clients can post {"model": "fast", ...} and aivo rewrites it to the real upstream model before forwarding. The alias list also shows up in /v1/models.

aivo serve aivo serve --port 8080 aivo serve --host 0.0.0.0

Failover

If a request hits a rate limit (429) or server error (5xx), aivo retries with the next saved key automatically.

aivo serve --failover

Logging

Log every request and response. Pipe to jq for readable output, or write to a file.

aivo serve --log | jq . aivo serve --log /tmp/requests.jsonl

Auth, CORS & timeout

aivo serve --auth-token aivo serve --auth-token my-secret aivo serve --cors aivo serve --timeout 60

Try it with curl

curl http://localhost:24860/v1/chat/completions \ -H "Content-Type: application/json" \ -d '{"model": "gpt-4o", "messages": [{"role": "user", "content": "hello"}]}'

Logs & Stats

One feed across aivo's own events (chat, run, serve), native CLI sessions (claude, codex, gemini, pi, opencode), and amp threads — read from each tool's on-disk session files and aivo's local SQLite. Scoped to the current project by default.

View logs

# recent activity in the current project aivo logs # every project on this machine aivo logs --all

Filter and search

Narrow by source, model, key, time, or text. --errors shows only failures.

aivo logs --by chat -n 5 aivo logs --by claude --errors aivo logs --by native aivo logs --model glm-4.7 aivo logs -s "rate limit" aivo logs --since 24h aivo logs --json

Share

Publish a session via a tunneled viewer URL — useful for showing a teammate a bug repro, an agent transcript, or an interesting model run.

The server is just a bridge — it forwards viewer requests to your machine over the tunnel and keeps no copy of the session.

# pick a session in the current project aivo logs share # share a specific row by id prefix aivo logs share 1335c631 # pick from every project on this machine aivo logs share --all # follow updates and open in the browser aivo logs share --live --open # skip redaction (be sure) aivo logs share --no-redact

Usage stats

Aggregates token counts from aivo chat, Claude Code, Codex, Gemini, OpenCode, Amp, and Pi by reading each tool's native data files.

aivo stats aivo stats claude aivo stats chat

Filter and format

# raw numbers for scripts aivo stats -n # filter by provider aivo stats -s openrouter # last N units (m, h, d, w) aivo stats --since 7d aivo stats claude --since 24h # show all models aivo stats -a # bypass cache aivo stats -r

Example output

$ aivo stats ──────────────────────────────────────────────────── 408M tokens · 14B cached · 5.0K sessions · 77 models By tool sessions tokens claude 4.2K 295M ████████████████████ codex 256 87M █████▉ opencode 166 10M ▊ chat 91 8.0M ▌ gemini 204 4.2M ▎ pi 85 3.8M ▎ By model tokens gpt-5.4 75M ████████████████████ minimax-m2.5 63M ████████████████▊ claude-sonnet-4.6 53M ██████████████▏ claude-opus-4.6 40M ██████████▋ minimax-m2.7 38M ██████████▏ claude-opus-4.7 31M ████████▏ laguna-m.1 22M █████▉ claude-haiku-4-5-20251001 14M ███▉ kimi-k2.5:cloud 11M ██▉ claude-sonnet-4.5 6.6M █▊ gpt-5.3-codex 6.1M █▋ glm-4.7-free 6.0M █▋ glm-4.7 5.0M █▍ mercury-2 4.3M █▏ deepseek-v4-flash 3.7M █ gpt-5.5 3.6M █ kimi-k2.5 2.0M ▌ starter 1.8M ▌ gpt-5.1-codex 1.5M ▍ kat-coder-pro-v1 1.4M ▍ others (57 models) 9.8M ██▋

Starter models

Try aivo without setting up a provider first. aivo-starter bundles a couple of models for learning and testing — enough to feel how aivo works before you bring your own key.

  • aivo/starter DeepSeek V4 Flash · official API A general-purpose default for everyday tasks.
  • poolside/laguna-m.1 Laguna M.1 · Poolside Tuned for code generation and editing.

Try it

Once aivo is installed, you can use it straight away.

aivo "tell me a short story" aivo codex

Limits

The starter pool has a finite daily budget — intended for interactive coding-agent sessions. For regular or automated use, please add your own API key.

Not for automated scripts batch jobs benchmarks third-party clients OpenClaw

Update

aivo update detects whether you installed via Homebrew or the install script and updates accordingly. Failed updates are rolled back automatically; a manual rollback is available if a new version misbehaves.

aivo update # force update even if installed via a package manager aivo update --force # restore the previous version from the last backup aivo update --rollback

Sync model data

Model limits come from models.dev and ship baked into each release. --sync-model-data refreshes that snapshot between releases without touching the binary.

aivo update --sync-model-data

FAQ

Can I use my Claude or ChatGPT subscription with other agents?

No. A Claude Code or Codex subscription login only works inside its own agent, by each subscription's terms. Only your own API keys (BYOK) carry across every agent.

Can I use aivo outside of coding agents?

aivo serve exposes your active provider as a local OpenAI-compatible endpoint, so any tool that speaks the OpenAI API — editor extensions, scripts, your own apps — can talk to it.

Where are my keys stored?

On your machine, never uploaded. Keys are encrypted in ~/.config/aivo/ and locked behind your OS keychain, and every request goes straight from your machine to the provider.

Does aivo send my prompts or usage anywhere?

It depends on the path. With your own provider keys, requests go straight from your machine to the provider — aivo runs no server in between and logs nothing. The starter pool and aivo-pro run through aivo's gateway, which records usage — the model, token counts, and a short snippet of recent input (for abuse detection) — to enforce limits and handle billing. Aivo doesn't sell your data or train on it.