DeepSeek thinks, plans, and drives Claude Code through your project
so you don't babysit every prompt.
Claude Code can read your codebase, write code, run builds, fix errors. It's remarkably capable. But here's the catch: you are still the one deciding what to do next. You break the task into steps, you prompt for each piece, you review and redirect. Even with plan mode and task lists, you're babysitting.
supervis puts DeepSeek between you and Claude Code as a technical lead. You describe what you want once. supervis handles the rest.
One prompt. supervis took it from there: planned the approach, had Claude Code scaffold the project, build the UI, write the logic, run the build, and deploy to Cloudflare Pages. No human intervention between start and live URL.
Result: markdown-notes-31o.pages.dev · source code · Full screenshots in assets/ (demo_1 through demo_7)
DeepSeek tells Claude Code what to do. Claude Code explores the codebase, writes code, runs builds, and reports back. DeepSeek reviews the results and decides the next step. DeepSeek doesn't read your files directly. It directs and decides, Claude Code does the hands-on work. This keeps the supervisor's token usage low.
The loop keeps going until the entire task is done. You watch it work in real time, and step in only when you want to.
No more "can I edit this file?" prompts either. supervis runs Claude Code with --permission-mode bypassPermissions, so it reads, writes, builds, and tests without stopping to ask at every step. Under the hood, every task is a call to:
claude -p <prompt> --output-format stream-json --verbose --permission-mode bypassPermissions
You see everything Claude Code does: every file read, every edit, every shell command, live in the TUI. Full transparency, zero interruptions.
Claude Code is great. You can give it plan mode, task lists, detailed prompts. But you're still the one leading the session. You check in after each step, redirect when something goes off track, decide what to tackle next, and prompt again when it stalls. That's fine for small tasks. For anything bigger, you're babysitting.
You describe the end goal once, and it drives Claude Code through the entire thing:
You: add JWT authentication with a login endpoint, protect all routes, make sure the build passes
DeepSeek: thinking...
┌─ Claude Code Read project structure and check for existing auth...
│ Read: src/routes/index.ts
│ Read: package.json
│ No existing auth found.
└─ done (4 tool calls)
DeepSeek: No auth layer yet. Starting with JWT middleware, then login
endpoint, then route protection.
┌─ Claude Code Create JWT middleware and wire it up...
│ Write: src/auth/middleware.ts
│ Write: src/auth/login.ts
│ Edit: src/routes/index.ts
│ ↳ Bash: npm run build
│ Build passed.
└─ done (12 tool calls)
DeepSeek: Done. POST /auth/login returns a token, all routes
verify it. Build is clean.
If the build had failed, supervis would have told Claude Code to fix the errors and try again, without you typing a single follow-up. It keeps going until the task is actually done.
Less technical knowledge? supervis handles the decisions you wouldn't know to make. More technical knowledge? You write better prompts and supervis becomes a serious force multiplier.
pipx install supervisOr ask Claude Code: "install supervis from github.com/arikusi/supervis"
You need two things:
- Claude Code installed (subscription is enough, no Anthropic API key needed)
- A DeepSeek API key
supervis calls Claude Code as a local subprocess, not through the API. DeepSeek handles the planning via its own API, which is remarkably cheap for what it delivers.
cd your-project
supervisFirst run will ask for your DeepSeek API key and save it.
Quick start:
pipx install supervis && cd your-project && supervissupervis is only as good as the context it gets. A blank project works, but a project with a .supervis/SUPERVIS.md works significantly better:
mkdir .supervis
cat > .supervis/SUPERVIS.md << 'EOF'
Tech stack: Next.js 15, TypeScript, PostgreSQL, Tailwind CSS.
Follow the plan in PLAN.md.
Always run `npm run build` after making changes.
Use the existing auth patterns in src/lib/auth.
EOFThink of it like onboarding a new developer. The more context you give (tech stack, conventions, existing patterns, a plan document), the fewer wrong turns supervis takes. Setting up relevant MCP servers and environment variables for your project also helps Claude Code do its job better.
One important thing: you're talking to a supervisor, not a code editor. supervis will delegate everything to Claude Code. Frame your prompts that way.
Example prompts:
Have Claude build me a personal portfolio site. I'm a frontend developer based
in Berlin, 3 years of experience, React and TypeScript. Include an about page,
project showcase, and contact form. Keep it clean and modern.
Have Claude set up a REST API for a todo app. Express, TypeScript. CRUD
endpoints, input validation, error handling. Have it write tests and run
them before finishing.
Have Claude read through this project, understand the architecture, then
add dark mode. Nothing should break. Run the build when done.
You tell supervis what you want. supervis tells Claude Code how to build it.
| Command | What it does |
|---|---|
/model chat |
DeepSeek-chat with thinking, best quality (default) |
/model chat-fast |
DeepSeek-chat without thinking, faster |
/model reasoner |
DeepSeek-reasoner, maximum reasoning, 64K output |
/status |
Model, cost, uptime, message count |
/budget |
Cost vs. budget limit |
/export md or json |
Export conversation to file |
/undo |
Git stash or revert last changes |
/update |
Check for new supervis version |
/reasoning |
Toggle DeepSeek thinking/reasoning display |
/queue |
Show queued messages |
/cancel |
Cancel queued messages (/cancel N for specific) |
/reset |
Clear session and start fresh |
Ctrl+Z interrupts the running agent. Ctrl+Q quits. Up/down arrows cycle through input history.
TOML config, layered: built-in defaults → ~/.config/supervis/config.toml → .supervis/config.toml → environment variables.
Example global config
api_key = "sk-..."
model = "deepseek-chat"
thinking = true
[behavior]
max_cost = 1.00
shell_timeout = 15
claude_timeout = 300
truncation_limit = 4000Per-project override
model = "deepseek-reasoner"
[behavior]
max_cost = 2.00Environment variables: DEEPSEEK_API_KEY, SUPERVIS_MODEL, SUPERVIS_THINKING
Cost budget: Set max_cost to cap spending. supervis warns at 80% and stops at 100%.
DeepSeek pricing: $0.28/1M input · $0.028/1M cached · $0.42/1M output. How much you spend depends on the task. Simple changes take fewer turns, complex features take more. The status bar tracks cost in real time so there are no surprises.
- It's not magic. Vague prompts get vague results. Be specific about what you want.
- Claude Code runs with
bypassPermissions(explained above). It edits files without asking. That's intentional, but be aware. - DeepSeek only, for now. It works well. Contributors are welcome to add other providers.
- No session persistence yet. Closing supervis loses the conversation.
- Large monorepos benefit from a focused
.supervis/SUPERVIS.md. Without guidance, supervis may wander.
Event-driven, async. Business logic emits typed events through an EventBus. The Textual TUI subscribes and renders. Zero coupling between logic and UI.
Modules
supervisor/
app.py — Textual App, layout, key bindings, event bridge
orchestrator.py — async message loop, drives the agent
deepseek.py — DeepSeek API client, streaming, agent loop
claude.py — Claude Code subprocess, stream-json parsing
session.py — Session + CostTracker dataclasses
events.py — EventBus + typed event definitions
commands.py — slash command registry
tools.py — tool definitions for DeepSeek
version_check.py — PyPI update checker
config.py — TOML config (global + per-project + env vars)
memory.py — conversation summarization
prompts.py — system prompt
widgets/ — OutputLog, InputBar, StatusBar
Issues and PRs welcome at github.com/arikusi/supervis. Use Discussions for questions and ideas.
MIT — built by arikusi