Stop typing prompts. Just speak.
Your phone's voice input → instant text at your computer's cursor, in any app.
Global voice input for vibe coding.
Quick Start • Features • Comparison • Download
English | 中文
Why type when you can talk?
| | Typing | Speaking with TalkType |
|---|---|---|
| Speed | ~40 WPM | ~150 WPM |
| 200-word prompt | ~5 minutes | ~1.3 minutes |
| Hands | Both occupied | Free to browse code |
| Fatigue | RSI risk | Relaxed |
Speaking at ~150 WPM is about 3.75x as fast as typing at ~40 WPM. A complex 200-word prompt that takes 5 minutes to type? Speak it in about 80 seconds.
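The arithmetic behind those figures, as a minimal Rust sketch. The function names are illustrative only and are not part of TalkType; the WPM values are the rough averages from the table above, not measurements.

```rust
/// Seconds needed to produce `words` words at `wpm` words per minute.
fn seconds_for(words: u32, wpm: u32) -> f64 {
    f64::from(words) / f64::from(wpm) * 60.0
}

/// Ratio of two speeds, e.g. speaking WPM over typing WPM.
fn speedup(fast_wpm: u32, slow_wpm: u32) -> f64 {
    f64::from(fast_wpm) / f64::from(slow_wpm)
}
```

With the table's numbers, `seconds_for(200, 40)` is 300 seconds (5 minutes), `seconds_for(200, 150)` is roughly 80 seconds, and `speedup(150, 40)` is 3.75.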
📹 GIF coming soon, showing: open app → scan QR → speak on phone → text appears in Cursor
Help wanted: If you use TalkType, record a short demo and submit a PR!
Get started in 30 seconds:
- Download: Latest Release (macOS / Windows / Linux)
- Open TalkType on your computer → QR code appears
- Scan the QR code with your phone → web page opens (no app install)
- Speak on your phone → tap Send → text appears at your cursor ✨
⚠️ Phone and computer must be on the same WiFi.
- Add to Home Screen: in your phone's browser, choose "Add to Home Screen" for instant access (no rescan needed next time)
- Bilingual UI: switch between English and Chinese in the app settings
- 🎙️ Phone-native voice input: uses your phone's built-in STT, no model download needed
- ⌨️ Global text injection: text appears at your cursor in ANY app
- 📱 Zero install on phone: the phone client is a web page, no app needed
- 100% local network: all data stays on your WiFi, zero cloud dependency
- 📷 QR code pairing: scan and connect in seconds
- ↵ Auto-Enter: optionally press Enter after sending, perfect for chat apps
- Auto reconnect: remembers connections, auto-reconnects on same WiFi
- i18n: English and Chinese UI, auto-detects your language
- Cross-platform: macOS, Windows, Linux
- 📲 PWA support: add to phone home screen for app-like experience
Other voice input tools run Whisper locally (1–3 GB model download, CPU/GPU intensive) or use cloud APIs (latency, cost, privacy risk).
TalkType takes a fundamentally different approach: your phone IS the voice engine. Apple's, Google's, and Samsung's STT are trained on billions of data points, support 100+ languages, and are already on your device. TalkType simply bridges that to your computer over local WiFi.
Zero config. Zero cost. Zero compromise on privacy.
| | TalkType | SuperWhisper | Wispr Flow | VoiceTypr | MacWhisper |
|---|---|---|---|---|---|
| Price | Free | $9/mo | $15/mo | Free | Freemium |
| Open Source | β | β | β | β | β |
| Voice Engine | Phone native | Local Whisper | Cloud AI | Local Whisper | Local Whisper |
| Model Download | None | 1–3 GB | N/A | 1–3 GB | 1–3 GB |
| Setup Time | 30 sec | 5–10 min | 5 min | 5–10 min | 5 min |
| Languages | 100+ native | ~50 | ~30 | ~50 | ~50 |
| macOS | β | β | β | β | β |
| Windows | β | β | β | β | β |
| Linux | β | β | β | β | β |
| Privacy | LAN only | Local | ☁️ Cloud | Local | Local |
| Scenario | Description |
|---|---|
| 🧑‍💻 Vibe Coding | Speak prompts to Cursor, Copilot, Windsurf, or any AI coding tool |
| 🤖 AI Chat | Dictate to ChatGPT, Claude, Gemini, about 3x faster than typing |
| Writing | Draft emails, docs, and notes hands-free |
| 💬 Messaging | Reply in Slack, Discord, WeChat, Teams by voice |
| Search | Voice search in any app or browser |
Go to Releases and download for your platform:
| Platform | File | Notes |
|---|---|---|
| 🍎 macOS (Apple Silicon) | `TalkType_x.x.x_aarch64.dmg` | M1 / M2 / M3 / M4 |
| 🍎 macOS (Intel) | `TalkType_x.x.x_x64.dmg` | Older Intel Macs |
| 🪟 Windows | `TalkType_x.x.x_x64-setup.exe` | Windows 10 / 11 |
| 🐧 Linux | `.deb` / `.AppImage` | Ubuntu, Debian, etc. |
💡 Which Mac? Click the Apple menu → About This Mac → check "Chip". Apple M1/M2/… = `aarch64`, Intel = `x64`.
🍎 macOS
- Grant Accessibility permission on first launch: System Settings → Privacy & Security → Accessibility → enable TalkType
- The app guides you through this on first run
- Closing the window minimizes to menu bar tray (right-click → Quit)
🪟 Windows
- Allow firewall access when prompted (required for phone to connect)
- Closing the window minimizes to system tray (right-click → Quit)
🐧 Linux
- The `.deb` package handles dependencies automatically
- For `.AppImage`: run `chmod +x TalkType_*.AppImage` first
Why use phone voice input instead of local Whisper?
Phone STT engines (Apple, Google, Samsung) are trained on billions of data points, support 100+ languages natively, and are heavily optimized for mobile hardware. No model download (1–3 GB saved), no GPU required, no configuration: it just works, instantly.
Is my data safe?
Yes. All communication between your phone and computer happens over your local WiFi via WebSocket. TalkType itself sends no data outside your network: no cloud, no accounts, no telemetry.
Phone can't open the page after scanning?
- Ensure phone and computer are on the same WiFi
- Check your computer's firewall settings
- Try disabling VPN or proxy
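If the page still will not load, one quick sanity check is whether the phone's and the computer's IP addresses share a subnet. The sketch below is only an illustration: `same_subnet` is a hypothetical helper, not part of TalkType, and the /24 prefix used in the note afterwards is an assumption about a typical home router. Sharing a subnet is also not a guarantee, since some routers isolate WiFi clients from each other.

```rust
use std::net::Ipv4Addr;

/// True when both addresses fall in the same IPv4 network
/// for the given prefix length (e.g. 24 for 255.255.255.0).
fn same_subnet(a: Ipv4Addr, b: Ipv4Addr, prefix_len: u32) -> bool {
    assert!(prefix_len <= 32, "prefix length must be in 0..=32");
    // Build the netmask; /0 is handled separately because
    // shifting a u32 by 32 bits would overflow.
    let mask: u32 = if prefix_len == 0 {
        0
    } else {
        u32::MAX << (32 - prefix_len)
    };
    (u32::from(a) & mask) == (u32::from(b) & mask)
}
```

For instance, 192.168.1.23 and 192.168.1.77 match under a /24 prefix, while 192.168.1.23 and 192.168.2.5 do not, which is what you might see when the phone has joined a guest network.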
Connected but no text appearing?
- macOS: Check Accessibility permission (System Settings → Privacy & Security → Accessibility)
- Make sure an input field is focused on your computer
- Text goes to wherever your cursor currently is
┌─────────────┐     WiFi (LAN)     ┌───────────────┐
│  📱 Phone   │ ◄── WebSocket ──►  │  💻 Computer  │
│  (Browser)  │    tap to send     │ (Desktop App) │
│             │                    │       ↓       │
│  🎤 Voice   │                    │  ⌨️ Keyboard  │
│   → Text    │                    │   Injection   │
└─────────────┘                    └───────┬───────┘
                                           │
                                    Any Active App
                                  (Cursor / Claude /
                                   ChatGPT / Docs)
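The desktop half of that flow can be pictured roughly as follows. This is a sketch under stated assumptions, not TalkType's actual code: `VoiceMessage`, `KeyAction`, and `plan_injection` are hypothetical names, and the real wire format between the phone page and the desktop app is not documented here. It only shows the idea of turning one received message into keyboard actions, including the optional Auto-Enter.

```rust
/// Hypothetical message received from the phone page.
#[derive(Debug, Clone, PartialEq)]
struct VoiceMessage {
    text: String,
    auto_enter: bool,
}

/// Hypothetical actions handed to the keyboard-simulation layer.
#[derive(Debug, Clone, PartialEq)]
enum KeyAction {
    TypeText(String),
    PressEnter,
}

/// Decide what to type for one message.
fn plan_injection(msg: &VoiceMessage) -> Vec<KeyAction> {
    let text = msg.text.trim();
    if text.is_empty() {
        // Nothing to type, so do not press Enter either.
        return Vec::new();
    }
    let mut actions = vec![KeyAction::TypeText(text.to_string())];
    if msg.auto_enter {
        actions.push(KeyAction::PressEnter);
    }
    actions
}
```

Keeping the decision separate from the actual key simulation makes it easy to test without touching the OS: an empty message yields no actions, and Enter is only appended when Auto-Enter is on.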
Tech Stack:
- Desktop: Tauri 2 + Rust (Axum HTTP/WS server + keyboard simulation)
- Mobile: Pure HTML/CSS/JS PWA (served by desktop app, zero install)
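To give a feel for how a desktop app can find its LAN address and build the URL a pairing QR code would encode, here is a minimal standard-library sketch. It is an assumption-laden illustration, not the project's implementation: `local_lan_ip` and `pairing_url` are hypothetical names, the port is whatever the server happens to listen on, and the UDP "connect" trick is one common technique that may differ from what TalkType's `network.rs` does.

```rust
use std::io;
use std::net::{IpAddr, Ipv4Addr, SocketAddr, UdpSocket};

/// Ask the OS which local address it would use for outbound traffic.
/// Connecting a UDP socket sends no packets; it only selects a route.
/// 192.0.2.1 is a reserved documentation address used purely as a target.
fn local_lan_ip() -> io::Result<IpAddr> {
    let socket = UdpSocket::bind((Ipv4Addr::UNSPECIFIED, 0))?;
    socket.connect((Ipv4Addr::new(192, 0, 2, 1), 9))?;
    Ok(socket.local_addr()?.ip())
}

/// Build the URL the phone would open (hypothetical URL shape).
/// SocketAddr's Display adds brackets for IPv6 automatically.
fn pairing_url(ip: IpAddr, port: u16) -> String {
    format!("http://{}/", SocketAddr::new(ip, port))
}
```

On a machine with several interfaces (VPN, virtual adapters) the address picked this way may not be the WiFi one, which is one reason disabling a VPN can help when the phone cannot connect.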
Build from source
- Rust 1.70+ (latest stable recommended)
- Node.js 18+
- Tauri CLI:
cargo install tauri-cli --version "^2"
git clone https://github.com/puke3615/TalkType.git
cd TalkType
npm install
cargo tauri dev      # run in development mode
cargo tauri build    # build a release bundle
Output: `src-tauri/target/release/bundle/`
TalkType/
├── src/index.html            # Desktop UI (i18n, QR code)
├── src-tauri/
│   ├── src/
│   │   ├── lib.rs            # App entry & Tauri commands
│   │   ├── server.rs         # HTTP + WebSocket server (Axum)
│   │   ├── mobile.html       # Mobile PWA page (i18n, served at /)
│   │   ├── keyboard.rs       # Keyboard simulation (per-platform)
│   │   ├── network.rs        # LAN IP discovery
│   │   └── qrcode_gen.rs     # QR code generation
│   └── tauri.conf.json
└── .github/workflows/        # CI/CD (macOS + Windows + Linux)
MIT: free to use, modify, and distribute.
If TalkType saves you time, give it a ⭐!
Built with ❤️ using Tauri 2 + Rust