
puke3615/TalkType


TalkType

Stop typing prompts. Just speak.
Your phone's voice input → instant text at your computer's cursor → in any app.
Global voice input for vibe coding.


Quick Start • Features • Comparison • Download

English | 中文


⚡ 3x Faster Than Typing

Why type when you can talk?

| | Typing | Speaking with TalkType |
|---|---|---|
| Speed | ~40 WPM | ~150 WPM |
| 200-word prompt | ~5 minutes | ~1.3 minutes |
| Hands | Both occupied | Free to browse code |
| Fatigue | RSI risk | Relaxed |

At ~150 WPM versus ~40 WPM, speaking is about 3.75x faster than typing. A complex 200-word prompt that takes 5 minutes to type? Speak it in about 80 seconds.
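The arithmetic behind those figures is easy to check. The sketch below is purely illustrative: the function names `seconds_for` and `speedup` are made up for this example, and the WPM values are the rough estimates from the table, not measurements.

```rust
/// Seconds needed to produce `words` words at a rate of `wpm` words per minute.
fn seconds_for(words: f64, wpm: f64) -> f64 {
    words / wpm * 60.0
}

/// How many times faster speaking is than typing, given both rates in WPM.
fn speedup(typing_wpm: f64, speaking_wpm: f64) -> f64 {
    speaking_wpm / typing_wpm
}
```

With the table's estimates, `seconds_for(200.0, 40.0)` is 300 seconds (5 minutes), `seconds_for(200.0, 150.0)` is roughly 80 seconds, and `speedup(40.0, 150.0)` is 3.75.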

Demo

📹 GIF coming soon, showing: open app → scan QR → speak on phone → text appears in Cursor

Help wanted: If you use TalkType, record a short demo and submit a PR!

🚀 Quick Start

Get started in 30 seconds:

  1. Download → Latest Release (macOS / Windows / Linux)
  2. Open TalkType on your computer → QR code appears
  3. Scan the QR code with your phone → web page opens (no app install)
  4. Speak on your phone → tap Send → text appears at your cursor ✨

⚠️ Phone and computer must be on the same WiFi.

Pro Tips

  • 📌 Add to Home Screen: in your phone's browser, choose "Add to Home Screen" for instant access (no rescan needed next time)
  • 🌍 Bilingual UI: switch between English and Chinese in the app settings

✨ Features

  • πŸŽ™οΈ Phone-native voice input β€” uses your phone's built-in STT, no model download needed
  • ⌨️ Global text injection β€” text appears at your cursor in ANY app
  • πŸ“± Zero install on phone β€” phone client is a web page, no app needed
  • πŸ”’ 100% local network β€” all data stays on your WiFi, zero cloud dependency
  • πŸ“· QR code pairing β€” scan and connect in seconds
  • ↡ Auto-Enter β€” optionally press Enter after sending, perfect for chat apps
  • πŸ” Auto reconnect β€” remembers connections, auto-reconnects on same WiFi
  • 🌍 i18n β€” English and Chinese UI, auto-detects your language
  • 🌍 Cross-platform β€” macOS, Windows, Linux
  • πŸ“² PWA support β€” add to phone home screen for app-like experience

🎯 Why TalkType?

The Unfair Advantage

Other voice input tools run Whisper locally (1–3 GB model download, CPU/GPU intensive) or use cloud APIs (latency, cost, privacy risk).

TalkType takes a fundamentally different approach: your phone IS the voice engine. Apple's, Google's, and Samsung's STT are trained on billions of data points, support 100+ languages, and are already on your device. TalkType simply bridges that to your computer over local WiFi.

Zero config. Zero cost. Zero compromise on privacy.

vs. Other Voice Input Tools

| | TalkType | SuperWhisper | Wispr Flow | VoiceTypr | MacWhisper |
|---|---|---|---|---|---|
| Price | Free | $9/mo | $15/mo | Free | Freemium |
| Open Source | ✅ | ❌ | ❌ | ✅ | ❌ |
| Voice Engine | Phone native | Local Whisper | Cloud AI | Local Whisper | Local Whisper |
| Model Download | None | 1–3 GB | N/A | 1–3 GB | 1–3 GB |
| Setup Time | 30 sec | 5–10 min | 5 min | 5–10 min | 5 min |
| Languages | 100+ native | ~50 | ~30 | ~50 | ~50 |
| macOS | ✅ | ✅ | ✅ | ✅ | ✅ |
| Windows | ✅ | ❌ | ❌ | ✅ | ❌ |
| Linux | ✅ | ❌ | ❌ | ❌ | ❌ |
| Privacy | LAN only | Local | ☁️ Cloud | Local | Local |

💡 Use Cases

| Scenario | Description |
|---|---|
| 🧑‍💻 Vibe Coding | Speak prompts to Cursor, Copilot, Windsurf, or any AI coding tool |
| 🤖 AI Chat | Dictate to ChatGPT, Claude, or Gemini, 3x faster than typing |
| 📝 Writing | Draft emails, docs, and notes hands-free |
| 💬 Messaging | Reply in Slack, Discord, WeChat, Teams by voice |
| 🔍 Search | Voice search in any app or browser |

📦 Download

Go to Releases and download for your platform:

| Platform | File | Notes |
|---|---|---|
| 🍎 macOS (Apple Silicon) | TalkType_x.x.x_aarch64.dmg | M1 / M2 / M3 / M4 |
| 🍎 macOS (Intel) | TalkType_x.x.x_x64.dmg | Older Intel Macs |
| 🪟 Windows | TalkType_x.x.x_x64-setup.exe | Windows 10 / 11 |
| 🐧 Linux | .deb / .AppImage | Ubuntu, Debian, etc. |

💡 Which Mac? Click the Apple menu → About This Mac → check "Chip". Apple M1/M2/… = aarch64, Intel = x64.

βš™οΈ Platform Notes

🍎 macOS
  • Grant Accessibility permission on first launch: System Settings β†’ Privacy & Security β†’ Accessibility β†’ Enable TalkType βœ…
  • The app guides you through this on first run
  • Closing the window minimizes to menu bar tray (right-click β†’ Quit)
πŸͺŸ Windows
  • Allow firewall access when prompted (required for phone to connect)
  • Closing the window minimizes to system tray (right-click β†’ Quit)
🐧 Linux
  • .deb package handles dependencies automatically
  • For .AppImage: run chmod +x TalkType_*.AppImage first

❓ FAQ

Why use phone voice input instead of local Whisper?

Phone STT engines (Apple, Google, Samsung) are trained on billions of data points, support 100+ languages natively, and are heavily optimized for mobile hardware. No model download (1–3 GB saved), no GPU required, no configuration. It just works, instantly.

Is my data safe?

Yes. All communication happens over your local WiFi via WebSocket. No data ever leaves your network. No cloud, no accounts, no telemetry.
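If you want to confirm that the address shown in the QR code really is a local one, the check is simple. The helper below is a minimal sketch using only Rust's standard library. `is_lan_address` is a hypothetical name for illustration and is not taken from TalkType's source.

```rust
use std::net::Ipv4Addr;

/// Returns true when `addr` can only be reached from the local machine or
/// local network: RFC 1918 private ranges, loopback, or link-local.
/// (Hypothetical helper, not part of TalkType.)
fn is_lan_address(addr: Ipv4Addr) -> bool {
    addr.is_private() || addr.is_loopback() || addr.is_link_local()
}
```

For example, `is_lan_address(Ipv4Addr::new(192, 168, 1, 20))` is true, while a public address such as `8.8.8.8` is rejected.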

Phone can't open the page after scanning?
  • Ensure phone and computer are on the same WiFi
  • Check your computer's firewall settings
  • Try disabling VPN or proxy
Connected but no text appearing?
  • macOS: Check Accessibility permission (System Settings → Privacy & Security → Accessibility)
  • Make sure an input field is focused on your computer
  • Text goes to wherever your cursor currently is

🔧 How It Works

┌─────────────┐       WiFi (LAN)        ┌──────────────┐
│  📱 Phone    │  ◄── WebSocket ──►      │  💻 Computer  │
│  (Browser)   │      tap to send        │  (Desktop App)│
│              │                         │       ↓       │
│  🎤 Voice    │                         │  ⌨️ Keyboard  │
│  → Text      │                         │   Injection   │
└─────────────┘                          └───────┬───────┘
                                                 ↓
                                          Any Active App
                                        (Cursor / Claude /
                                         ChatGPT / Docs)

Tech Stack:

  • Desktop: Tauri 2 + Rust (Axum HTTP/WS server + keyboard simulation)
  • Mobile: Pure HTML/CSS/JS PWA (served by desktop app, zero install)
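To make the flow concrete, here is a rough sketch of how the desktop side might turn one message from the phone into keyboard actions, including the optional Auto-Enter. Everything in it is an assumption for illustration: the `PhoneMessage` fields, the `KeyAction` enum, the `plan_injection` function, and the choice to trim whitespace are hypothetical and may not match TalkType's real wire format or `keyboard.rs`.

```rust
/// Hypothetical shape of one message sent by the phone page.
#[derive(Debug, Clone, PartialEq)]
struct PhoneMessage {
    text: String,
    auto_enter: bool,
}

/// Hypothetical keyboard actions the desktop app would simulate.
#[derive(Debug, Clone, PartialEq)]
enum KeyAction {
    TypeText(String),
    PressEnter,
}

/// Decide what to inject at the cursor for one incoming message.
/// Blank messages produce no actions, so Enter is never pressed on its own.
fn plan_injection(msg: &PhoneMessage) -> Vec<KeyAction> {
    let mut actions = Vec::new();
    // Assumption: surrounding whitespace from dictation is not wanted.
    let text = msg.text.trim();
    if text.is_empty() {
        return actions;
    }
    actions.push(KeyAction::TypeText(text.to_string()));
    if msg.auto_enter {
        actions.push(KeyAction::PressEnter);
    }
    actions
}
```

Keeping this step a pure function separates the decision (what to type) from the platform-specific key simulation, which makes the decision easy to test without touching the keyboard.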

πŸ› οΈ Development

Build from source

Prerequisites

  • Rust 1.70+ (latest stable recommended)
  • Node.js 18+
  • Tauri CLI: cargo install tauri-cli --version "^2"

Dev Mode

git clone https://github.com/puke3615/TalkType.git
cd TalkType
npm install
cargo tauri dev

Build

cargo tauri build

Output: src-tauri/target/release/bundle/

Project Structure

TalkType/
├── src/index.html              # Desktop UI (i18n, QR code)
├── src-tauri/
│   ├── src/
│   │   ├── lib.rs              # App entry & Tauri commands
│   │   ├── server.rs           # HTTP + WebSocket server (Axum)
│   │   ├── mobile.html         # Mobile PWA page (i18n, served at /)
│   │   ├── keyboard.rs         # Keyboard simulation (per-platform)
│   │   ├── network.rs          # LAN IP discovery
│   │   └── qrcode_gen.rs       # QR code generation
│   └── tauri.conf.json
└── .github/workflows/          # CI/CD (macOS + Windows + Linux)
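For contributors curious about what "LAN IP discovery" and QR pairing involve, the sketch below shows one common std-only approach. It is not copied from `network.rs` or `qrcode_gen.rs`. The function names `local_lan_ip` and `pairing_url` are hypothetical, as is any port number you pass in. The UDP trick asks the operating system which local address it would use for outbound traffic. Connecting a UDP socket sends no packets.

```rust
use std::io;
use std::net::{IpAddr, UdpSocket};

/// Ask the OS which local address it would route outbound traffic from.
/// `connect` on a UDP socket only selects a route; nothing is transmitted.
/// Fails if the machine has no usable network route.
fn local_lan_ip() -> io::Result<IpAddr> {
    let socket = UdpSocket::bind("0.0.0.0:0")?;
    // Any routable address works here; it is never contacted.
    socket.connect("8.8.8.8:80")?;
    Ok(socket.local_addr()?.ip())
}

/// Build the URL a QR code would encode so the phone can open the page.
/// IPv6 literals must be wrapped in brackets inside a URL.
fn pairing_url(ip: IpAddr, port: u16) -> String {
    match ip {
        IpAddr::V4(v4) => format!("http://{v4}:{port}/"),
        IpAddr::V6(v6) => format!("http://[{v6}]:{port}/"),
    }
}
```

A caller would combine the two, for example `pairing_url(local_lan_ip()?, port)`, and hand the resulting string to a QR code generator.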

📄 License

MIT: free to use, modify, and distribute.


If TalkType saves you time, give it a ⭐!
Built with ❤️ using Tauri 2 + Rust
