$ timeahead.in
/ servers/pypi/voice-mode
pypi

voice-mode

Natural (2-way) voice conversations with Claude Code

1k stars6k/wkupdated 1d agogithub ↗
81good
▣ Overview

What it does

VoiceMode adds 2-way voice conversations to Claude Code. Developers speak naturally rather than type; the server transcribes speech to text via Whisper (local or cloud), sends the transcript to Claude, and converts responses back to audio via Kokoro (local) or OpenAI TTS (cloud). It features low-latency streaming, smart silence detection to know when you've finished speaking, and works entirely offline if local speech services are configured.

Who it's for

Engineers and developers using Claude Code in situations where typing isn't practical: pair-programming while cooking, debugging during walks between meetings, code reviews over coffee, or extended sessions where hands-free operation reduces eye strain and improves focus.

Common use cases

  • Ask quick questions or get clarifications without breaking focus from physical tasks
  • Pair-program or debug while multitasking (walking, cooking, holding a beverage)
  • Reduce eye strain during extended coding sessions via voice-based interaction
  • Review code changes and discuss architecture decisions in real time without a keyboard
  • Work in environments (meetings, open office) where a keyboard is inconvenient

Setup pitfalls

  • System dependencies differ by platform: Ubuntu/Debian need ffmpeg, portaudio, libasound2, and pulseaudio; macOS requires ffmpeg and portaudio via Homebrew; WSL2 specifically requires pulseaudio packages for microphone access
  • One secret has been detected in the repository—review the codebase before use in sensitive environments
  • Microphone and speaker permissions must be granted to the terminal or app on macOS and Linux
  • OpenAI API key (set via OPENAI_API_KEY env var) is required for cloud-based STT/TTS but optional if local Whisper and Kokoro services are configured
1 credential detected in repository history via Gitleaks
▣ Score BreakdownMCPScore = Σ(raw × weight)
DimensionRawWeighted
Security
35%
80
28.0
Freshness
25%
100
25.0
Adoption
20%
76
15.2
Quality
10%
80
8.0
Trust
10%
50
5.0
Total
81.2
⚿ Capabilities & Risk Explainer
fs readfs writenetworkexecsecrets
◆ Risk level: high· 2 tools · auth: API key
fs read + fs write + network + exec + secrets active — can execute code, access credentials, and make external network calls.
Tool nameDescriptionDestructive?
list_whisper_versions✓ no
list_kokoro_versions✓ no
⚙ Install config
Claude Desktop · Cursor · Windsurf · VS Code (Copilot) · Claude Code
add to your MCP client config:
{
  "mcpServers": {
    "voice-mode": {
      "command": "uvx",
      "args": [
        "voice-mode"
      ]
    }
  }
}
📈 Score historylast 30 snapshots
5/10/20266/11/2026 · 30 snapshots
⚙ Maintenance health
53/ 100 · is this project alive?
contributors (1y)12
top contributor share64%
releases (1y)100
last release3d ago
ci✗ none
⛁ Raw data
weekly downloads6k
github stars1k
forks170
open issues57
license✓ present
readme length6069 chars
last publish0d ago
last commit1d ago
last updated1d ago
install verified✓ pass · 25d ago
owner of this server? claim your listing to get a verified badgeclaim →
🔔 Score drop alerts
get notified by email when this server's score drops 5+ points