VoxCode is a production-grade, advanced-level voice-controlled programming assistant. It enables developers to generate, optimize, and explain code using natural voice commands, bridging the gap between human thought and executable code with a high-performance, dual-engine speech pipeline.
- API Server: Hardened Flask REST API with structured logging, per-IP rate limiting, and DB connection pooling for high concurrency.
- Speech Pipeline: Hybrid engine using Vosk (real-time partials) and Whisper (high-accuracy final transcription). Includes WebRTC VAD for voice activity detection and Noisereduce for adaptive environmental noise profiling.
- Command Processor: Advanced intent detection using a hybrid of regex pattern matching and Zero-Shot BART Transformers. Supports multi-intent detection and contextual memory for follow-up commands (e.g., "convert that to rust").
- NLP Service: Deep linguistic analysis with spaCy. Features a coding-specific dictionary (100+ entries) and abbreviation expansion (e.g.,
fn→function). - Template Service: Idiomatic code scaffolding for 12+ languages including Rust, Go, Swift, and Kotlin.
- Dashboard: Premium UI with Glassmorphism effects, full Dark Mode support, and real-time audio waveform visualization.
- Pro Hooks: Adaptive polling hooks with audio RMS monitoring and stream processing for AI responses.
- Interactions: Toast-based notification system, step-by-step code explanation mode, and 45+ domain-specific voice shortcuts.
- Backend: Python 3.11, Flask, PostgreSQL (NeonDB), Whisper, Vosk, spaCy, BART, NLTK.
- Frontend: React 19, Vite, Tailwind CSS, Lucide icons, Prism.js.
- DevOps: Local LLM hosting (Llama-CPP), structured logging, health metrics.
- Dependencies:
.\install_dependencies.bat - Configuration:
Copy
.env.exampleto.envand set yourDATABASE_URLandJWT_SECRET. - Start:
.\start.bat
- "Create a linked list in rust"
- "Quick sort algorithm in go"
- "Build a flask api with jwt auth"
- "React component named UserCard"
- "Optimize this code"
- "Explain how this function works"