Install on macOS
- Go to the Download tab.
- Download
VocalCanvas.dmg. - Open the DMG and drag
Vocal Canvas.apptoApplications. - Open the app from
Applications(use right-click → Open on first launch if prompted).
v0.1.1 (Beta)
Public beta: features, UI details, and packaging behavior may change quickly.
VocalCanvas.dmg.Vocal Canvas.app to Applications.Applications (use right-click → Open on first launch if prompted).VocalCanvasWindows.zip and extract it.pip install -r requirements.txt
python vocal_canvas_windows.py
V3TS (Voice to Text to Speech) is a real-time voice mirror built into the Vocal Canvas desktop app. It listens through your microphone, transcribes what you say using an on-device Whisper model, and instantly reads it back through the selected system voice — all without any cloud round-trips.
Transcription runs entirely on your machine using faster-whisper, a local implementation of OpenAI's Whisper model. No audio is ever sent to a server. The tiny model delivers around 230ms inference time on a 1-second audio clip, keeping the experience near-instant even when you're talking fast.
V3TS automatically mutes the microphone listener while the synthesized voice is playing, plus a 500ms tail to catch acoustic echo. This prevents the app from hearing and re-reading its own TTS output in a loop.
When BlackHole 2ch is installed, V3TS routes the synthesized voice to a virtual microphone visible in macOS System Settings. Any other app — Discord, Zoom, OBS, and more — can select that virtual mic as its input and receive the synthesized voice directly.
Pick any macOS system voice, set the speaking rate, choose your input microphone, and select the Whisper model size (tiny, base, or small) to trade speed for accuracy. All settings are applied live with no restart required.