The voice module wraps your speech-to-text (STT) and text-to-speech (TTS) providers behind a single declarative interface. The user speaks, the agent answers in voice, and the rest of the YAML is the same as for any other agent. Latency below 800 ms feels natural and is achievable with the right provider mix.

modules:
  voice:
    config:
      stt: { provider: whisper, model: whisper-1 }
      tts: { provider: openai, voice: nova }

agents:
  - id: voice
    modules: [voice]
    brain: { model: claude-haiku-4-5 }
    system_prompt: "Speak conversationally. Keep answers under 30 seconds."

# 1. install runtime
curl -sSL https://digitorn.ai/install | sh
# 2. save the YAML above to ~/.digitorn/apps/my-voice-assistant/app.yaml
mkdir -p ~/.digitorn/apps/my-voice-assistant
# paste the YAML into app.yaml
# 3. deploy
digitorn deploy my-voice-assistant
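
If you would rather not paste by hand, step 2 above can be scripted with a here-document. This is a minimal sketch, not part of the Digitorn tooling: `APP_DIR` is a hypothetical convenience variable introduced here, and the YAML indentation is our reading of the config shown above.

```shell
# Sketch only: writes the YAML from this post to the path used in step 2.
# APP_DIR is a hypothetical override variable, not a digitorn setting.
APP_DIR="${APP_DIR:-$HOME/.digitorn/apps/my-voice-assistant}"
mkdir -p "$APP_DIR"

# The quoted 'EOF' delimiter stops the shell from expanding anything inside the YAML.
cat > "$APP_DIR/app.yaml" <<'EOF'
modules:
  voice:
    config:
      stt: { provider: whisper, model: whisper-1 }
      tts: { provider: openai, voice: nova }

agents:
  - id: voice
    modules: [voice]
    brain: { model: claude-haiku-4-5 }
    system_prompt: "Speak conversationally. Keep answers under 30 seconds."
EOF

# Confirm the file is non-empty before moving on to the deploy step.
test -s "$APP_DIR/app.yaml" && echo "wrote $APP_DIR/app.yaml"
```

Run it once after installing the runtime, then continue with step 3 as written.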