it0/packages/services/voice-agent
hailin 186234bae2 fix: increase STT silence_duration_ms to prevent choppy transcription
Default silence_duration_ms=350 is too aggressive for Chinese speech,
causing sentences to be fragmented into 1-3 character chunks. Increase
to 800ms and raise VAD threshold to 0.6 so the STT waits longer before
finalizing a turn, producing complete sentences for LLM processing.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-01 18:37:13 -08:00
..
src fix: increase STT silence_duration_ms to prevent choppy transcription 2026-03-01 18:37:13 -08:00
Dockerfile fix: resolve websockets version conflict and use CPU-only torch 2026-02-28 09:02:31 -08:00
requirements.txt fix: resolve websockets version conflict and use CPU-only torch 2026-02-28 09:02:31 -08:00