it0/packages/services/voice-service
hailin 65e68a0487 feat: streaming TTS — synthesize per-sentence as agent tokens arrive
Replace batch TTS (wait for full response) with streaming approach:
- _agent_generate → _agent_stream async generator (yield text chunks)
- _process_speech accumulates tokens, splits on sentence boundaries
- Each sentence is TTS'd and sent immediately while more tokens arrive
- First audio plays within ~1s of the agent starting to respond, instead of after the full text is generated
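
The flow described above can be sketched roughly as follows. This is a minimal illustration, not the code from this commit: `split_sentences`, `speak_streaming`, `fake_agent_stream`, `synthesize`, and `send_audio` are hypothetical names, the fake stream stands in for `_agent_stream`, and the sentence-boundary rule (`.`, `!`, or `?` followed by whitespace) is an assumption about what "sentence boundaries" means here.

```python
import asyncio
import re
from typing import AsyncIterator, Awaitable, Callable, List

# Assumed boundary rule: whitespace that follows '.', '!' or '?'.
_SENTENCE_END = re.compile(r"(?<=[.!?])\s+")


async def split_sentences(chunks: AsyncIterator[str]) -> AsyncIterator[str]:
    """Yield each complete sentence as soon as it appears in the token stream."""
    buffer = ""
    async for chunk in chunks:
        buffer += chunk
        parts = _SENTENCE_END.split(buffer)
        # Every part except the last is terminated; the last may still grow.
        for sentence in parts[:-1]:
            if sentence.strip():
                yield sentence.strip()
        buffer = parts[-1]
    # Flush whatever is left when the agent stops producing tokens.
    if buffer.strip():
        yield buffer.strip()


async def speak_streaming(
    chunks: AsyncIterator[str],
    synthesize: Callable[[str], Awaitable[bytes]],
    send_audio: Callable[[bytes], Awaitable[None]],
) -> None:
    """Synthesize and send each sentence without waiting for the full response."""
    async for sentence in split_sentences(chunks):
        audio = await synthesize(sentence)
        await send_audio(audio)


async def fake_agent_stream() -> AsyncIterator[str]:
    """Hypothetical stand-in for an agent that yields text chunks."""
    for token in ["Hello", " there.", " How", " are you?", " Bye"]:
        await asyncio.sleep(0)  # simulate tokens arriving over time
        yield token


async def demo() -> List[bytes]:
    sent: List[bytes] = []

    async def synthesize(text: str) -> bytes:
        return text.encode()  # placeholder for a real TTS call

    async def send_audio(audio: bytes) -> None:
        sent.append(audio)  # placeholder for a WebSocket send

    await speak_streaming(fake_agent_stream(), synthesize, send_audio)
    return sent


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

A regex boundary like this one will also split after abbreviations such as "e.g.", so a real pipeline may need a stricter rule or a minimum sentence length before handing text to TTS.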

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-24 03:14:22 -08:00
src feat: streaming TTS — synthesize per-sentence as agent tokens arrive 2026-02-24 03:14:22 -08:00
Dockerfile fix: upgrade faster-whisper to 1.2.1 to resolve av build failure 2026-02-19 16:40:04 -08:00
requirements.txt fix: rewrite voice pipeline for direct WebSocket I/O, fix TTS and navigation 2026-02-23 23:34:35 -08:00