it0/packages/services/voice-service
hailin a568558585 feat: replace speech_to_text with GTCRN ML noise reduction + backend STT
Replace traditional on-device speech_to_text with a modern pipeline:
- Record audio via `record` package with hardware noise suppression
- Apply GTCRN neural denoising (sherpa-onnx, ICASSP 2024, 48K params)
- Trim silence, POST to backend /voice/transcribe (faster-whisper)

Changes:
- Add /transcribe endpoint to voice-service for audio file upload
- Add SpeechEnhancer wrapper for sherpa-onnx GTCRN model (523KB)
- Rewrite chat_page.dart voice input: record → denoise → transcribe
- Keep NoiseReducer.trimSilence for silence removal only
- Upgrade record to v6.2.0, add sherpa_onnx, path_provider

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-22 07:59:15 -08:00
..
src feat: replace speech_to_text with GTCRN ML noise reduction + backend STT 2026-02-22 07:59:15 -08:00
Dockerfile fix: upgrade faster-whisper to 1.2.1 to resolve av build failure 2026-02-19 16:40:04 -08:00
requirements.txt fix: resolve runtime errors for NestJS, Kong, and voice-service 2026-02-19 19:00:03 -08:00