hailin/it0 - it0 - AI Wolves Team

Commit Graph

Author	SHA1	Message	Date
hailin	57fabb4653	fix: set interleaved=true for PcmPlayer streaming playback FlutterSoundPlayer.feedUint8FromStream() requires interleaved mode. With interleaved=false, every feed() call threw: "Cannot feed with UInt8 with non interleaved mode" - feedUint8FromStream (Uint8List) → requires interleaved: true - feedFromStream (Float32List) → requires interleaved: false Since we feed raw PCM bytes (Uint8List), interleaved must be true. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 06:59:06 -08:00
hailin	e706a4cdc7	fix: enable simultaneous playback + recording in voice call Root cause: PcmPlayer called openPlayer() without audio session config, so Android defaulted to earpiece-only mode. When the mic was actively recording, playback was silently suppressed — the agent's TTS audio was sent successfully over WebSocket but never reached the speaker. Changes: 1. PcmPlayer (pcm_player.dart): - Added audio_session package for proper audio session management - Configure AudioSession with playAndRecord category so mic + speaker work simultaneously - Set voiceCommunication usage to enable Android hardware AEC (echo cancellation) — prevents feedback loops when speaker is active - defaultToSpeaker routes output to loudspeaker instead of earpiece - Restored setSpeakerOn() method stub (used by UI toggle) 2. AgentCallPage (agent_call_page.dart): - Fixed fire-and-forget bug: _pcmPlayer.feed() returns Future but was called without await, causing interleaved feedUint8FromStream calls - Added _feedChain serializer to guarantee sequential audio feeding 3. Dependencies: - Added audio_session package to pubspec.yaml Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 06:48:16 -08:00
hailin	a6cd3c20d9	feat: add WebSocket robustness to voice call (heartbeat, reconnect, jitter buffer) Addresses reliability gaps in the real-time voice WebSocket connection between Flutter client and Python voice-service backend. Backend (voice-service): - Heartbeat: new _heartbeat_sender coroutine sends JSON ping text frames every 15s alongside the Pipecat pipeline; failed send = dead connection - Session preservation: on WebSocket disconnect, sessions are now marked "disconnected" with a timestamp instead of being deleted, allowing reconnection within a configurable TTL (default 60s) - Reconnect endpoint: POST /sessions/{id}/reconnect verifies the session is alive and in "disconnected" state, returns fresh websocket_url - Reconnect-aware WS handler: detects "disconnected" sessions, cancels stale pipeline tasks, creates a new pipeline, sends "session.resumed" - Background cleanup: asyncio loop every 30s removes sessions that have been disconnected longer than session_ttl - Structured event protocol: text frames = JSON control messages (ping/pong/session.resumed/session.ended/error), binary = PCM audio - New settings: session_ttl (60s), heartbeat_interval (15s), heartbeat_timeout (45s) Flutter (agent_call_page.dart): - Heartbeat monitoring: tracks last server ping timestamp, triggers reconnect if no ping received in 45s (3 missed intervals) - Auto-reconnect: exponential backoff (1s→2s→4s→8s→16s), max 5 attempts; calls /reconnect endpoint to verify session, rebuilds WebSocket, resets audio buffer, restarts heartbeat - Reconnecting UI: yellow warning banner "重新连接中... (N/5)" ("Reconnecting... (N/5)") with spinner overlay during reconnection attempts - WebSocket data routing: _onWsData distinguishes String (JSON control) from binary (audio) frames, handles ping/session.resumed/session.ended - User-initiated disconnect guard: _userEndedCall flag prevents reconnect attempts when user intentionally hangs up - session_id field compatibility: supports session_id/sessionId/id Flutter (pcm_player.dart): - Jitter buffer: queues incoming PCM chunks, starts playback only after accumulating 4800 bytes (150ms at 16kHz 16-bit mono) to smooth out network timing variance - reset() method: clears buffer on reconnect to discard stale audio - Buffer underrun handling: re-enters buffering phase if queue empties Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 07:32:19 -08:00
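hailin	092a561867	feat: complete three major iAgent App features + fix tenant context ## Feature 1: Settings page (full implementation) - Added a light theme (lightTheme); supports three modes: dark, light, and follow system - app.dart wires up themeMode for dynamic switching - Fully rewrote the settings page: profile editing, password change, theme switching, notification toggle - Added settings_remote_datasource to connect to the backend admin/settings API - settings_providers adds AccountProfileNotifier to manage the remote profile ## Feature 2: Voice call (audio integration) - Added the flutter_sound dependency and created PcmPlayer, a streaming PCM player - agent_call_page replaces the stub with real microphone capture (record + GTCRN noise reduction) - Real 16kHz PCM streaming playback; waveform animation driven by RMS energy - Fixed the WebSocket URL path: /ws/voice/ → /api/v1/voice/ws/ - voice_repository_impl automatically resolves relative paths returned by the backend ## Feature 3: Push notifications (WebSocket MVP) - Added the flutter_local_notifications + socket_io_client dependencies - Created the AppNotification entity and NotificationService (Socket.IO connection to comm-service) - Notification providers: list management + unread count - Automatically connects to the notification service after login and disconnects on logout - Added an unread badge (Badge) to the Alerts tab in the bottom navigation - AndroidManifest adds the POST_NOTIFICATIONS permission - main.dart initializes the local notifications plugin ## Fix: tenant context not initialized (500 error) - Root cause: currentTenantIdProvider was not set after login, so the X-Tenant-Id header was missing - Flutter: after a successful login(), set tenantId from the JWT; clear it on logout - Backend: tenant-context.middleware adds fallback logic that reads tenantId from the JWT - AuthUser model now parses a tenantId field. Added 5 files, modified 16 files, added 3 dependency packages Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 01:10:52 -08:00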
hailin	a568558585	feat: replace speech_to_text with GTCRN ML noise reduction + backend STT Replace traditional on-device speech_to_text with a modern pipeline: - Record audio via `record` package with hardware noise suppression - Apply GTCRN neural denoising (sherpa-onnx, ICASSP 2024, 48K params) - Trim silence, POST to backend /voice/transcribe (faster-whisper) Changes: - Add /transcribe endpoint to voice-service for audio file upload - Add SpeechEnhancer wrapper for sherpa-onnx GTCRN model (523KB) - Rewrite chat_page.dart voice input: record → denoise → transcribe - Keep NoiseReducer.trimSilence for silence removal only - Upgrade record to v6.2.0, add sherpa_onnx, path_provider Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 07:59:15 -08:00
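
Note on a6cd3c20d9 (jitter buffer): the buffering behaviour described for pcm_player.dart can be sketched roughly as below. This is an illustrative Python sketch, not the Dart code from the commit; the class name JitterBuffer and the push/pop method names are hypothetical. Only the 4800-byte threshold, the reset-on-reconnect step, and the re-buffer-on-underrun rule come from the commit message.

```python
from collections import deque
from typing import Optional

SAMPLE_RATE_HZ = 16_000   # 16kHz, per the commit message
BYTES_PER_SAMPLE = 2      # 16-bit mono
PREBUFFER_MS = 150
# 16000 samples/s * 2 bytes * 0.150 s = 4800 bytes
PREBUFFER_BYTES = SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * PREBUFFER_MS // 1000


class JitterBuffer:
    """Hypothetical illustration; not the actual PcmPlayer implementation."""

    def __init__(self, threshold: int = PREBUFFER_BYTES) -> None:
        self._threshold = threshold
        self._queue = deque()
        self._queued_bytes = 0
        self._buffering = True  # start in the buffering phase

    def push(self, chunk: bytes) -> None:
        """Queue an incoming PCM chunk; leave buffering once enough is queued."""
        self._queue.append(chunk)
        self._queued_bytes += len(chunk)
        if self._buffering and self._queued_bytes >= self._threshold:
            self._buffering = False

    def pop(self) -> Optional[bytes]:
        """Return the next chunk to play, or None while buffering."""
        if self._buffering:
            return None
        if not self._queue:
            # Underrun: re-enter the buffering phase.
            self._buffering = True
            return None
        chunk = self._queue.popleft()
        self._queued_bytes -= len(chunk)
        return chunk

    def reset(self) -> None:
        """Discard stale audio, e.g. after a reconnect."""
        self._queue.clear()
        self._queued_bytes = 0
        self._buffering = True
```

With this sketch, pop() returns None until at least 4800 bytes have been pushed, then drains chunks in arrival order, and returns to buffering once the queue empties.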

5 Commits
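
Note on a6cd3c20d9 (reconnect timing): the backoff schedule and heartbeat timeout in that commit reduce to a few lines of arithmetic. The sketch below is in Python for illustration only; the client logic in the commit lives in agent_call_page.dart, and the function names here are hypothetical. The constants (1s base delay, doubling, 5 attempts, 45s timeout) and the user-hangup guard are taken from the commit message; treating exactly 45s as expired is an assumption.

```python
from typing import Optional

BASE_DELAY_S = 1.0          # first retry after 1s
MAX_ATTEMPTS = 5            # 1s, 2s, 4s, 8s, 16s
HEARTBEAT_TIMEOUT_S = 45.0  # 3 missed pings at a 15s interval


def backoff_delay(attempt: int) -> Optional[float]:
    """Delay before reconnect attempt `attempt` (1-based); None means give up."""
    if attempt < 1 or attempt > MAX_ATTEMPTS:
        return None
    return BASE_DELAY_S * 2 ** (attempt - 1)


def heartbeat_expired(last_ping_s: float, now_s: float) -> bool:
    """True once no ping has arrived for the timeout window.

    Counting exactly 45s as expired is an assumption of this sketch.
    """
    return (now_s - last_ping_s) >= HEARTBEAT_TIMEOUT_S


def should_reconnect(last_ping_s: float, now_s: float, user_ended_call: bool) -> bool:
    """Mirror of the _userEndedCall guard: never reconnect after a deliberate hang-up."""
    return (not user_ended_call) and heartbeat_expired(last_ping_s, now_s)
```

Summed over all five attempts, the schedule waits at most 31s in total, which fits inside the default 60s session_ttl on the backend.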