hailin/it0 - it0 - AI Wolves Team

Commit Graph

Author	SHA1	Message	Date
hailin	f0ad6e09e6	fix: move entrypoint.sh to project root (deploy/ is in .dockerignore) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:14:31 -08:00
hailin	bad7f4802d	fix: use root entrypoint to copy SSH key then drop to appuser The bind-mounted SSH key is owned by host uid (1000/node) but the service runs as appuser (uid 1001). Use su-exec in entrypoint.sh to copy the key as root, fix ownership, then drop privileges. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:13:55 -08:00
hailin	329916e1f6	fix: correct SSH key permissions in agent-service container Mount host key to /tmp/host-ssh-key (read-only), then copy to appuser's .ssh directory with correct ownership at container start. Fixes "Permission denied" due to uid mismatch on bind mount. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:00:02 -08:00
hailin	795e8a11c5	feat: enable SSH access from agent-service container - Add openssh-client to Dockerfile.service (alpine) - Create .ssh directory with correct permissions for appuser - Mount host SSH key into agent-service container (read-only) This allows the Agent SDK to SSH into managed servers using the Bash tool. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 11:55:54 -08:00
hailin	36d36acad4	fix: set tenantId when creating credentials in inventory-service The createCredential method was missing the tenantId assignment, causing a NOT NULL constraint violation on the credentials table. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 10:52:14 -08:00
hailin	51b348e609	feat: complete tenant member management (CRUD + delete tenant) Backend: add 5 missing endpoints to TenantController: - DELETE /tenants/:id (deprovision schema + cleanup) - GET /tenants/:id/members (query tenant schema users) - PATCH /tenants/:id/members/:memberId (change role) - DELETE /tenants/:id/members/:memberId (remove member) - PUT /tenants/:id (alias for frontend compatibility) Frontend: add member actions to tenant detail page: - Role column changed to dropdown selector - Added remove member button with confirmation - Added updateMember and removeMember mutations Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 10:00:09 -08:00
hailin	bc7e32061a	fix: improve voice call reconnection robustness Server side (session_router.py): - /reconnect now accepts sessions in "active" state (not just "disconnected") - When client reconnects to an active session, the old WebSocket/pipeline is automatically replaced when the new WebSocket connects - Only truly terminal states (e.g. "ended") return 409 Flutter side (agent_call_page.dart): - Distinguish terminal errors (404 session gone, 409 ended) from transient errors (network timeout, server unreachable) in reconnect loop - Terminal errors break immediately instead of wasting retry attempts - Extract _connectWebSocket() helper for cleaner reconnect flow - Add DioException handling for proper HTTP status code inspection Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 07:33:34 -08:00
hailin	57fabb4653	fix: set interleaved=true for PcmPlayer streaming playback FlutterSoundPlayer.feedUint8FromStream() requires interleaved mode. With interleaved=false, every feed() call threw: "Cannot feed with UInt8 with non interleaved mode" - feedUint8FromStream (Uint8List) → requires interleaved: true - feedFromStream (Float32List) → requires interleaved: false Since we feed raw PCM bytes (Uint8List), interleaved must be true. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 06:59:06 -08:00
hailin	e706a4cdc7	fix: enable simultaneous playback + recording in voice call Root cause: PcmPlayer called openPlayer() without audio session config, so Android defaulted to earpiece-only mode. When the mic was actively recording, playback was silently suppressed — the agent's TTS audio was sent successfully over WebSocket but never reached the speaker. Changes: 1. PcmPlayer (pcm_player.dart): - Added audio_session package for proper audio session management - Configure AudioSession with playAndRecord category so mic + speaker work simultaneously - Set voiceCommunication usage to enable Android hardware AEC (echo cancellation) — prevents feedback loops when speaker is active - defaultToSpeaker routes output to loudspeaker instead of earpiece - Restored setSpeakerOn() method stub (used by UI toggle) 2. AgentCallPage (agent_call_page.dart): - Fixed fire-and-forget bug: _pcmPlayer.feed() returns Future but was called without await, causing interleaved feedUint8FromStream calls - Added _feedChain serializer to guarantee sequential audio feeding 3. Dependencies: - Added audio_session package to pubspec.yaml Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 06:48:16 -08:00
hailin	75083f23aa	debug: add TTS send_bytes logging to pipeline Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 06:19:18 -08:00
hailin	5be7f9c078	fix: resample OpenAI TTS output from 24kHz to 16kHz WAV OpenAI TTS returns 24kHz audio which Android MediaPlayer can't play via FlutterSound's pcm16WAV codec. Request raw PCM and resample to 16kHz before wrapping in WAV header, matching the local TTS format. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 05:38:39 -08:00
hailin	4456550393	feat: lazy-load local TTS/STT models on first request Local /synthesize and /transcribe endpoints now auto-load Kokoro/Whisper models on first call instead of returning 503 when not pre-loaded at startup. This allows switching between Local and OpenAI providers in the Flutter test page without requiring server restart. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 04:38:49 -08:00
hailin	7b71a4f2fc	fix: properly close WebSocket with subscription cancel + fire-and-forget Root cause: IOWebSocketChannel.sink.close() can hang indefinitely (dart-lang/web_socket_channel#185). Previous fix used unawaited close but didn't cancel the stream subscription, so the old listener could still push events to _messageController. Fix: Extract _closeCurrentConnection() that: 1. Cancels StreamSubscription first (stops duplicate events immediately) 2. Fire-and-forget sink.close(goingAway) (frees underlying socket) This follows the workaround recommended in the official issue tracker. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 03:45:43 -08:00
hailin	45eb6bc453	fix: use unawaited close to prevent WebSocket reconnect hang The await on sink.close() blocks indefinitely when the server doesn't respond to the close handshake. Use fire-and-forget with unawaited() so the new connection can proceed immediately. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 03:41:13 -08:00
hailin	3185438f36	fix: close previous WebSocket before opening new connection When sending a second message in the same session, the old WebSocket connection was not closed, causing both connections to subscribe to the same session room. This resulted in each text event being received twice, producing garbled/duplicated output text. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 03:37:16 -08:00
hailin	e02b350043	fix: create /data/claude-tenants dir with appuser ownership in Dockerfile Without this, the SDK engine fails to create tenant HOME directories because the Docker volume mount point doesn't exist and appuser lacks write permissions. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 02:52:57 -08:00
hailin	cc0f06e2be	feat: SDK engine native resume with per-tenant HOME isolation Replace prompt-prefix workaround with SDK's native resume mechanism. Each tenant gets isolated HOME directory (/data/claude-tenants/{tenantId}) to prevent cross-tenant session file mixing. SDK session IDs are persisted in session.metadata for cross-request resume support. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 02:27:38 -08:00
hailin	2403ce5636	feat: multi-turn conversation context management with session history UI Implement DB-based conversation message storage (engine-agnostic) that works across both Claude API and Agent SDK engines. Add ChatGPT/Claude-style conversation history drawer in Flutter with date-grouped session list, session switching, and new chat functionality. Backend: entity, repository, context service, migration 004, session/message API endpoints. Flutter: ConversationDrawer, sessionId flow from backend response via SessionInfoEvent, session list/switch/delete support. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 19:04:35 -08:00
hailin	7cda482e49	fix: simplify _dioBinary in voice test page to avoid interceptor conflicts Remove shared interceptors from the binary Dio instance to prevent request dedup/retry interceptors from interfering with audio downloads. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 17:57:58 -08:00
hailin	c02c2a9a11	feat: add OpenAI TTS/STT provider support in voice pipeline - Add STT_PROVIDER/TTS_PROVIDER config (local or openai) in settings - Pipeline uses OpenAI API for STT/TTS when provider is "openai" - Skip loading local models (Kokoro/faster-whisper) when using OpenAI - VAD (Silero) always loads for speech detection Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 09:27:38 -08:00
hailin	f7d39d8544	fix: use theme-aware colors in voice test page for dark mode readability Replace hardcoded Colors.grey with Theme.of(context).colorScheme for result containers and status text so they're readable in both light and dark themes. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 09:21:06 -08:00
hailin	f8f0d17820	fix: disable SSL verification for OpenAI proxy with self-signed cert Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 08:59:06 -08:00
hailin	d43baed3a5	feat: add OpenAI TTS/STT API endpoints for comparison testing - Add openai package to voice-service requirements - Add /api/v1/test/tts/synthesize-openai (tts-1/tts-1-hd/gpt-4o-mini-tts) - Add /api/v1/test/stt/transcribe-openai (gpt-4o-transcribe/whisper-1) - Add OPENAI_API_KEY and OPENAI_BASE_URL env vars to voice-service - Flutter test page: SegmentedButton to toggle Local/OpenAI provider - All endpoints maintain same response format for easy comparison Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 07:20:03 -08:00
hailin	ac0b8ee1c6	fix: rewrite voice test page using flutter_sound for both record and play - Remove record package dependency, use FlutterSoundRecorder instead - Use permission_handler for microphone permission (already in pubspec) - Proper temp file path via path_provider - Cleanup temp files after upload - Single package (flutter_sound) handles both recording and playback Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 05:41:10 -08:00
hailin	d4783a3497	fix: use temp directory path for audio recording instead of empty string The record package requires a valid file path. Empty string caused ENOENT (No such file or directory) on Android. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 05:39:07 -08:00
hailin	5d4fd96d43	feat: streaming claude-api engine, engineType override, fix voice test page - Claude API engine now uses streaming API (messages.stream) for real-time text delta output instead of waiting for full response - Agent controller accepts optional engineType body parameter to allow callers (e.g. voice pipeline) to select a specific engine - Fix voice_test_page.dart compilation error: replace audioplayers (not installed) with flutter_sound (already in pubspec.yaml) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 05:30:11 -08:00
hailin	6e832c7615	feat: add voice I/O test page in Flutter settings - TTS: text input → Kokoro synthesis → audio playback - STT: long-press record → faster-whisper transcription - Round-trip: record → STT → TTS → playback - Added /api/v1/test route to Kong gateway for voice-service - Accessible from Settings → 语音 I/O 测试 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 05:16:10 -08:00
hailin	0bd050c80f	feat: add STT test and round-trip test to voice test page - STT: record from mic or upload audio file → faster-whisper transcription - Round-trip: record → STT → TTS → playback (full pipeline test) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 05:08:00 -08:00
hailin	0aa20cbc73	feat: add temporary TTS test page at /api/v1/test/tts Browser-accessible page to test text-to-speech synthesis without going through the full voice pipeline. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 05:06:02 -08:00
hailin	740f8f5f88	fix: sentence splitting bug in voice pipeline TTS streaming When the first punctuation mark appeared before _MIN_SENTENCE_LEN chars, the regex search would always find it first and skip it, permanently blocking all subsequent sentence splits. Fix by advancing search_start past short matches instead of breaking out of the loop. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 05:03:05 -08:00
hailin	7ac753ada4	fix: add ANTHROPIC_BASE_URL to agent-service for proxy access The agent-service was missing the ANTHROPIC_BASE_URL environment variable, causing the Claude Agent SDK to call api.anthropic.com directly instead of going through the proxy at 67.223.119.33, resulting in 403 Forbidden errors. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 04:49:27 -08:00
hailin	79fae0629e	chore: upgrade claude-agent-sdk to ^0.2.52 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 04:12:03 -08:00
hailin	2a150dcff5	fix: prevent error event from overriding completed status in controller Add finished guard so that once a task reaches completed/error terminal state, subsequent events don't flip the status back. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 03:49:21 -08:00
hailin	6876ec569b	fix: remove ANTHROPIC_API_KEY from agent-service to use subscription mode Default to OAuth subscription billing via ~/.claude/.credentials.json instead of consuming API key credits. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 03:43:09 -08:00
hailin	e20936ee2a	feat: collapsible thinking node in chat timeline Thinking content auto-expands while streaming, auto-collapses when done. User can toggle with "Thinking ∨" button, matching Claude Code VSCode UX. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 03:34:58 -08:00
hailin	8e4bd573f4	fix: deduplicate text events from SDK stream_event and assistant message SDK sends text both via stream_event deltas (token-level) and assistant message (complete block). Track hasStreamedText flag per session to skip duplicate text extraction from assistant messages. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 03:31:48 -08:00
hailin	65e68a0487	feat: streaming TTS — synthesize per-sentence as agent tokens arrive Replace batch TTS (wait for full response) with streaming approach: - _agent_generate → _agent_stream async generator (yield text chunks) - _process_speech accumulates tokens, splits on sentence boundaries - Each sentence is TTS'd and sent immediately while more tokens arrive - First audio plays within ~1s of agent response vs waiting for full text Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 03:14:22 -08:00
hailin	aa2a49afd4	fix: extract text from assistant message + fix event data parsing Root causes found: 1. SDK engine only emitted 'completed' without 'text' events because mapSdkMessage skipped text blocks in 'assistant' messages (assumed stream_event deltas would provide them, but SDK didn't send deltas) 2. Voice pipeline read evt_data.data.content but engine events are flat (evt_data.content) — so even if text arrived, it was never extracted Fixes: - Extract text/thinking blocks from assistant messages in SDK engine - Fix voice pipeline to read content directly from evt_data, not nested Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 03:01:25 -08:00
hailin	a7b42e6b98	feat: add detailed logging to agent engine and task controller Log every SDK message type, event emission, and stream lifecycle to diagnose why text events are missing in voice-agent flow. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 02:56:09 -08:00
hailin	0dbe711ed3	feat: add detailed logging to voice pipeline (STT/Agent/TTS timing) Log timestamps, content, and event details at each pipeline stage to help diagnose voice-agent integration issues. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 02:47:21 -08:00
hailin	1d5c834dfe	feat: add event buffering to agent WS gateway for late subscribers Buffer stream events when no WS clients are subscribed yet, then replay them when a client subscribes. This eliminates the race condition where events are lost between task creation and WS subscription. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 02:41:38 -08:00
hailin	370e32599f	fix: subscribe to agent WS before creating task to avoid race condition The engine stream could emit text events before the voice pipeline subscribed, causing all text to be lost. Now we connect and subscribe first, then POST the task. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 02:35:57 -08:00
hailin	82d12a5ff5	feat: mount voice model cache volumes to avoid re-downloading on restart Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 02:28:28 -08:00
hailin	abf5e29419	feat: route voice pipeline through agent-service instead of direct LLM Voice calls now use the same agent task + WS subscription flow as the chat UI, enabling tool use and command execution during voice sessions. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 00:47:31 -08:00
hailin	7afbd54fce	fix: rewrite voice pipeline for direct WebSocket I/O, fix TTS and navigation Root cause: Pipecat's WebsocketServerTransport creates its own WebSocket server on (host,port) and expects FrameProcessor subclasses. Our code was passing a FastAPI WebSocket object as 'host' and using plain STT/TTS/VAD service classes that aren't FrameProcessors. The pipeline crashed immediately when receiving audio, causing "disconnects when speaking". Changes: - base_pipeline.py: Complete rewrite — replaced Pipecat Pipeline with direct async loop: WebSocket → VAD → STT → Claude LLM → TTS → WebSocket. Supports barge-in (interrupt TTS when user speaks), audio chunking, and 24kHz→16kHz TTS resampling. - session_router.py: Pass WebSocket directly to pipeline instead of wrapping in AppTransport. - app_transport.py: Deprecated (no longer needed). - kokoro_service.py: Fix misaki compatibility (MutableToken→MToken rename), use correct Chinese voice 'zf_xiaoxiao', handle torch tensors. - main.py: Apply misaki monkey-patch before importing kokoro. - settings.py: Change default TTS voice from 'zh_female_1' (non-existent) to 'zf_xiaoxiao' (valid Kokoro-82M Chinese female voice). - requirements.txt: Remove pipecat-ai dependency, pin kokoro==0.3.5 + misaki==0.7.17, add Chinese NLP deps (pypinyin, cn2an, jieba, ordered-set). - agent_call_page.dart: Wrap each cleanup step in try/catch to ensure Navigator.pop() always executes after call ends. Add 3s timeout on session delete request. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 23:34:35 -08:00
hailin	6cd53e713c	fix: bypass JWT for voice WebSocket route (fixes 401 on WS upgrade) 根因：Kong 日志显示 voice WebSocket 连接被 JWT 插件返回 401，因为 WebSocket RFC 6455 不支持自定义 header，Flutter 的 WebSocketChannel.connect 无法携带 Authorization header。修复策略（业界标准做法）： 1. Kong: 将 voice-service 的 JWT 从 service 级别改为 route 级别，仅在 voice-api 和 twilio-webhook 路由启用 JWT， voice-ws 路由免除（session 创建已通过 JWT 验证， session_id 本身作为认证凭据） 2. 后端: session_router 返回的 websocket_url 改为 /ws/voice/{session_id}（匹配 Kong voice-ws 路由路径） 3. FastAPI: 在 app 级别增加 /ws/voice/{session_id} 顶级 WebSocket 路由，委托给 session_router 的 handler Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 21:30:11 -08:00
hailin	2a87dd346e	fix: send empty JSON body to voice session endpoint (fixes 422) FastAPI 的 create_session 端点声明了 Pydantic request body (CreateSessionRequest)，虽然所有字段都有默认值，但 FastAPI 仍要求请求包含有效 JSON body（至少 {}）。Flutter 端 dio.post 未传 data 参数导致 Content-Type 缺失，FastAPI 返回 422 Unprocessable Entity。修复：添加 data: {} 发送空 JSON 对象。 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 21:21:48 -08:00
hailin	45c54acb87	fix: improve voice call page UI centering and error display 1. 布局居中：将 Column 包裹在 Center 中，所有文本添加 textAlign: TextAlign.center，确保头像、标题、副标题在各种屏幕尺寸上居中显示。 2. 错误展示优化：将 SnackBar 大面积红色块替换为行内错误卡片，采用圆角容器 + error icon + 简洁文案，视觉上更融洽。新增 _errorMessage 状态字段 + _friendlyError() 方法，将 DioException 等异常转换为中文友好提示（如 "语音服务暂不可用 (503)"），避免用户看到大段英文 stacktrace。 3. 错误状态清理：点击接听时自动清除上一次的 _errorMessage。 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 21:10:49 -08:00
hailin	b7814d42a9	fix: resolve 3 timeline UI bugs (blank start, spinner style, tool status) 1. 任务开始时空白状态：当 agent streaming 启动但尚无 assistant 响应时，在时间线末尾插入虚拟 "处理中..." 节点（带脉动 * 动画），避免用户发送 prompt 后界面无任何反馈。 (chat_page.dart: _needsWorkingNode + ListView itemCount+1) 2. * 动画从旋转改为脉动：_SpinnerDot 由 Transform.rotate 改为 Transform.scale（0.6x↔1.2x 缩放 + 透明度 0.6↔1.0 呼吸）， duration 从 1000ms 降至 800ms 并启用 reverse，视觉效果类似星星闪烁而非机械旋转。 (timeline_event_node.dart: _SpinnerDotState) 3. 工具执行完成后状态卡在 spinner：ToolResultEvent 到达时仅创建新 toolResult 消息，未回溯更新对应 toolUse 消息的 ToolStatus，导致时间线上工具节点永远显示 executing spinner。修复：在 ToolResultEvent handler 中向前查找最近的 executing 状态的 toolUse 消息，将其 status 更新为 completed/error。 (chat_providers.dart: ToolResultEvent case) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 18:13:30 -08:00
hailin	20325a84bd	feat: redesign chat UI from bubble style to timeline workflow Replace traditional chat bubble layout with a Claude Code-inspired timeline/workflow design: - Vertical gray line connecting sequential event nodes - Colored dots for each event (green=done, red=error, yellow=warning) - Animated spinning asterisk (*) on active nodes - Streaming text with blinking cursor in timeline nodes - Tool execution shown as code blocks within timeline - User prompts as distinct nodes with person icon New file: timeline_event_node.dart (TimelineEventNode, CodeBlock) Rewritten: chat_page.dart (timeline layout, no more bubbles) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 17:33:42 -08:00

... 3 4 5 6 7 ...

356 Commits All Branches Search

356 Commits

All Branches