hailin/it0 - it0 - AI Wolves Team

Commit Graph

Author	SHA1	Message	Date
hailin	112c445143	fix: resolve websockets version conflict and use CPU-only torch - Upgrade websockets from ==12.0 to >=13.0 (openai[realtime] requires >=13) - Install torch CPU-only build separately in Dockerfile to avoid ~2GB CUDA download - Remove torch from requirements.txt (installed via --index-url cpu wheel) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-28 09:02:31 -08:00
hailin	94a14b3104	feat: migrate voice call from WebSocket/PCM to LiveKit WebRTC 实时语音对话架构迁移：WebSocket → LiveKit WebRTC ## 背景原语音通话架构基于 FastAPI WebSocket 传输原始 PCM，管道串行执行（VAD → 批量STT → Agent → 攒句 → 批量TTS），首音频延迟约 6 秒。迁移到 LiveKit Agents 框架后，利用 WebRTC 传输 + 流水线并行，预期延迟降至 1.5-2 秒。 ## 架构 Flutter App ←── WebRTC (Opus/UDP) ──→ LiveKit Server ←──→ Voice Agent livekit_client (自部署, Go) (Python, LiveKit Agents SDK) ├─ VAD (Silero) ├─ STT (faster-whisper / OpenAI) ├─ LLM (自定义插件 → agent-service) └─ TTS (Kokoro / OpenAI) 关键设计：LLM 不直接调用 Claude API，而是通过自定义插件代理到现有 agent-service，保留 Tool Use、会话历史、租户隔离等能力。 ## 新增服务 ### voice-agent (packages/services/voice-agent/) LiveKit Agent Worker，包含： - agent.py: 入口，prewarm() 预加载模型，entrypoint() 编排会话 - plugins/agent_llm.py: 自定义 LLM 插件，代理 agent-service API - POST /api/v1/agent/tasks 创建任务 - WS /ws/agent 订阅流式事件 (stream_event) - 跨轮复用 session_id 保持对话上下文 - plugins/whisper_stt.py: 本地 faster-whisper STT (批量识别) - plugins/kokoro_tts.py: 本地 Kokoro-82M TTS (24kHz PCM) - config.py: pydantic-settings 配置 ### LiveKit Server (deploy/docker/) - livekit.yaml: 信令端口 7880, RTC TCP 7881, UDP 50000-50200 - docker-compose.yml: 新增 livekit-server + voice-agent 容器 ### LiveKit Token 端点 - voice-service/src/api/livekit_token.py: POST /api/v1/voice/livekit/token 生成 Room JWT，嵌入 auth_header 到 AgentDispatch metadata ## Flutter 客户端改造 - agent_call_page.dart: 从 ~814 行简化到 ~380 行 - 替换: WebSocketChannel, AudioRecorder, PcmPlayer, 手动心跳/重连 - 使用: Room.connect(), setMicrophoneEnabled(true), LiveKit 事件监听 - 波形动画改用 participant.audioLevel - pubspec.yaml: 添加 livekit_client: ^2.3.0 - app_config.dart: 增加 livekitUrl 字段 - api_endpoints.dart: 增加 livekitToken 端点 ## 配置说明 (环境变量) - STT_PROVIDER: local (默认, faster-whisper) / openai - TTS_PROVIDER: local (默认, Kokoro) / openai - WHISPER_MODEL: base (默认) / small / medium / large - WHISPER_LANGUAGE: zh (默认) - KOKORO_VOICE: zf_xiaoxiao (默认) - DEVICE: cpu (默认) / cuda ## 不变的部分 - agent-service: 完全不改，voice-agent 通过现有 API 调用 - voice-service 核心: pipeline/STT/TTS/VAD 保留 (Twilio 备用) - Kong 网关: 现有路由不变 - 数据库: 无 schema 变更 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-28 08:55:33 -08:00
hailin	4987cad881	fix: increase body parser limit to 50mb for large PDF uploads Claude API supports up to 32MB PDFs; base64 encoding adds ~33% overhead. 50mb body limit covers the maximum single-document upload case. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-28 05:35:43 -08:00
hailin	c9367ee22a	fix: PDF attachments sent as document blocks instead of image blocks PDF files were incorrectly wrapped as type:'image' content blocks, causing Claude API to reject them as "Invalid image data". - conversation-context.service: check mediaType for application/pdf, use type:'document' block (Anthropic native PDF support) instead - claude-agent-sdk-engine: detect both 'image' and 'document' blocks when deciding to build multimodal SDK prompt Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-28 05:27:41 -08:00
hailin	9b467924a0	fix: add attachments JSONB column to conversation_messages schema Update migration files to include the attachments column for multimodal image storage. Also add ALTER TABLE migration for existing deployments. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-28 04:18:35 -08:00
hailin	2c657e2b4c	fix: use NestJS native useBodyParser instead of direct express import The direct `import * as express from 'express'` caused a MODULE_NOT_FOUND error in the Docker production image since express is only available as a transitive dependency via @nestjs/platform-express. Use NestExpressApplication.useBodyParser() which is the official NestJS API. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-28 04:01:54 -08:00
hailin	b9c3bfdf91	feat: add multimodal image support to Claude Agent SDK engine - SDK engine now constructs AsyncIterable<SDKUserMessage> with image content blocks when attachments are present in conversationHistory, using the SDK's native multimodal prompt format - CLI engine logs a warning when images are detected, since the `-p` flag only accepts text (upstream Claude CLI limitation) - Both SDK and API engines now fully support multimodal image input Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-28 03:38:59 -08:00
hailin	e4c2505048	feat: add multimodal image input with streaming markdown optimization Two major features in this commit: 1. Streaming Markdown Rendering Optimization - Replace deprecated flutter_markdown with gpt_markdown (active, AI-optimized) - Real-time markdown rendering during streaming (was showing raw syntax) - Solid block cursor (█) instead of AnimationController blink - 80ms token throttle buffer reducing rebuilds from per-token to ~12.5/sec - RepaintBoundary isolation for markdown widget repaints - StreamTextWidget simplified from StatefulWidget to StatelessWidget 2. Multimodal Image Input (camera + gallery + display) - Flutter: image_picker for gallery/camera, base64 encoding, attachment preview strip with delete, thumbnails in sent messages - Data layer: List<String>? → List<Map<String, dynamic>>? for structured attachment payloads through datasource/repository/usecase - ChatAttachment model with base64Data, mediaType, fileName - ChatMessage entity + ChatMessageModel both support attachments field - Backend DTO, Entity (JSONB), Controller, ConversationContextService all extended to receive, store, and reconstruct Anthropic image content blocks in loadContext() - Claude API engine skips duplicate user message when history already ends with multimodal content blocks - NestJS body parser limit raised to 10MB for base64 image payloads - Android CAMERA permission added to manifest - Image.memory uses cacheWidth/cacheHeight for memory efficiency - Max 5 images per message enforced in UI Data flow: ImagePicker → base64Encode → ChatAttachment → POST body → DB (JSONB) → loadContext → Anthropic image content blocks → Claude API Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-28 03:24:17 -08:00
hailin	50dbb641a3	fix: comprehensive hardening of agent task cancel/inject/approve flows 6 rounds of systematic audit identified and fixed 14 bugs across backend controller and Flutter client: ## Backend (agent.controller.ts) Security & Tenant Isolation: - Add @TenantId + ForbiddenException check to cancelTask, injectMessage, approveCommand — all 4 write endpoints now enforce tenant isolation - Add tenantId check on session reuse in executeTask to prevent cross-tenant session hijacking Architecture & Correctness: - Extract shared runTaskStream() from inline fire-and-forget block, used by both executeTask and injectMessage to reduce duplication - Use session.engineType (not getActiveEngine()) in cancelTask, injectMessage, approveCommand — fixes wrong-engine-cancel when global engine config is switched after task creation - Add concurrent task prevention: executeTask checks for existing RUNNING task on same session and cancels it before starting new one - Add runningTasks Map to track task promises, awaitTaskCleanup() helper with 3s timeout for inject to wait for partial text save - captureSdkSessionId() captures SDK session ID into metadata without DB save (callers persist), preventing fire-and-forget race Cancel/Reject Improvements: - cancelTask: idempotent (returns early if already CANCELLED/COMPLETED), session stays 'active' (was 'cancelled'), emits cancelled WS event - approveCommand reject: session stays 'active' (was 'cancelled'), now emits cancelled WS event so Flutter stream listeners clean up - approveCommand approved: collect text events and save assistant response to conversation history on completion (was missing) Minor: - task.result! non-null assertion → task.result ?? 'Unknown error' - Add findRunningBySessionId() to TaskRepository ## Flutter API Contract Fix: - approveCommand: route changed from /api/v1/ops/approvals/:id/approve to /api/v1/agent/tasks/:id/approve with {approved: true} body - rejectCommand: route changed from /api/v1/ops/approvals/:id/reject to /api/v1/agent/tasks/:id/approve with {approved: false} body Resource Management: - ChatNotifier.dispose() now disconnects WebSocket to prevent connection leak when navigating away from chat Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-27 22:20:46 -08:00
hailin	d5f663f7af	feat: inject-message support for mid-stream task interruption Backend (agent-engine.port.ts): - Add `cancelled` event type: emitted when a task is cancelled (user-initiated or injection), so Flutter can close the old stream cleanly - Add `task_info` event type: emitted after inject to pass the new taskId to the client, enabling cancel/re-inject on the replacement task Flutter (features/chat/): - ChatState: track current `taskId` alongside `sessionId`; clear on completion or error - Handle `TaskInfoEvent`: update taskId in state when server issues a new task - Handle `CancelledEvent`: treat as stream termination (agentStatus → idle) - MessageType.interrupted: new UI node (warning style) for mid-stream cancels - _inject(): send text as an inject request while streaming; backend cancels the current task and starts a new one with the injected message - Input area: during streaming, hint changes to "追加指令...", Enter key calls _inject() instead of _send(), and both inject-send + stop buttons are shown - isAwaitingApproval kept separate from isStreaming so approval flow is not blocked by inject mode Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-27 21:33:50 -08:00
hailin	ce4e7840ec	fix: route AgentSkillService to per-tenant schema to match SDK engine Previously AgentSkillService wrote skills to public.agent_skills (TypeORM entity with tenantId column filter), while ClaudeAgentSdkEngine read from it0_t_{tenantId}.skills (per-tenant schema). The two tables were never connected, so any skill added via the CRUD API was invisible to the agent. This fix: - Rewrites AgentSkillService to use DataSource + raw SQL against the per-tenant schema it0_t_{tenantId}.skills - Maps API fields: script→content, enabled→is_active - Removes AgentSkillRepository and AgentSkill entity from module (no longer needed) - CRUD API response shape is unchanged (fields mapped back to script/enabled) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-27 21:21:36 -08:00
hailin	3278696f4c	feat: inject tenant skills into agent system prompt Load active skills from the tenant's schema `skills` table and append them to the system prompt before passing to the Claude Agent SDK. This closes the gap where skills existed in the DB but were never surfaced to the agent during task execution. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 20:42:15 -08:00
hailin	36d36acad4	fix: set tenantId when creating credentials in inventory-service The createCredential method was missing the tenantId assignment, causing a NOT NULL constraint violation on the credentials table. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 10:52:14 -08:00
hailin	51b348e609	feat: complete tenant member management (CRUD + delete tenant) Backend: add 5 missing endpoints to TenantController: - DELETE /tenants/:id (deprovision schema + cleanup) - GET /tenants/:id/members (query tenant schema users) - PATCH /tenants/:id/members/:memberId (change role) - DELETE /tenants/:id/members/:memberId (remove member) - PUT /tenants/:id (alias for frontend compatibility) Frontend: add member actions to tenant detail page: - Role column changed to dropdown selector - Added remove member button with confirmation - Added updateMember and removeMember mutations Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 10:00:09 -08:00
hailin	bc7e32061a	fix: improve voice call reconnection robustness Server side (session_router.py): - /reconnect now accepts sessions in "active" state (not just "disconnected") - When client reconnects to an active session, the old WebSocket/pipeline is automatically replaced when the new WebSocket connects - Only truly terminal states (e.g. "ended") return 409 Flutter side (agent_call_page.dart): - Distinguish terminal errors (404 session gone, 409 ended) from transient errors (network timeout, server unreachable) in reconnect loop - Terminal errors break immediately instead of wasting retry attempts - Extract _connectWebSocket() helper for cleaner reconnect flow - Add DioException handling for proper HTTP status code inspection Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 07:33:34 -08:00
hailin	75083f23aa	debug: add TTS send_bytes logging to pipeline Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 06:19:18 -08:00
hailin	5be7f9c078	fix: resample OpenAI TTS output from 24kHz to 16kHz WAV OpenAI TTS returns 24kHz audio which Android MediaPlayer can't play via FlutterSound's pcm16WAV codec. Request raw PCM and resample to 16kHz before wrapping in WAV header, matching the local TTS format. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 05:38:39 -08:00
hailin	4456550393	feat: lazy-load local TTS/STT models on first request Local /synthesize and /transcribe endpoints now auto-load Kokoro/Whisper models on first call instead of returning 503 when not pre-loaded at startup. This allows switching between Local and OpenAI providers in the Flutter test page without requiring server restart. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 04:38:49 -08:00
hailin	cc0f06e2be	feat: SDK engine native resume with per-tenant HOME isolation Replace prompt-prefix workaround with SDK's native resume mechanism. Each tenant gets isolated HOME directory (/data/claude-tenants/{tenantId}) to prevent cross-tenant session file mixing. SDK session IDs are persisted in session.metadata for cross-request resume support. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 02:27:38 -08:00
hailin	2403ce5636	feat: multi-turn conversation context management with session history UI Implement DB-based conversation message storage (engine-agnostic) that works across both Claude API and Agent SDK engines. Add ChatGPT/Claude-style conversation history drawer in Flutter with date-grouped session list, session switching, and new chat functionality. Backend: entity, repository, context service, migration 004, session/message API endpoints. Flutter: ConversationDrawer, sessionId flow from backend response via SessionInfoEvent, session list/switch/delete support. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 19:04:35 -08:00
hailin	c02c2a9a11	feat: add OpenAI TTS/STT provider support in voice pipeline - Add STT_PROVIDER/TTS_PROVIDER config (local or openai) in settings - Pipeline uses OpenAI API for STT/TTS when provider is "openai" - Skip loading local models (Kokoro/faster-whisper) when using OpenAI - VAD (Silero) always loads for speech detection Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 09:27:38 -08:00
hailin	f8f0d17820	fix: disable SSL verification for OpenAI proxy with self-signed cert Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 08:59:06 -08:00
hailin	d43baed3a5	feat: add OpenAI TTS/STT API endpoints for comparison testing - Add openai package to voice-service requirements - Add /api/v1/test/tts/synthesize-openai (tts-1/tts-1-hd/gpt-4o-mini-tts) - Add /api/v1/test/stt/transcribe-openai (gpt-4o-transcribe/whisper-1) - Add OPENAI_API_KEY and OPENAI_BASE_URL env vars to voice-service - Flutter test page: SegmentedButton to toggle Local/OpenAI provider - All endpoints maintain same response format for easy comparison Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 07:20:03 -08:00
hailin	5d4fd96d43	feat: streaming claude-api engine, engineType override, fix voice test page - Claude API engine now uses streaming API (messages.stream) for real-time text delta output instead of waiting for full response - Agent controller accepts optional engineType body parameter to allow callers (e.g. voice pipeline) to select a specific engine - Fix voice_test_page.dart compilation error: replace audioplayers (not installed) with flutter_sound (already in pubspec.yaml) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 05:30:11 -08:00
hailin	6e832c7615	feat: add voice I/O test page in Flutter settings - TTS: text input → Kokoro synthesis → audio playback - STT: long-press record → faster-whisper transcription - Round-trip: record → STT → TTS → playback - Added /api/v1/test route to Kong gateway for voice-service - Accessible from Settings → 语音 I/O 测试 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 05:16:10 -08:00
hailin	0bd050c80f	feat: add STT test and round-trip test to voice test page - STT: record from mic or upload audio file → faster-whisper transcription - Round-trip: record → STT → TTS → playback (full pipeline test) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 05:08:00 -08:00
hailin	0aa20cbc73	feat: add temporary TTS test page at /api/v1/test/tts Browser-accessible page to test text-to-speech synthesis without going through the full voice pipeline. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 05:06:02 -08:00
hailin	740f8f5f88	fix: sentence splitting bug in voice pipeline TTS streaming When the first punctuation mark appeared before _MIN_SENTENCE_LEN chars, the regex search would always find it first and skip it, permanently blocking all subsequent sentence splits. Fix by advancing search_start past short matches instead of breaking out of the loop. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 05:03:05 -08:00
hailin	79fae0629e	chore: upgrade claude-agent-sdk to ^0.2.52 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 04:12:03 -08:00
hailin	2a150dcff5	fix: prevent error event from overriding completed status in controller Add finished guard so that once a task reaches completed/error terminal state, subsequent events don't flip the status back. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 03:49:21 -08:00
hailin	8e4bd573f4	fix: deduplicate text events from SDK stream_event and assistant message SDK sends text both via stream_event deltas (token-level) and assistant message (complete block). Track hasStreamedText flag per session to skip duplicate text extraction from assistant messages. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 03:31:48 -08:00
hailin	65e68a0487	feat: streaming TTS — synthesize per-sentence as agent tokens arrive Replace batch TTS (wait for full response) with streaming approach: - _agent_generate → _agent_stream async generator (yield text chunks) - _process_speech accumulates tokens, splits on sentence boundaries - Each sentence is TTS'd and sent immediately while more tokens arrive - First audio plays within ~1s of agent response vs waiting for full text Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 03:14:22 -08:00
hailin	aa2a49afd4	fix: extract text from assistant message + fix event data parsing Root causes found: 1. SDK engine only emitted 'completed' without 'text' events because mapSdkMessage skipped text blocks in 'assistant' messages (assumed stream_event deltas would provide them, but SDK didn't send deltas) 2. Voice pipeline read evt_data.data.content but engine events are flat (evt_data.content) — so even if text arrived, it was never extracted Fixes: - Extract text/thinking blocks from assistant messages in SDK engine - Fix voice pipeline to read content directly from evt_data, not nested Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 03:01:25 -08:00
hailin	a7b42e6b98	feat: add detailed logging to agent engine and task controller Log every SDK message type, event emission, and stream lifecycle to diagnose why text events are missing in voice-agent flow. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 02:56:09 -08:00
hailin	0dbe711ed3	feat: add detailed logging to voice pipeline (STT/Agent/TTS timing) Log timestamps, content, and event details at each pipeline stage to help diagnose voice-agent integration issues. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 02:47:21 -08:00
hailin	1d5c834dfe	feat: add event buffering to agent WS gateway for late subscribers Buffer stream events when no WS clients are subscribed yet, then replay them when a client subscribes. This eliminates the race condition where events are lost between task creation and WS subscription. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 02:41:38 -08:00
hailin	370e32599f	fix: subscribe to agent WS before creating task to avoid race condition The engine stream could emit text events before the voice pipeline subscribed, causing all text to be lost. Now we connect and subscribe first, then POST the task. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 02:35:57 -08:00
hailin	abf5e29419	feat: route voice pipeline through agent-service instead of direct LLM Voice calls now use the same agent task + WS subscription flow as the chat UI, enabling tool use and command execution during voice sessions. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 00:47:31 -08:00
hailin	7afbd54fce	fix: rewrite voice pipeline for direct WebSocket I/O, fix TTS and navigation Root cause: Pipecat's WebsocketServerTransport creates its own WebSocket server on (host,port) and expects FrameProcessor subclasses. Our code was passing a FastAPI WebSocket object as 'host' and using plain STT/TTS/VAD service classes that aren't FrameProcessors. The pipeline crashed immediately when receiving audio, causing "disconnects when speaking". Changes: - base_pipeline.py: Complete rewrite — replaced Pipecat Pipeline with direct async loop: WebSocket → VAD → STT → Claude LLM → TTS → WebSocket. Supports barge-in (interrupt TTS when user speaks), audio chunking, and 24kHz→16kHz TTS resampling. - session_router.py: Pass WebSocket directly to pipeline instead of wrapping in AppTransport. - app_transport.py: Deprecated (no longer needed). - kokoro_service.py: Fix misaki compatibility (MutableToken→MToken rename), use correct Chinese voice 'zf_xiaoxiao', handle torch tensors. - main.py: Apply misaki monkey-patch before importing kokoro. - settings.py: Change default TTS voice from 'zh_female_1' (non-existent) to 'zf_xiaoxiao' (valid Kokoro-82M Chinese female voice). - requirements.txt: Remove pipecat-ai dependency, pin kokoro==0.3.5 + misaki==0.7.17, add Chinese NLP deps (pypinyin, cn2an, jieba, ordered-set). - agent_call_page.dart: Wrap each cleanup step in try/catch to ensure Navigator.pop() always executes after call ends. Add 3s timeout on session delete request. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 23:34:35 -08:00
hailin	6cd53e713c	fix: bypass JWT for voice WebSocket route (fixes 401 on WS upgrade) 根因：Kong 日志显示 voice WebSocket 连接被 JWT 插件返回 401，因为 WebSocket RFC 6455 不支持自定义 header，Flutter 的 WebSocketChannel.connect 无法携带 Authorization header。修复策略（业界标准做法）： 1. Kong: 将 voice-service 的 JWT 从 service 级别改为 route 级别，仅在 voice-api 和 twilio-webhook 路由启用 JWT， voice-ws 路由免除（session 创建已通过 JWT 验证， session_id 本身作为认证凭据） 2. 后端: session_router 返回的 websocket_url 改为 /ws/voice/{session_id}（匹配 Kong voice-ws 路由路径） 3. FastAPI: 在 app 级别增加 /ws/voice/{session_id} 顶级 WebSocket 路由，委托给 session_router 的 handler Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 21:30:11 -08:00
hailin	74be945e4a	feat: enable token-level streaming and fix duplicate message bubble Backend: - Add includePartialMessages: true to SDK query options - Handle stream_event/content_block_delta for real-time text streaming - Skip text/thinking blocks from complete assistant messages (already streamed via deltas) to avoid duplication - Change default result summary to empty string Flutter: - Only show CompletedEvent summary when no assistant text was streamed (prevents duplicate message bubble) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 17:24:48 -08:00
hailin	5f827b0961	fix: revert ws/wss protocols - Kong OSS handles WS over http/https Kong 3.7 OSS doesn't support ws/wss protocol identifiers (Enterprise only). WebSocket upgrades are handled transparently over http/https protocols. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 17:02:53 -08:00
hailin	86d7cac631	fix: replace Socket.IO with raw WebSocket to fix 502 on /ws/agent Socket.IO requires its own handshake protocol (EIO=4) which Kong cannot proxy as a plain WebSocket upgrade, causing 502 Bad Gateway. Switch to @nestjs/platform-ws (WsAdapter) with manual session room tracking so Flutter's IOWebSocketChannel can connect directly. Also add ws/wss protocols to Kong WebSocket routes. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 16:52:43 -08:00
hailin	9cdc4933dc	fix: add python-multipart dependency for voice-service Required by FastAPI for form/file upload parsing. Missing dependency may cause import errors and container restart loops. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 16:10:50 -08:00
hailin	3cb9ebd407	fix: release QueryRunner connections to prevent pool exhaustion TenantAwareRepository.getRepository() was calling createQueryRunner() without ever releasing it, causing database connection pool exhaustion. This caused ops-service (and eventually other services) to hang on all API requests once the pool filled up. Replaced getRepository() with withRepository() pattern that wraps operations in try/finally to always release the QueryRunner. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 15:55:06 -08:00
hailin	a6cd3c20d9	feat: add WebSocket robustness to voice call (heartbeat, reconnect, jitter buffer) Addresses reliability gaps in the real-time voice WebSocket connection between Flutter client and Python voice-service backend. Backend (voice-service): - Heartbeat: new _heartbeat_sender coroutine sends JSON ping text frames every 15s alongside the Pipecat pipeline; failed send = dead connection - Session preservation: on WebSocket disconnect, sessions are now marked "disconnected" with a timestamp instead of being deleted, allowing reconnection within a configurable TTL (default 60s) - Reconnect endpoint: POST /sessions/{id}/reconnect verifies the session is alive and in "disconnected" state, returns fresh websocket_url - Reconnect-aware WS handler: detects "disconnected" sessions, cancels stale pipeline tasks, creates a new pipeline, sends "session.resumed" - Background cleanup: asyncio loop every 30s removes sessions that have been disconnected longer than session_ttl - Structured event protocol: text frames = JSON control messages (ping/pong/session.resumed/session.ended/error), binary = PCM audio - New settings: session_ttl (60s), heartbeat_interval (15s), heartbeat_timeout (45s) Flutter (agent_call_page.dart): - Heartbeat monitoring: tracks last server ping timestamp, triggers reconnect if no ping received in 45s (3 missed intervals) - Auto-reconnect: exponential backoff (1s→2s→4s→8s→16s), max 5 attempts; calls /reconnect endpoint to verify session, rebuilds WebSocket, resets audio buffer, restarts heartbeat - Reconnecting UI: yellow warning banner "重新连接中... (N/5)" with spinner overlay during reconnection attempts - WebSocket data routing: _onWsData distinguishes String (JSON control) from binary (audio) frames, handles ping/session.resumed/session.ended - User-initiated disconnect guard: _userEndedCall flag prevents reconnect attempts when user intentionally hangs up - session_id field compatibility: supports session_id/sessionId/id Flutter (pcm_player.dart): - Jitter buffer: queues incoming PCM chunks, starts playback only after accumulating 4800 bytes (150ms at 16kHz 16-bit mono) to smooth out network timing variance - reset() method: clears buffer on reconnect to discard stale audio - Buffer underrun handling: re-enters buffering phase if queue empties Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 07:32:19 -08:00
hailin	d4391eef97	fix: run services as non-root user for SDK bypassPermissions SDK blocks bypassPermissions when running as root for security. Add non-root 'appuser' to Dockerfile.service and update volume mounts to use /home/appuser/.claude paths. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 06:41:10 -08:00
hailin	04a18a7899	fix: use acceptEdits mode and mount .claude.json for SDK - bypassPermissions blocked by SDK when running as root - Switch to acceptEdits with canUseTool for programmatic control - Mount .claude.json config file into container Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 06:37:31 -08:00
hailin	db1d0620f2	debug: add stderr callback to SDK engine for error visibility Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 06:34:42 -08:00
hailin	d40f66ce14	fix: use bypassPermissions mode for headless SDK execution In a Docker container without TTY, permissionMode 'default' blocks waiting for interactive permission prompts. Switch to bypassPermissions with canUseTool callback for programmatic risk-based access control. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 06:30:38 -08:00
hailin	14e8d7019a	fix: use dynamic import helper for ESM-only claude-agent-sdk tsc with module=commonjs converts `await import()` to require(), which breaks ESM-only packages. Use Function('return import()') workaround to preserve native dynamic import at runtime. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 06:18:04 -08:00
hailin	b963b7d4da	feat: enable SDK subscription mode with OAuth credentials mount - Mount ~/.claude/ into agent-service container for OAuth token access - Switch default engine to claude_agent_sdk - Remove ANTHROPIC_API_KEY from env in subscription mode so SDK uses OAuth - Keep API key mode for per-tenant billing Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 06:14:45 -08:00
hailin	9126225317	fix: disable TLS verification for Anthropic proxy (self-signed cert) Follow iConsulting pattern: set NODE_TLS_REJECT_UNAUTHORIZED=0 when ANTHROPIC_BASE_URL is configured, enabling connection through the self-signed proxy at 67.223.119.33. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 05:52:02 -08:00
hailin	810dcd7def	feat: switch default engine to claude_api with base URL support - Change AGENT_ENGINE_TYPE from claude_code_cli to claude_api in docker-compose - Add ANTHROPIC_BASE_URL env var support to claude-api-engine - Add ANTHROPIC_BASE_URL to agent-service environment in docker-compose Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 05:45:08 -08:00
hailin	9a1ecf10ec	fix: add restart policy, global error handlers, and fix tenant schema bug - Add restart: unless-stopped to all 12 Docker services - Add process.on(unhandledRejection/uncaughtException) to all 7 service main.ts - Fix handleEventTrigger using tenantId UUID as schema name instead of slug lookup - Wrap Redis event subscription callbacks in try/catch Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 05:30:34 -08:00
hailin	1c291ce6c0	fix: Scheduler 用 slug 构建 tenant schema 名 tenant schema 是 it0_t_{slug}（如 it0_t_default），不是 it0_t_{uuid}。 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 05:03:04 -08:00
hailin	09274aa6af	fix: Scheduler 查询 public.tenants 而非 it0_shared.tenants 数据库实际 schema 是 public.tenants，不是 it0_shared.tenants。 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 05:00:17 -08:00
hailin	840318f449	fix: Scheduler 缺少 tenant 上下文导致 ops-service 卡死根因: @Cron 定时任务在 HTTP 请求上下文之外运行， TenantAwareRepository 需要 AsyncLocalStorage 中的 tenant 信息，每分钟抛 "Tenant context not initialized" 错误。修复: - scanCronOrders: 查 it0_shared.tenants 获取所有活跃租户，在 TenantContextService.run() 上下文中逐一执行 - handleEventTrigger: 从 Redis event 中提取 tenantId，同样包裹在 TenantContextService.run() 中 - 每个 tenant 循环加 try/catch 防止单个租户出错影响其他 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 04:55:52 -08:00
hailin	092a561867	feat: 完成 iAgent App 三大功能 + 修复租户上下文 ## 功能一：设置页（完整实现） - 新增浅色主题（lightTheme），支持深色/浅色/跟随系统三种模式 - app.dart 接入 themeMode 动态切换 - 设置页完整重写：个人信息编辑、修改密码、主题切换、通知开关 - 新增 settings_remote_datasource 对接后端 admin/settings API - settings_providers 新增 AccountProfileNotifier 管理远程个人资料 ## 功能二：语音通话（音频集成） - 添加 flutter_sound 依赖，创建 PcmPlayer 流式 PCM 播放器 - agent_call_page 替换空壳：真实麦克风采集（record + GTCRN 降噪） - 真实 PCM 16kHz 流式播放，基于 RMS 能量驱动波形动画 - 修复 WebSocket URL 路径：/ws/voice/ → /api/v1/voice/ws/ - voice_repository_impl 支持后端返回相对路径自动拼接 ## 功能三：推送通知（WebSocket MVP） - 添加 flutter_local_notifications + socket_io_client 依赖 - 创建 AppNotification 实体、NotificationService（Socket.IO 连接 comm-service） - 通知 providers：列表管理 + 未读计数 - 登录后自动连接通知服务，登出断开 - 底部导航 Alerts 标签添加未读角标（Badge） - AndroidManifest 添加 POST_NOTIFICATIONS 权限 - main.dart 初始化本地通知插件 ## 修复：租户上下文未初始化（500错误） - 根因：登录后未设置 currentTenantIdProvider，导致 X-Tenant-Id 头缺失 - Flutter 端：login() 成功后从 JWT 设置 tenantId，logout 时清除 - 后端：tenant-context.middleware 增加 JWT tenantId 回退逻辑 - AuthUser 模型新增 tenantId 字段解析新增 5 个文件，修改 16 个文件，添加 3 个依赖包 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 01:10:52 -08:00
hailin	4e1b75483d	fix: 修复 .gitignore 误忽略 Flutter data/models/ 源码导致构建失败问题描述: 在其他机器上构建报错: "Error when reading 'lib/features/auth/data/models/auth_response.dart': 系统找不到指定的路径" 导致 AuthUser、AuthResponse 等类型找不到，编译失败。根本原因: 根目录 .gitignore 第75行 "models/" 规则本意是忽略 ML 模型大文件，但该规则匹配了所有目录名为 models/ 的路径，包括 Flutter 项目中 DDD 架构的 data/models/ 源码目录（共 11 个 models/ 目录、10 个 .dart 文件）。这些文件在本地存在但从未被 Git 追踪，其他机器 pull 后缺失这些文件。修复内容: 1. 修改 .gitignore: 将宽泛的 "models/" 替换为精确的规则 - packages/services/voice-service/models/ — voice-service 下载的 ML 模型 - .pt, .pth, *.safetensors — PyTorch/HuggingFace 模型二进制文件 - 不再影响 Flutter 的 data/models/ 源码目录 2. 提交之前被忽略的 10 个 Flutter model 文件: - auth/data/models/auth_response.dart — 登录响应 (accessToken, refreshToken, user) - chat/data/models/chat_message_model.dart — 聊天消息模型 - chat/data/models/session_model.dart — 会话模型 - chat/data/models/stream_event_model.dart — SSE 流事件模型 - servers/data/models/server_model.dart — 服务器状态模型 - approvals/data/models/approval_model.dart — 审批请求模型 - alerts/data/models/alert_event_model.dart — 告警事件模型 - agent_call/data/models/voice_session_model.dart — 语音会话模型 - standing_orders/data/models/standing_order_model.dart — 常设指令模型 - tasks/data/models/task_model.dart — 任务模型 3. 同时提交: - it0_app/test/widget_test.dart — Flutter 默认测试 - packages/services/voice-service/src/models/__init__.py — Python 模块初始化 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 16:29:03 -08:00
hailin	a568558585	feat: replace speech_to_text with GTCRN ML noise reduction + backend STT Replace traditional on-device speech_to_text with a modern pipeline: - Record audio via `record` package with hardware noise suppression - Apply GTCRN neural denoising (sherpa-onnx, ICASSP 2024, 48K params) - Trim silence, POST to backend /voice/transcribe (faster-whisper) Changes: - Add /transcribe endpoint to voice-service for audio file upload - Add SpeechEnhancer wrapper for sherpa-onnx GTCRN model (523KB) - Rewrite chat_page.dart voice input: record → denoise → transcribe - Keep NoiseReducer.trimSilence for silence removal only - Upgrade record to v6.2.0, add sherpa_onnx, path_provider Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 07:59:15 -08:00
hailin	89955f6db8	fix: remove SQL comment lines before splitting to prevent filtering CREATE TABLE statements The previous approach split by semicolons then filtered statements starting with '--', which incorrectly removed entire CREATE TABLE blocks that had comment headers (e.g., '-- Agent Sessions\nCREATE TABLE...'). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 03:26:37 -08:00
hailin	7ff012cd91	fix: use underscores in tenant slug for valid PostgreSQL schema names Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 03:23:14 -08:00
hailin	5d81667ddd	feat: add dual tenant registration (self-service + invitation) Backend: - Enhanced register endpoint to accept companyName for self-service tenant creation with schema provisioning and admin user setup - Added TenantInvite entity with token-based invitation system - Added invite CRUD endpoints to TenantController (create/list/revoke) - Added public endpoints for invite validation and acceptance Frontend: - Created registration page with optional organization name field - Created invitation acceptance page at /invite/[token] - Added invite management UI to tenant detail page - Updated login page with link to registration Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 03:10:18 -08:00
hailin	7dbd2c1414	feat: add settings, roles, permissions, and metrics controllers Implement remaining backend controllers for all web admin menu pages: - SettingsController: general, notification, theme, account, API keys - RoleController: CRUD roles with permission assignment - PermissionController: permission matrix for RBAC management - MetricsController: server metrics overview and per-server data Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 01:03:34 -08:00
hailin	8f89b8121c	fix: format tenant API response to match frontend DTO Map flat quota fields to nested quota object and add userCount field to match the frontend's expected Tenant interface. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 00:41:19 -08:00
hailin	3816d6841d	fix: add users endpoint, admin route, and fix agent-config paths - Add UsersController to auth-service for user CRUD (GET/POST/PUT/DELETE /api/v1/auth/users) - Add Kong route /api/v1/admin -> auth-service for tenant management - Remove AuthGuard from TenantController (Kong handles JWT) - Fix frontend agent-config API paths from /api/v1/agent/config to /api/v1/agent-config Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 00:35:57 -08:00
hailin	52b85f085e	fix: decode JWT in middleware to populate req.user for RolesGuard Kong validates the JWT but doesn't populate req.user on the backend. The middleware now decodes the JWT payload to extract user info (id, email, tenantId, roles) so RolesGuard can check role-based access. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 00:25:32 -08:00
hailin	f393a07092	fix: correct alert-rules API paths and remove audit ACL plugin - Frontend alert-rules paths changed from /monitoring/alert-rules to /monitor/alerts/rules to match backend routes - Removed Kong ACL plugin on audit-routes (JWT auth is sufficient) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 00:21:50 -08:00
hailin	e98cf26587	fix: add missing columns to tenant schema template (runbook.updated_at, contact.role) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 00:18:46 -08:00
hailin	e0ef15df1e	fix: add SnakeNamingStrategy for TypeORM to match snake_case DB columns TypeORM entities use camelCase properties (tenantId, passwordHash) but database tables use snake_case columns (tenant_id, password_hash). The naming strategy automatically converts between the two conventions. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 00:10:08 -08:00
hailin	a72cbd3778	fix: use any types in TenantContextMiddleware to avoid express dependency The @it0/database package doesn't have @types/express, causing build failures. Use any types for req/res/next parameters instead. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 00:00:55 -08:00
hailin	5b6e7ee363	fix: add TenantContextMiddleware to initialize tenant context from X-Tenant-Id header All services using TenantAwareRepository require AsyncLocalStorage tenant context to set the correct PostgreSQL search_path. The middleware reads X-Tenant-Id from request headers and wraps the request with TenantContextService.run(), using schema naming convention it0_t_{tenantId}. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 23:58:01 -08:00
hailin	806113554b	fix: remove AuthGuard('jwt') from all service controllers Kong handles JWT validation at the gateway level. Service-level AuthGuard('jwt') fails because services don't register a Passport JWT strategy (only auth-service does). Removed from 17 controllers across ops, inventory, monitor, comm, audit, and agent services. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 23:42:37 -08:00
hailin	c710303b60	fix: per-service JWT in Kong, fix auth-service tenant-aware repos - Replace global JWT plugin with per-service JWT (skip auth-service) to fix auth routes being blocked by global JWT in DB-less mode - Fix UserRepository and ApiKeyRepository to use standard TypeORM instead of TenantAwareRepository (users are global, not per-schema) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 23:31:32 -08:00
hailin	7dd7de4a22	fix: use COPY --chmod for Kong entrypoint (non-root image) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 23:24:37 -08:00
hailin	48e47975ca	fix: configure Kong JWT auth flow with consumer credentials - Add kid claim to auth-service JWT for Kong validation - Add Kong consumer with JWT credential (shared secret via env) - Add agent-config route to Kong for /api/v1/agent-config - Kong Dockerfile uses entrypoint script to inject JWT_SECRET at runtime - Fix frontend login path (/auth/login → /api/v1/auth/login) - Extract tenantId from JWT on login and store as current_tenant - Add auth guard in admin layout (redirect to /login if no token) - Pass JWT_SECRET env var to Kong container in docker-compose Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 23:20:06 -08:00
hailin	e5dcfa6113	feat: configure it0.szaiai.com and it0api.szaiai.com domains - Update Kong CORS origins to allow it0.szaiai.com - Update WebSocket URL to wss://it0api.szaiai.com - Fix proxy route to read API_BASE_URL at request time (was being inlined at build time by Next.js standalone) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 22:54:17 -08:00
hailin	d8cb2a9c6f	fix: use standard TypeORM repos and header-based tenant extraction - Replace TenantAwareRepository with standard @InjectRepository (TenantAwareRepository requires AsyncLocalStorage tenant context middleware which agent-service does not have) - Replace @TenantId() decorator with @Headers('x-tenant-id') for direct HTTP header extraction - Return defaults gracefully when no tenant is selected Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 22:41:30 -08:00
hailin	f897cfe240	fix: remove AuthGuard('jwt') from agent-service controllers Agent-service does not have a registered Passport JWT strategy — JWT validation is handled by Kong API gateway. The AuthGuard was causing 500 "Unknown authentication strategy" errors on all new controller endpoints. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 22:36:46 -08:00
hailin	5ee1227800	feat: add backend controllers for agent config, skills, and hooks Implement missing REST API endpoints that the web-admin frontend pages were calling but had no backend support: - GET/POST/PUT /api/v1/agent-config (engine, prompt, turns, budget, tools) - GET/POST/PUT/DELETE /api/v1/agent/skills (CRUD for agent skills) - GET/POST/PUT/DELETE /api/v1/agent/hooks (CRUD for hook scripts) Each endpoint includes entity, repository, service, and controller layers following the existing DDD + tenant-aware patterns. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 22:26:25 -08:00
hailin	8b92abcce9	fix: handle undefined from eventQueue.shift() in SDK engine Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 22:10:43 -08:00
hailin	c75ad27771	feat: add Claude Agent SDK engine with multi-tenant support Add @anthropic-ai/claude-agent-sdk as a third engine (pure additive, no changes to existing CLI/API engines). Includes full frontend admin page. Backend (agent-service): - ClaudeAgentSdkEngine: implements AgentEnginePort using SDK's query() API - ApprovalGate: L2 tool approval with configurable auto-approve timeout (default 120s) - TenantAgentConfig entity: per-tenant billing mode, encrypted API key, timeout, tool lists - AllowedToolsResolverService: RBAC-based tool whitelist (admin/operator/viewer) - TenantAgentConfigController: REST endpoints for admin config management - Default subscription billing (operator's Claude login, no API key needed) - Optional per-tenant API key with AES-256-GCM encryption Frontend (web-admin): - SDK Config page at /agent-config/sdk with billing, timeout, tool permissions - Sidebar navigation entry under Agent Config - React Query key for tenant SDK config Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-20 18:38:30 -08:00
hailin	a06b489a1e	fix: load voice models in background thread to unblock startup Model downloads (Whisper, Kokoro, Silero VAD) are synchronous blocking calls that prevent uvicorn from completing startup and responding to healthchecks. Move all model loading to a daemon thread so the server starts immediately. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-20 00:26:06 -08:00
hailin	3702fa3f52	fix: make voice-service startup graceful and fix device config - Wrap model loading in try/except so server starts even if models fail - Fix device env var mapping (unified 'device' field instead of 'whisper_device') - Default Whisper model to 'base' instead of 'large-v3' (3GB) for CPU deployment - Increase healthcheck start_period to 120s for model download time Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-20 00:20:12 -08:00
hailin	39718a9a09	fix: resolve runtime errors for NestJS, Kong, and voice-service - Dockerfile.service: fix entry point path (dist/services/{name}/src/main) due to tsconfig paths widening rootDir during compilation - Kong config: remove unsupported ws/wss protocols (WebSocket works automatically over http/https in Kong 3.7) - voice-service: fix pipecat import path for v0.0.30 API (pipecat.transports.network.websocket_server with lowercase class names) - voice-service: add openai dependency required by pipecat anthropic service Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 19:00:03 -08:00
hailin	93c4a21f06	fix: upgrade faster-whisper to 1.2.1 to resolve av build failure faster-whisper 1.0.0 depends on av==11.* which has no prebuilt wheels and fails to compile. Version 1.2.1 uses av 12+ with prebuilt wheels. Also removed unnecessary FFmpeg dev libraries from Dockerfile. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 16:40:04 -08:00
hailin	6deaf16365	fix: add pkg-config and FFmpeg dev libs for PyAV build PyAV (av==11, dep of faster-whisper) requires pkg-config and FFmpeg development headers to compile from source. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 05:20:37 -08:00
hailin	c0b4f77de5	fix: remove China mirrors, add build-essential for voice-service Server is on HK network, no need for China mirrors. Added build-essential for compiling native Python packages (kokoro, etc). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 05:11:39 -08:00
hailin	9a95cdc4a9	fix: update numpy to 1.26.4 for pipecat-ai compatibility pipecat-ai==0.0.30 requires numpy~=1.26.4, conflicting with 1.26.0. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 05:09:01 -08:00
hailin	b382e6e469	fix: add China registry mirrors for npm and pip in Dockerfiles web-admin npm ci was timing out on the server. Added npmmirror.com for npm and tsinghua mirror for pip to resolve network issues. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 04:59:09 -08:00
hailin	e7570a3710	fix: add missing @it0/common dependency to @it0/testing Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 04:48:49 -08:00
hailin	ee1ee7b484	fix: remove non-existent scripts/ COPY from voice-service Dockerfile Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 04:39:02 -08:00
hailin	e875cd49bb	fix: resolve Kong image tag and port conflicts for shared server - Change Kong base image from kong:3.7-alpine (non-existent) to kong:3.7 - Remap all host ports to avoid conflicts with existing iconsulting services: - Backend services: 13001-13008 (was 3001-3008) - Web admin: 13000 (was 3000) - API gateway: 18000/18001 (was 8000/8001) - PostgreSQL: 15432 (was 5432) - Redis: 16379 (was 6379) - Add container_name with it0- prefix to all services - Update deploy.sh health check ports to match new mappings Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 04:36:23 -08:00
hailin	00f8801d51	Initial commit: IT0 AI-powered server cluster operations platform Full-stack monorepo with DDD + Clean Architecture: - Backend: 7 NestJS microservices + 5 shared libraries (TypeScript) - Mobile: Flutter app with Riverpod (Dart) - Web Admin: Next.js dashboard with Zustand + React Query - Voice: Python voice service (STT/TTS/VAD) - Infra: Docker Compose, K8s manifests, Turborepo build Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-08 22:54:37 -08:00

1 2 3

145 Commits