hailin/it0 - AI Wolves Team

Commit Graph

Author	SHA1	Message	Date
hailin	3ed20cdf08	refactor: clean up agent SSH setup after fixing host-local routing. Remove iproute2/NET_ADMIN (no longer needed); remove ip route hack from entrypoint.sh; rwa-colocation-2 server record updated to use Docker gateway IP since 14.215.128.96 is a host-local NIC on the IT0 server. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 18:11:44 -08:00
hailin	ae7d9251ec	fix: add route for host-local IP (14.215.128.96) in agent container. 14.215.128.96 is bound to a host NIC (enp5s0) and unreachable from Docker bridge via default NAT. Add NET_ADMIN + ip route via gateway. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 18:05:30 -08:00
hailin	0dea3f82bc	fix: mount correct SSH key (rwadurian_ed25519) in agent-service. The IT0 server has its own id_ed25519, which differs from the local key that's authorized on RWADurian servers. Use a dedicated key file. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 13:05:01 -08:00
hailin	329916e1f6	fix: correct SSH key permissions in agent-service container. Mount host key to /tmp/host-ssh-key (read-only), then copy to appuser's .ssh directory with correct ownership at container start. Fixes "Permission denied" due to uid mismatch on bind mount. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:00:02 -08:00
hailin	795e8a11c5	feat: enable SSH access from agent-service container. Add openssh-client to Dockerfile.service (alpine); create .ssh directory with correct permissions for appuser; mount host SSH key into agent-service container (read-only). This allows the Agent SDK to SSH into managed servers using the Bash tool. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 11:55:54 -08:00
hailin	cc0f06e2be	feat: SDK engine native resume with per-tenant HOME isolation. Replace prompt-prefix workaround with SDK's native resume mechanism. Each tenant gets isolated HOME directory (/data/claude-tenants/{tenantId}) to prevent cross-tenant session file mixing. SDK session IDs are persisted in session.metadata for cross-request resume support. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 02:27:38 -08:00
hailin	c02c2a9a11	feat: add OpenAI TTS/STT provider support in voice pipeline. Add STT_PROVIDER/TTS_PROVIDER config (local or openai) in settings; pipeline uses OpenAI API for STT/TTS when provider is "openai"; skip loading local models (Kokoro/faster-whisper) when using OpenAI; VAD (Silero) always loads for speech detection. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 09:27:38 -08:00
hailin	d43baed3a5	feat: add OpenAI TTS/STT API endpoints for comparison testing. Add openai package to voice-service requirements; add /api/v1/test/tts/synthesize-openai (tts-1/tts-1-hd/gpt-4o-mini-tts); add /api/v1/test/stt/transcribe-openai (gpt-4o-transcribe/whisper-1); add OPENAI_API_KEY and OPENAI_BASE_URL env vars to voice-service; Flutter test page: SegmentedButton to toggle Local/OpenAI provider; all endpoints maintain same response format for easy comparison. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 07:20:03 -08:00
hailin	7ac753ada4	fix: add ANTHROPIC_BASE_URL to agent-service for proxy access. The agent-service was missing the ANTHROPIC_BASE_URL environment variable, causing the Claude Agent SDK to call api.anthropic.com directly instead of going through the proxy at 67.223.119.33, resulting in 403 Forbidden errors. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 04:49:27 -08:00
hailin	6876ec569b	fix: remove ANTHROPIC_API_KEY from agent-service to use subscription mode. Default to OAuth subscription billing via ~/.claude/.credentials.json instead of consuming API key credits. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 03:43:09 -08:00
hailin	82d12a5ff5	feat: mount voice model cache volumes to avoid re-downloading on restart. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 02:28:28 -08:00
hailin	abf5e29419	feat: route voice pipeline through agent-service instead of direct LLM. Voice calls now use the same agent task + WS subscription flow as the chat UI, enabling tool use and command execution during voice sessions. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 00:47:31 -08:00
hailin	d4391eef97	fix: run services as non-root user for SDK bypassPermissions. SDK blocks bypassPermissions when running as root for security. Add non-root 'appuser' to Dockerfile.service and update volume mounts to use /home/appuser/.claude paths. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 06:41:10 -08:00
hailin	04a18a7899	fix: use acceptEdits mode and mount .claude.json for SDK. bypassPermissions blocked by SDK when running as root; switch to acceptEdits with canUseTool for programmatic control; mount .claude.json config file into container. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 06:37:31 -08:00
hailin	3a6f9d9447	fix: mount .claude directory as read-write for SDK debug logs. SDK writes debug logs to ~/.claude/debug/ at runtime. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 06:21:31 -08:00
hailin	b963b7d4da	feat: enable SDK subscription mode with OAuth credentials mount. Mount ~/.claude/ into agent-service container for OAuth token access; switch default engine to claude_agent_sdk; remove ANTHROPIC_API_KEY from env in subscription mode so SDK uses OAuth; keep API key mode for per-tenant billing. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 06:14:45 -08:00
hailin	810dcd7def	feat: switch default engine to claude_api with base URL support. Change AGENT_ENGINE_TYPE from claude_code_cli to claude_api in docker-compose; add ANTHROPIC_BASE_URL env var support to claude-api-engine; add ANTHROPIC_BASE_URL to agent-service environment in docker-compose. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 05:45:08 -08:00
hailin	9a1ecf10ec	fix: add restart policy, global error handlers, and fix tenant schema bug. Add restart: unless-stopped to all 12 Docker services; add process.on(unhandledRejection/uncaughtException) to all 7 service main.ts; fix handleEventTrigger using tenantId UUID as schema name instead of slug lookup; wrap Redis event subscription callbacks in try/catch. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 05:30:34 -08:00
hailin	48e47975ca	fix: configure Kong JWT auth flow with consumer credentials. Add kid claim to auth-service JWT for Kong validation; add Kong consumer with JWT credential (shared secret via env); add agent-config route to Kong for /api/v1/agent-config; Kong Dockerfile uses entrypoint script to inject JWT_SECRET at runtime; fix frontend login path (/auth/login → /api/v1/auth/login); extract tenantId from JWT on login and store as current_tenant; add auth guard in admin layout (redirect to /login if no token); pass JWT_SECRET env var to Kong container in docker-compose. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 23:20:06 -08:00
hailin	e5dcfa6113	feat: configure it0.szaiai.com and it0api.szaiai.com domains. Update Kong CORS origins to allow it0.szaiai.com; update WebSocket URL to wss://it0api.szaiai.com; fix proxy route to read API_BASE_URL at request time (was being inlined at build time by Next.js standalone). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 22:54:17 -08:00
hailin	67d5a13c0c	fix: set compose project name to 'it0' for consistent image naming. Changes image names from docker-{service} to it0-{service}. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-20 02:57:42 -08:00
hailin	259838ae88	fix: set HOSTNAME=0.0.0.0 for Next.js standalone to bind all interfaces. Next.js standalone server binds to container hostname by default, making it unreachable from 127.0.0.1 for healthchecks and from Docker port forwarding. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-20 02:52:37 -08:00
hailin	83da374bbb	fix: use 127.0.0.1 in web-admin healthcheck to avoid IPv6 resolution. Node.js 18 resolves 'localhost' to ::1 (IPv6) but Next.js standalone only binds to 0.0.0.0 (IPv4), causing Connection Refused. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-20 02:49:51 -08:00
hailin	3702fa3f52	fix: make voice-service startup graceful and fix device config. Wrap model loading in try/except so server starts even if models fail; fix device env var mapping (unified 'device' field instead of 'whisper_device'); default Whisper model to 'base' instead of 'large-v3' (3GB) for CPU deployment; increase healthcheck start_period to 120s for model download time. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-20 00:20:12 -08:00
hailin	d0447fb69f	fix: use node/python HTTP healthchecks instead of wget. wget returns error on 404, but services are healthy (just no root endpoint). Using node http.get for NestJS services (accepts any non-5xx response) and python urllib for voice-service. Also upgraded api-gateway depends_on to service_healthy. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-20 00:13:47 -08:00
hailin	e7ae82e51d	feat: add healthcheck to all services in docker-compose. NestJS services use wget to check API endpoints. voice-service uses curl to check FastAPI /docs endpoint. web-admin uses wget to check Next.js root. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-20 00:10:38 -08:00
hailin	e875cd49bb	fix: resolve Kong image tag and port conflicts for shared server. Change Kong base image from kong:3.7-alpine (non-existent) to kong:3.7; remap all host ports to avoid conflicts with existing iconsulting services: backend services 13001-13008 (was 3001-3008), web admin 13000 (was 3000), API gateway 18000/18001 (was 8000/8001), PostgreSQL 15432 (was 5432), Redis 16379 (was 6379); add container_name with it0- prefix to all services; update deploy.sh health check ports to match new mappings. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 04:36:23 -08:00
hailin	9120f4927e	fix: add Dockerfiles and fix docker-compose build configuration. Add shared Dockerfile.service for all 7 NestJS microservices using multi-stage build with pnpm workspace support; add Dockerfile for web-admin (Next.js standalone output); add .dockerignore files for root and web-admin; fix docker-compose.yml: use monorepo root as build context with SERVICE_NAME build arg instead of per-service Dockerfiles; fix postgres/redis missing network config (services couldn't reach them); use .env variables for DB credentials instead of hardcoded values; add JWT_REFRESH_SECRET and REDIS_URL to services that were missing them; add DB init script volume mount for postgres; remove deprecated version: '3.8' from all compose files; add output: 'standalone' to next.config.js for optimized Docker builds. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 04:31:23 -08:00
hailin	00f8801d51	Initial commit: IT0 AI-powered server cluster operations platform. Full-stack monorepo with DDD + Clean Architecture. Backend: 7 NestJS microservices + 5 shared libraries (TypeScript); Mobile: Flutter app with Riverpod (Dart); Web Admin: Next.js dashboard with Zustand + React Query; Voice: Python voice service (STT/TTS/VAD); Infra: Docker Compose, K8s manifests, Turborepo build. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-08 22:54:37 -08:00

29 Commits