iconsulting

Commit Graph

Author	SHA1	Message	Date
hailin	cd5399eac3	feat(agent): implement consulting strategy engine with V2 agent service - Add 8-stage consulting workflow (greeting → handoff) - Create StrategyEngineService for state management and transitions - Add ClaudeAgentServiceV2 with integrated strategy guidance - Support old user recognition via get_user_context tool - Add device info (IP, fingerprint) for new user icebreaking - Extend ConversationEntity with consulting state fields - Add database migration for new JSONB columns Stages: greeting, needs_discovery, info_collection, assessment, recommendation, objection_handling, conversion, handoff Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-24 06:32:07 -08:00
hailin	dd66c3a892	fix(conversation): resolve TypeScript type errors in token tracking - Fix Usage type cast by using unknown intermediate type - Add PricingTier interface and proper Record type for PRICING Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-23 08:27:03 -08:00
hailin	849a4a3099	feat(conversation): add token usage tracking for API cost analysis - Add TokenUsageEntity to store per-request token consumption - Add TokenUsageService with cost calculation and statistics APIs - Record input/output/cache tokens per API call - Calculate estimated cost based on Claude pricing - Provide user/conversation/global stats aggregation - Support daily stats and top users ranking - Integrate token tracking in ClaudeAgentService - Track latency, tool calls, response length - Accumulate tokens across tool loop iterations - Add token_usages table to init-db.sql with proper indexes This enables: - Per-user token consumption tracking - Cost analysis and optimization - Future billing/quota features Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-23 08:23:58 -08:00
hailin	c768e2aa53	fix(agent): stricter max_tokens calculation for response length control - Reduce tokensPerChar from 2 to 1.8 for more accurate Chinese token estimation - Use min() instead of max() to enforce upper limits on token counts - CHAT: max 200 tokens (was min 256) - SIMPLE_QUERY: max 600 tokens (was min 512) - CLARIFICATION: max 300 tokens (was min 256) - CONFIRMATION: max 400 tokens (was min 384) - DEEP_CONSULTATION: 800-1600 tokens (was 1024-4096) - ACTION_NEEDED: 500-1000 tokens (was 768-2048) This should result in more concise AI responses that better match the intent classifier's suggested length limits. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-23 08:02:34 -08:00
hailin	d9b4c72894	feat(agent): implement 3-layer architecture for better response quality Implement a three-layer architecture to improve AI response quality: Layer 1 - Intent Classifier (intent-classifier.ts): - Classifies user intent into 6 types: SIMPLE_QUERY, DEEP_CONSULTATION, ACTION_NEEDED, CHAT, CLARIFICATION, CONFIRMATION - Determines suggested response length based on intent type - Detects follow-up questions and extracts entities (visa types, etc.) - Uses keyword matching for fast classification (no API calls) Layer 2 - ReAct Agent (system-prompt.ts): - Adds ReAct thinking framework to system prompt - 4-step process: Understand -> Evaluate -> Act -> Generate - Emphasizes concise responses, avoids redundant phrases - Injects intent classification results to guide response strategy Layer 3 - Response Gate (response-gate.ts): - Quality checks: length, relevance, redundancy, completeness, tone - Logs gate results for analysis and future optimization - Can trim responses and remove redundant expressions Integration (claude-agent.service.ts): - Integrates all 3 layers in sendMessage flow - Dynamically adjusts max_tokens based on intent type - Collects full response for gate analysis Documentation: - Added AGENT_THREE_LAYER_ARCHITECTURE.md with detailed design docs Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-23 07:51:19 -08:00
hailin	4c125f3276	feat(agent): add 4 real-time tools for enhanced agent capabilities Add the following real-time tools to ImmigrationToolsService: - get_current_datetime: Get current date/time with timezone support - web_search: Search internet for latest immigration news/policies (Google CSE) - get_exchange_rate: Query real-time currency exchange rates (for investment immigration) - fetch_immigration_news: Fetch latest immigration announcements All tools include graceful degradation with fallback responses when external APIs are unavailable. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-23 00:46:55 -08:00
hailin	911132ab3e	feat(agent): upgrade to Level 3 with real RAG, Memory and Evolution integration ## Summary Upgrade iConsulting from Level 2 (48 points) to Level 3 (68 points) by implementing real service-to-service integration between conversation-service and knowledge-service. ## New Files - knowledge-client.service.ts: HTTP client for knowledge-service APIs - knowledge.module.ts: NestJS module for KnowledgeClientService - AGENT_EVALUATION_REPORT.md: Agent capability evaluation report - LEVEL3_UPGRADE_PLAN.md: Upgrade plan and completion report ## Changes ### RAG Integration - search_knowledge tool now calls /api/v1/knowledge/retrieve - check_off_topic tool calls /api/v1/knowledge/check-off-topic - Results include real vector similarity search from knowledge base ### Memory Integration - save_user_memory writes to PostgreSQL + Neo4j via knowledge-service - collect_assessment_info saves user data to long-term memory - generate_payment records payment intent to user memory - New get_user_context tool retrieves user's historical memories ### Evolution Integration - getAccumulatedExperience() fetches approved system experiences - sendMessage() dynamically injects experiences into system prompt - System learns from approved experiences across all conversations ## Expected Score Improvement \| Dimension \| Before \| After \| Delta \| \|------------\|--------\|-------\|-------\| \| Tool Use \| 14/20 \| 18/20 \| +4 \| \| Memory \| 12/20 \| 16/20 \| +4 \| \| RAG \| 10/20 \| 16/20 \| +6 \| \| Evolution \| 8/20 \| 14/20 \| +6 \| \| Total \| 48 \| 68 \| +20 \| Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-22 23:45:58 -08:00
hailin	d4925719fc	feat(multimodal): add file upload and image support for chat - Add MinIO object storage to docker-compose infrastructure - Create file-service microservice for upload management with presigned URLs - Add files table to database schema - Update nginx and Kong for MinIO proxy routes - Implement file upload UI in chat InputArea with drag-and-drop - Add attachment preview in MessageBubble component - Update conversation-service to handle multimodal messages - Add Claude Vision API integration for image analysis Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-10 05:34:41 -08:00
hailin	93050b6889	perf(claude): enable Prompt Caching for ~90% cost savings on system prompt	2026-01-10 01:42:33 -08:00
hailin	72e67fa5d9	fix(conversation): implement proper tool loop for Claude API - Fix streaming JSON parsing for tool inputs by accumulating partial JSON and parsing only on content_block_stop - Implement proper tool loop to continue conversation after tool execution - Send tool results back to Claude to get final response - Add safety limit of 10 iterations for tool loops This fixes the issue where AI responses were truncated after using tools like search_knowledge. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-09 21:29:18 -08:00
hailin	f87c089ca2	fix: disable TLS verification for IP-based proxy When ANTHROPIC_BASE_URL points to an IP address (proxy server), disable TLS certificate verification to allow connection. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-09 20:50:15 -08:00
hailin	7f2fc153b5	refactor: simplify Anthropic client config using baseURL Remove https-proxy-agent dependency since ANTHROPIC_BASE_URL already supports pointing to a proxy server directly. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-09 20:45:44 -08:00
hailin	a43e0b40e8	fix: use correct type for Anthropic client options Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-09 20:44:03 -08:00
hailin	c6c9623f36	feat(conversation): add proxy support for Anthropic API - Add https-proxy-agent dependency - Configure httpAgent in ClaudeAgentService when ANTHROPIC_PROXY_URL is set - Add ANTHROPIC_PROXY_URL environment variable to docker-compose.yml Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-09 20:34:13 -08:00
hailin	a7add8ff90	Initial commit: iConsulting 香港移民咨询智能客服系统项目架构: - Monorepo (pnpm + Turborepo) - 后端: NestJS 微服务 + Claude Agent SDK - 前端: React + Vite + Ant Design 包含服务: - conversation-service: 对话服务 (Claude AI) - user-service: 用户认证服务 - payment-service: 支付服务 (支付宝/微信/Stripe) - knowledge-service: 知识库服务 (RAG + Neo4j) - evolution-service: 自我进化服务 - web-client: 用户前端 - admin-client: 管理后台基础设施: - PostgreSQL + Redis + Neo4j - Kong API Gateway - Nginx 反向代理 - Docker Compose 部署配置 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-09 00:01:12 -08:00

15 Commits