- Filter "完成资料引用" and other status text from Antaf responses - Use int8 quantized model for faster TTS inference - Add configurable num_threads for sherpa-onnx Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>