Aether

mirror of https://github.com/fawney19/Aether.git synced 2026-01-03 16:22:27 +08:00

Author	SHA1	Message	Date
fawney19	c69a0a8506	refactor: remove stream smoothing config from system settings and improve base image caching - Remove stream_smoothing configuration from SystemConfigService (moved to handler default) - Remove stream smoothing UI controls from admin settings page - Add AdminClearSingleAffinityAdapter for targeted cache invalidation - Add clearSingleAffinity API endpoint to clear specific affinity cache entries - Include global_model_id in affinity list response for UI deletion support - Improve CI/CD workflow with hash-based base image change detection - Add hash label to base image for reliable cache invalidation detection - Use remote image inspection to determine if base image rebuild is needed - Include Dockerfile.base in hash calculation for proper dependency tracking	2025-12-19 13:09:56 +08:00
fawney19	1fae202bde	Merge pull request #30 from AAEE86/master chore: Modify the order of API format enumeration	2025-12-19 12:34:22 +08:00
AAEE86	e42bd35d48	chore: Modify the order of API format enumeration - Move CLAUDE_CLI before OPENAI	2025-12-19 11:44:10 +08:00
fawney19	97425ac68f	refactor: make stream smoothing parameters configurable and add models cache invalidation - Move stream smoothing parameters (chunk_size, delay_ms) to database config - Remove hardcoded stream smoothing constants from StreamProcessor - Simplify dynamic delay calculation by using config values directly - Add invalidate_models_list_cache() function to clear /v1/models endpoint cache - Call cache invalidation on model create, update, delete, and bulk operations - Update admin UI to allow runtime configuration of smoothing parameters - Improve model listing freshness when models are modified	2025-12-19 11:03:46 +08:00
fawney19	912f6643e2	tune: adjust stream smoothing parameters for better user experience - Increase chunk size from 5 to 20 characters for fewer delays - Reduce min delay from 15ms to 8ms for faster playback - Reduce max delay from 24ms to 15ms for better responsiveness - Adjust text thresholds to better differentiate content types - Apply parameter tuning to both StreamProcessor and _LightweightSmoother	2025-12-19 09:51:09 +08:00
fawney19	6c0373fda6	refactor: simplify text splitting logic in stream processor - Remove complex conditional logic for short/medium/long text differentiation - Unify text splitting to always use consistent CHUNK_SIZE-based splitting - Rely on dynamic delay calculation for output speed adjustment - Reduce code complexity in both main smoother and lightweight smoother	2025-12-19 09:48:11 +08:00
fawney19	070121717d	refactor: consolidate stream smoothing into StreamProcessor with intelligent timing - Move StreamSmoother functionality directly into StreamProcessor for better integration - Create ContentExtractor strategy pattern for format-agnostic content extraction - Implement intelligent dynamic delay calculation based on text length - Support three text length tiers: short (char-by-char), medium (chunked), long (chunked) - Remove manual chunk_size and delay_ms configuration - now auto-calculated - Simplify admin UI to single toggle switch with auto timing adjustment - Extract format detection logic to reusable content_extractors module - Improve code maintainability with cleaner architecture	2025-12-19 09:46:22 +08:00
fawney19	85fafeacb8	feat: add stream smoothing feature for improved user experience - Implement StreamSmoother class to split large content chunks into smaller pieces with delay - Support OpenAI, Claude, and Gemini API response formats for smooth playback - Add stream smoothing configuration to system settings (enable, chunk size, delay) - Create streamlined API for stream smoothing with StreamSmoothingConfig dataclass - Add admin UI controls for configuring stream smoothing parameters - Use batch configuration loading to minimize database queries - Enable typing effect simulation for better user experience in streaming responses	2025-12-19 03:15:19 +08:00
fawney19	7e792dabfc	refactor: use background task for client disconnection monitoring - Replace time-based throttling with background task for disconnect checks - Remove time.monotonic() and related throttling logic - Prevent blocking of stream transmission during connection checks - Properly clean up background task with try/finally block - Improve throughput and responsiveness of stream processing	2025-12-19 01:59:56 +08:00
fawney19	cd06169b2f	fix: detect OpenAI format stream completion via finish_reason - Add detection of finish_reason in OpenAI API responses to mark stream completion - Ensures OpenAI API streams are properly marked as complete even without explicit completion events - Complements existing completion event detection for other API formats	2025-12-19 01:44:35 +08:00
fawney19	50ffd47546	fix: handle client disconnection after stream completion gracefully - Check has_completion flag before marking client disconnection as failure - Allow graceful termination if response already completed when client disconnects - Change logging level to info for post-completion disconnections - Prevent false error reporting when client closes connection after receiving full response	2025-12-19 01:36:20 +08:00
fawney19	5f0c1fb347	refactor: remove unused response normalizer module - Delete unused ResponseNormalizer class and its initialization logic - Remove response_normalizer and enable_response_normalization parameters from handlers - Simplify chat adapter base initialization by removing normalizer setup - Clean up unused imports in handler modules	2025-12-19 01:20:30 +08:00
fawney19	7b932d7afb	refactor: optimize middleware with pure ASGI implementation and enhance security measures - Replace BaseHTTPMiddleware with pure ASGI implementation in plugin middleware for better streaming response handling - Add trusted proxy count configuration for client IP extraction in reverse proxy environments - Implement audit log cleanup scheduler with configurable retention period - Replace plaintext token logging with SHA256 hash fingerprints for security - Fix database session lifecycle management in middleware - Improve request tracing and error logging throughout the system - Add comprehensive tests for pipeline architecture	2025-12-18 19:07:20 +08:00
fawney19	293bb592dc	fix: enhance proxy configuration with password preservation and UI improvements - Add 'enabled' field to ProxyConfig for preserving config when disabled - Mask proxy password in API responses (return '***' instead of actual password) - Preserve existing password on update when new password not provided - Add URL encoding for proxy credentials (handle special chars like @, :, /) - Enhanced URL validation: block SOCKS4, require valid host, forbid embedded auth - UI improvements: use Switch component, dynamic password placeholder - Add confirmation dialog for orphaned credentials (URL empty but has username/password) - Prevent browser password autofill with randomized IDs and CSS text-security - Unify ProxyConfig type definition in types.ts	2025-12-18 16:14:37 +08:00
fawney19	3e50c157be	feat: add HTTP/SOCKS5 proxy support for API endpoints - Add proxy field to ProviderEndpoint database model with migration - Add ProxyConfig Pydantic model for proxy URL validation - Extend HTTP client pool with create_client_with_proxy method - Integrate proxy configuration in chat_handler_base.py and cli_handler_base.py - Update admin API endpoints to support proxy configuration CRUD - Add proxy configuration UI in frontend EndpointFormDialog Fixes #28	2025-12-18 14:46:47 +08:00
fawney19	21587449c8	fix: improve error classification and logging system - Enhance error classifier to properly handle API key failures with fallback support - Add error reason/code parsing for better AWS and multi-provider compatibility - Improve error message structure detection for non-standard formats - Refactor file logging with size-based rotation (100MB) instead of daily - Optimize production logging by disabling backtrace and diagnose - Clean up model validation and remove redundant configurations	2025-12-18 10:57:31 +08:00
fawney19	3d0ab353d3	refactor: migrate Pydantic Config to v2 ConfigDict	2025-12-18 02:20:53 +08:00
fawney19	b2a857c164	refactor: consolidate transaction management and remove legacy modules - Remove unused context.py module (replaced by request.state) - Remove provider_cache.py (no longer needed) - Unify environment loading in config/settings.py instead of __init__.py - Add deprecation warning for get_async_db() (consolidating on sync Session) - Enhance database.py documentation with comprehensive transaction strategy - Simplify audit logging to reuse request-level Session (no separate connections) - Extract UsageService._build_usage_params() helper to reduce code duplication - Update model and user cache implementations with refined transaction handling - Remove unnecessary sessionmaker from pipeline - Clean up audit service exception handling	2025-12-18 01:59:40 +08:00
fawney19	4d1d863916	refactor: improve authentication and user data handling - Replace user cache queries with direct database queries to ensure data consistency - Fix token_type parameter in verify_token calls (access token verification) - Fix role-based permission check using dictionary ranking instead of string comparison - Fix logout operation to use correct JWT claim name (user_id instead of sub) - Simplify user authentication flow by removing unnecessary cache layer - Optimize session initialization in main.py using create_session helper - Remove unused imports and exception variables	2025-12-18 01:09:22 +08:00
fawney19	b579420690	refactor: optimize database session lifecycle and middleware architecture - Improve database pool capacity logging with detailed configuration parameters - Optimize database session dependency injection with middleware-managed lifecycle - Simplify plugin middleware by delegating session creation to FastAPI dependencies - Fix import path in auth routes (relative to absolute) - Add safety checks for database session management across middleware exception handlers - Ensure session cleanup only when not managed by middleware (avoid premature cleanup)	2025-12-18 00:35:46 +08:00
fawney19	9d5c84f9d3	refactor: add scheduling mode support and optimize system settings UI - Add fixed_order and cache_affinity scheduling modes to CacheAwareScheduler - Only apply cache affinity in cache_affinity mode; use fixed order otherwise - Simplify Dialog components with title/description props - Remove unnecessary button shadows in SystemSettings - Optimize import dialog UI structure - Update ModelAliasesTab shadow styling - Fix fallback orchestrator type hints - Add scheduling_mode configuration in system config	2025-12-17 19:15:08 +08:00
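fawney19	bd11ebdbd5	fix: dark mode setting on the personal settings page is lost after refresh - Frontend uses the useDarkMode composable to unify theme-switching logic - Backend accepts the system theme value (previously only auto was supported) - Local localStorage is the source of truth for the theme, so a stale server value no longer overwrites it on refresh Fixes #22	2025-12-17 18:02:19 +08:00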
fawney19	1dac4cb156	refactor: optimize provider query and stats aggregation logic	2025-12-17 16:41:10 +08:00
fawney19	d24c3885ab	feat(admin): add config and user data import/export functionality Add comprehensive import/export endpoints for: - Provider and model configuration (with key decryption for export) - User data and API keys (preserving encrypted data) Includes merge modes (skip/overwrite/error) for conflict handling, 10MB size limit for imports, and automatic cache invalidation. Also fix optional field in GlobalModelResponse tiered_pricing.	2025-12-16 18:33:14 +08:00
fawney19	46ff5a1a50	refactor(models): enhance model management with official provider marking and extended metadata - Add OFFICIAL_PROVIDERS set to mark first-party vendors in models.dev - Implement official provider marking function with cache compatibility - Extend model metadata with family, context_limit, output_limit fields - Improve frontend model selection UI with wider panel and better search - Add dark mode support for provider logos - Optimize scrollbar styling for model lists - Update deployment documentation with clearer migration steps	2025-12-16 17:28:40 +08:00
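fawney19	edce43d45f	fix(auth): make get_current_user and get_current_user_from_header async functions Declare get_current_user and get_current_user_from_header as async and await the AuthService.verify_token call so that asynchronous token verification is handled correctly.	2025-12-16 13:42:26 +08:00
fawney19	33265b4b13	refactor(global-model): migrate model metadata to flexible config structure Consolidate model configuration from multiple fixed fields (description, official_url, icon_url, default_supports_*, etc.) into a single flexible config JSON field for better extensibility. Also improve the frontend model creation form so a model can be picked directly from the models-dev list to prefill the form. Main changes: - Backend: migrate the model table to store model capabilities and metadata in config JSON - Frontend: GlobalModelFormDialog supports two creation modes (pick from list / manual entry) - Update API types to match the new data structure	2025-12-16 12:21:21 +08:00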
fawney19	4e2ba0e57f	feat(usage): add first_byte_time_ms tracking to usage statistics - Enhance usage service to capture and store first byte latency metrics - Update usage API routes to include new timing information	2025-12-16 02:39:36 +08:00
fawney19	a3df41d63d	refactor(cli-handler): improve stream handling and response processing - Refactor CLI handler base for better stream context management - Optimize request/response handling for Claude, OpenAI, and Gemini CLI adapters - Enhance telemetry tracking across CLI handlers	2025-12-16 02:39:20 +08:00
fawney19	ad1c8c394c	refactor(handler): optimize stream processing and telemetry pipeline - Enhance stream context for better token and latency tracking - Refactor stream processor for improved performance metrics - Improve telemetry integration with first_byte_time_ms support - Add comprehensive stream context unit tests	2025-12-16 02:39:03 +08:00
fawney19	9b496abb73	feat(db): add first_byte_time_ms column to usage table	2025-12-16 02:38:43 +08:00
fawney19	f3a69a6160	refactor(handler): implement defensive token update strategy and extract cache creation token utility - Add extract_cache_creation_tokens utility to handle new/old cache creation token formats - Implement defensive update strategy in StreamContext to prevent zero values overwriting valid data - Simplify cache creation token parsing in Claude handler using new utility - Add comprehensive test suite for cache creation token extraction - Improve type hints in handler classes	2025-12-16 00:02:49 +08:00
fawney19	cf67160821	feat(cache): enhance cache monitoring endpoints and handler integrations	2025-12-15 23:12:48 +08:00
fawney19	718f56ba75	refactor(cache): optimize cache service architecture and provider transport	2025-12-15 23:12:34 +08:00
fawney19	f2cd96c34c	feat(api): add model mapping cache management endpoints	2025-12-15 20:39:51 +08:00
fawney19	f16fb28405	feat(model): improve cache invalidation for model updates and deletion - Handle both old and new aliases when invalidating cache during model updates - Preserve cache info before deletion to properly invalidate after deletion - Clear both Redis and in-memory caches on model changes	2025-12-15 20:39:39 +08:00
fawney19	a0ffc2c406	refactor(metrics): rename model_alias_* to model_mapping_* for clarity	2025-12-15 20:39:32 +08:00
fawney19	a7bfab1475	debug: add logging for model support checking and refactor cache resolution priority - Add debug logging in aware_scheduler.py to trace the model support check - Refactor alias resolution in model_cache.py: priority is now alias > provider_model_name > direct_match - Optimize the cache-hit path by running direct matching only after alias matching fails	2025-12-15 18:52:34 +08:00
fawney19	84d4db0f8d	feat(model): include alias info in cache invalidation - Pass provider_model_name to invalidate_model_cache() when creating models - Pass provider_model_aliases to invalidate_model_cache() when updating models - Ensures alias-based resolve cache keys are properly cleared on model changes	2025-12-15 18:27:49 +08:00
fawney19	903b182fdf	fix(scheduler): correct whitelist validation logic - Use 'is not None' instead of truthiness check for allowed_api_formats - Use 'is not None' instead of truthiness check for allowed_models - Use 'is not None' instead of truthiness check for allowed_providers - Use 'is not None' check for allowed_endpoints to distinguish empty list from None - Fixes issue where empty whitelist (empty list) was incorrectly treated as no restriction	2025-12-15 18:27:41 +08:00
fawney19	d9bd0790fe	feat(cache): improve model cache invalidation for alias resolution - Add provider_model_name and provider_model_aliases to invalidate_model_cache() - Clear resolve cache keys for both model name and aliases when invalidating - Also clear resolve cache in invalidate_global_model_cache() for GlobalModel names - Handle SQLite gracefully by catching OperationalError and ProgrammingError - Optimize fallback query to pre-filter by provider_model_name when JSONB fails	2025-12-15 18:27:31 +08:00
fawney19	11774c69b6	refactor(mapper): use model alias resolution service - Replace direct GlobalModel.name lookup with ModelCacheService.resolve_global_model_by_name_or_alias() - Support model aliases in source_model parameter - Leverage model resolution caching for better performance	2025-12-15 18:13:41 +08:00
fawney19	8f0a0cbdb1	refactor(scheduler): integrate model alias resolution - Use ModelCacheService.resolve_global_model_by_name_or_alias() for model lookups - Support both requested model name and resolved GlobalModel name in validation - Track resolved_model_name for proper allow_models checking - Improve model availability checks to handle alias resolution - Fix transient/detached object handling in global_model merge - Add more descriptive debug logs for alias resolution mismatches - Clean up code formatting (line length, imports organization)	2025-12-15 18:13:35 +08:00
fawney19	51b85915d2	feat(cache): implement model alias resolution with caching - Add resolve_global_model_by_name_or_alias() supporting direct match and alias lookup - Support both provider_model_name and provider_model_aliases matching - Implement caching for resolved models with TTL - Add conflict detection when alias maps to multiple GlobalModels - Record resolution metrics: method, cache hits, duration, conflicts - Fallback to Python-level filtering for non-PostgreSQL databases - Add cache invalidation methods for GlobalModel	2025-12-15 18:13:28 +08:00
fawney19	b0d295c6c9	feat(metrics): add model alias resolution metrics - model_alias_resolution_total: track resolution methods and cache hits - model_alias_resolution_duration_seconds: measure resolution performance - model_alias_conflict_total: monitor alias conflicts across GlobalModels	2025-12-15 18:13:19 +08:00
fawney19	88e37594cf	refactor(backend): update handlers, utilities and core modules after models restructure	2025-12-15 14:30:53 +08:00
fawney19	7068aa9130	refactor(backend): optimize cache system and model/provider services	2025-12-15 14:30:21 +08:00
fawney19	56fb6bf36c	refactor(backend): update model catalog and provider APIs after mappings removal	2025-12-15 14:30:10 +08:00
fawney19	728f9bb126	refactor(backend): remove model mappings module	2025-12-15 14:30:00 +08:00
fawney19	beae7a2616	feat(api): add unified Models API endpoint - Add models_service.py with model query logic and caching - Add models.py unified endpoint supporting Claude/OpenAI/Gemini formats - Auto-detect API format based on request headers - Support /v1/models and /v1beta/models (Gemini) paths - Update route registration and comments	2025-12-14 20:01:19 +08:00

68 Commits