Aether

mirror of https://github.com/fawney19/Aether.git synced 2026-01-08 10:42:29 +08:00

Author	SHA1	Message	Date
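fawney19	d0ce798881	fix: enable random Key rotation mode when TTL=0 - When cache_ttl_minutes is 0 for all Keys, use random ordering instead of deterministic hashing - Move the hashlib and random imports to the top of the file - Simplify handling of the single-Key case Closes #57	2025-12-28 19:07:25 +08:00
fawney19	71bc2e6aab	fix: add parameter validation to prevent division-by-zero errors	2025-12-25 22:44:17 +08:00
fawney19	afb329934a	fix: fix division-by-zero error in the time-segment calculation for endpoint health statistics	2025-12-25 19:54:16 +08:00
fawney19	9dad194130	fix: fix API Key access restriction fields that could not be cleared - Unify empty-array handling in the frontend when creating and updating an API Key - Backend create and update endpoints both convert an empty array to NULL (meaning no restriction) - Refresh data once immediately when auto-refresh is turned on	2025-12-24 22:35:30 +08:00
fawney19	03ad16ea8a	fix: fix migration script errors on fresh installs and improve stats backfill logic Migration script fixes: - Remove AUTOCOMMIT mode and create indexes within the same transaction - Check each index for existence separately and create only the missing ones - Fix the AUTOCOMMIT connection not seeing uncommitted tables on fresh installs (#46) Stats backfill improvements: - Check missing dates for StatsDaily and StatsDailyModel separately - Backfill only the data that is actually missing rather than a contiguous range - Add a failure count and rollback error logging	2025-12-24 21:50:05 +08:00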
fawney19	1d5c378343	feat: add TTFB timeout detection and improve stream handling - Add stream first byte timeout (TTFB) detection to trigger failover when provider responds too slowly (configurable via STREAM_FIRST_BYTE_TIMEOUT) - Add rate limit fail-open/fail-close strategy configuration - Improve exception handling in stream prefetch with proper error classification - Refactor UsageService with shared _prepare_usage_record method - Add batch deletion for old usage records to avoid long transaction locks - Update CLI adapters to use proper User-Agent headers for each CLI client - Add composite indexes migration for usage table query optimization - Fix streaming status display in frontend to show TTFB during streaming - Remove sensitive JWT secret logging in auth service	2025-12-22 23:44:42 +08:00
fawney19	4e1aed9976	feat: add daily model statistics aggregation with stats_daily_model table	2025-12-20 02:39:10 +08:00
fawney19	af476ff21e	feat: enhance error logging and upstream response tracking for provider failures	2025-12-19 15:29:48 +08:00
fawney19	3bbc1c6b66	feat: add provider compatibility error detection for intelligent failover - Introduce ProviderCompatibilityException for unsupported parameter/feature errors - Add COMPATIBILITY_ERROR_PATTERNS to detect provider-specific limitations - Implement _is_compatibility_error() method in ErrorClassifier - Prioritize compatibility error checking before client error validation - Remove 'max_tokens' from CLIENT_ERROR_PATTERNS as it can indicate compatibility issues - Enable automatic failover when provider doesn't support requested features - Improve error classification accuracy with pattern matching for common compatibility issues	2025-12-19 13:28:26 +08:00
fawney19	c69a0a8506	refactor: remove stream smoothing config from system settings and improve base image caching - Remove stream_smoothing configuration from SystemConfigService (moved to handler default) - Remove stream smoothing UI controls from admin settings page - Add AdminClearSingleAffinityAdapter for targeted cache invalidation - Add clearSingleAffinity API endpoint to clear specific affinity cache entries - Include global_model_id in affinity list response for UI deletion support - Improve CI/CD workflow with hash-based base image change detection - Add hash label to base image for reliable cache invalidation detection - Use remote image inspection to determine if base image rebuild is needed - Include Dockerfile.base in hash calculation for proper dependency tracking	2025-12-19 13:09:56 +08:00
fawney19	97425ac68f	refactor: make stream smoothing parameters configurable and add models cache invalidation - Move stream smoothing parameters (chunk_size, delay_ms) to database config - Remove hardcoded stream smoothing constants from StreamProcessor - Simplify dynamic delay calculation by using config values directly - Add invalidate_models_list_cache() function to clear /v1/models endpoint cache - Call cache invalidation on model create, update, delete, and bulk operations - Update admin UI to allow runtime configuration of smoothing parameters - Improve model listing freshness when models are modified	2025-12-19 11:03:46 +08:00
fawney19	070121717d	refactor: consolidate stream smoothing into StreamProcessor with intelligent timing - Move StreamSmoother functionality directly into StreamProcessor for better integration - Create ContentExtractor strategy pattern for format-agnostic content extraction - Implement intelligent dynamic delay calculation based on text length - Support three text length tiers: short (char-by-char), medium (chunked), long (chunked) - Remove manual chunk_size and delay_ms configuration - now auto-calculated - Simplify admin UI to single toggle switch with auto timing adjustment - Extract format detection logic to reusable content_extractors module - Improve code maintainability with cleaner architecture	2025-12-19 09:46:22 +08:00
fawney19	85fafeacb8	feat: add stream smoothing feature for improved user experience - Implement StreamSmoother class to split large content chunks into smaller pieces with delay - Support OpenAI, Claude, and Gemini API response formats for smooth playback - Add stream smoothing configuration to system settings (enable, chunk size, delay) - Create streamlined API for stream smoothing with StreamSmoothingConfig dataclass - Add admin UI controls for configuring stream smoothing parameters - Use batch configuration loading to minimize database queries - Enable typing effect simulation for better user experience in streaming responses	2025-12-19 03:15:19 +08:00
fawney19	5f0c1fb347	refactor: remove unused response normalizer module - Delete unused ResponseNormalizer class and its initialization logic - Remove response_normalizer and enable_response_normalization parameters from handlers - Simplify chat adapter base initialization by removing normalizer setup - Clean up unused imports in handler modules	2025-12-19 01:20:30 +08:00
fawney19	7b932d7afb	refactor: optimize middleware with pure ASGI implementation and enhance security measures - Replace BaseHTTPMiddleware with pure ASGI implementation in plugin middleware for better streaming response handling - Add trusted proxy count configuration for client IP extraction in reverse proxy environments - Implement audit log cleanup scheduler with configurable retention period - Replace plaintext token logging with SHA256 hash fingerprints for security - Fix database session lifecycle management in middleware - Improve request tracing and error logging throughout the system - Add comprehensive tests for pipeline architecture	2025-12-18 19:07:20 +08:00
fawney19	21587449c8	fix: improve error classification and logging system - Enhance error classifier to properly handle API key failures with fallback support - Add error reason/code parsing for better AWS and multi-provider compatibility - Improve error message structure detection for non-standard formats - Refactor file logging with size-based rotation (100MB) instead of daily - Optimize production logging by disabling backtrace and diagnose - Clean up model validation and remove redundant configurations	2025-12-18 10:57:31 +08:00
fawney19	b2a857c164	refactor: consolidate transaction management and remove legacy modules - Remove unused context.py module (replaced by request.state) - Remove provider_cache.py (no longer needed) - Unify environment loading in config/settings.py instead of __init__.py - Add deprecation warning for get_async_db() (consolidating on sync Session) - Enhance database.py documentation with comprehensive transaction strategy - Simplify audit logging to reuse request-level Session (no separate connections) - Extract UsageService._build_usage_params() helper to reduce code duplication - Update model and user cache implementations with refined transaction handling - Remove unnecessary sessionmaker from pipeline - Clean up audit service exception handling	2025-12-18 01:59:40 +08:00
fawney19	4d1d863916	refactor: improve authentication and user data handling - Replace user cache queries with direct database queries to ensure data consistency - Fix token_type parameter in verify_token calls (access token verification) - Fix role-based permission check using dictionary ranking instead of string comparison - Fix logout operation to use correct JWT claim name (user_id instead of sub) - Simplify user authentication flow by removing unnecessary cache layer - Optimize session initialization in main.py using create_session helper - Remove unused imports and exception variables	2025-12-18 01:09:22 +08:00
fawney19	b579420690	refactor: optimize database session lifecycle and middleware architecture - Improve database pool capacity logging with detailed configuration parameters - Optimize database session dependency injection with middleware-managed lifecycle - Simplify plugin middleware by delegating session creation to FastAPI dependencies - Fix import path in auth routes (relative to absolute) - Add safety checks for database session management across middleware exception handlers - Ensure session cleanup only when not managed by middleware (avoid premature cleanup)	2025-12-18 00:35:46 +08:00
fawney19	9d5c84f9d3	refactor: add scheduling mode support and optimize system settings UI - Add fixed_order and cache_affinity scheduling modes to CacheAwareScheduler - Only apply cache affinity in cache_affinity mode; use fixed order otherwise - Simplify Dialog components with title/description props - Remove unnecessary button shadows in SystemSettings - Optimize import dialog UI structure - Update ModelAliasesTab shadow styling - Fix fallback orchestrator type hints - Add scheduling_mode configuration in system config	2025-12-17 19:15:08 +08:00
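fawney19	bd11ebdbd5	fix: fix dark mode setting on the personal settings page being lost after refresh - Frontend uses the useDarkMode composable to unify theme-switching logic - Backend supports the system theme value (previously only auto was supported) - Local localStorage is the source of truth for the theme, so a stale server value does not overwrite it on refresh Fixes #22	2025-12-17 18:02:19 +08:00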
fawney19	1dac4cb156	refactor: optimize provider query and stats aggregation logic	2025-12-17 16:41:10 +08:00
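fawney19	33265b4b13	refactor(global-model): migrate model metadata to flexible config structure Consolidate model configuration from multiple fixed fields (description, official_url, icon_url, default_supports_*, etc.) into a flexible config JSON field for better extensibility. Also improve the frontend model creation form so a model can be picked directly from the models-dev list to prefill it. Main changes: - Backend: migrate the model table so the config JSON stores model capabilities and metadata - Frontend: GlobalModelFormDialog supports two creation modes (pick from list / fill in manually) - Update API types to match the new data structure	2025-12-16 12:21:21 +08:00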
fawney19	4e2ba0e57f	feat(usage): add first_byte_time_ms tracking to usage statistics - Enhance usage service to capture and store first byte latency metrics - Update usage API routes to include new timing information	2025-12-16 02:39:36 +08:00
fawney19	718f56ba75	refactor(cache): optimize cache service architecture and provider transport	2025-12-15 23:12:34 +08:00
fawney19	f16fb28405	feat(model): improve cache invalidation for model updates and deletion - Handle both old and new aliases when invalidating cache during model updates - Preserve cache info before deletion to properly invalidate after deletion - Clear both Redis and in-memory caches on model changes	2025-12-15 20:39:39 +08:00
fawney19	a0ffc2c406	refactor(metrics): rename model_alias_* to model_mapping_* for clarity	2025-12-15 20:39:32 +08:00
fawney19	a7bfab1475	debug: add logging for model support checking and refactor cache resolution priority - Add debug logging in aware_scheduler.py to trace the model support check - Refactor alias resolution in model_cache.py: change the priority to alias > provider_model_name > direct_match - Optimize the cache-hit path by running direct matching only after alias matching fails	2025-12-15 18:52:34 +08:00
fawney19	84d4db0f8d	feat(model): include alias info in cache invalidation - Pass provider_model_name to invalidate_model_cache() when creating models - Pass provider_model_aliases to invalidate_model_cache() when updating models - Ensures alias-based resolve cache keys are properly cleared on model changes	2025-12-15 18:27:49 +08:00
fawney19	903b182fdf	fix(scheduler): correct whitelist validation logic - Use 'is not None' instead of truthiness check for allowed_api_formats - Use 'is not None' instead of truthiness check for allowed_models - Use 'is not None' instead of truthiness check for allowed_providers - Use 'is not None' check for allowed_endpoints to distinguish empty list from None - Fixes issue where empty whitelist (empty list) was incorrectly treated as no restriction	2025-12-15 18:27:41 +08:00
fawney19	d9bd0790fe	feat(cache): improve model cache invalidation for alias resolution - Add provider_model_name and provider_model_aliases to invalidate_model_cache() - Clear resolve cache keys for both model name and aliases when invalidating - Also clear resolve cache in invalidate_global_model_cache() for GlobalModel names - Handle SQLite gracefully by catching OperationalError and ProgrammingError - Optimize fallback query to pre-filter by provider_model_name when JSONB fails	2025-12-15 18:27:31 +08:00
fawney19	11774c69b6	refactor(mapper): use model alias resolution service - Replace direct GlobalModel.name lookup with ModelCacheService.resolve_global_model_by_name_or_alias() - Support model aliases in source_model parameter - Leverage model resolution caching for better performance	2025-12-15 18:13:41 +08:00
fawney19	8f0a0cbdb1	refactor(scheduler): integrate model alias resolution - Use ModelCacheService.resolve_global_model_by_name_or_alias() for model lookups - Support both requested model name and resolved GlobalModel name in validation - Track resolved_model_name for proper allow_models checking - Improve model availability checks to handle alias resolution - Fix transient/detached object handling in global_model merge - Add more descriptive debug logs for alias resolution mismatches - Clean up code formatting (line length, imports organization)	2025-12-15 18:13:35 +08:00
fawney19	51b85915d2	feat(cache): implement model alias resolution with caching - Add resolve_global_model_by_name_or_alias() supporting direct match and alias lookup - Support both provider_model_name and provider_model_aliases matching - Implement caching for resolved models with TTL - Add conflict detection when alias maps to multiple GlobalModels - Record resolution metrics: method, cache hits, duration, conflicts - Fallback to Python-level filtering for non-PostgreSQL databases - Add cache invalidation methods for GlobalModel	2025-12-15 18:13:28 +08:00
fawney19	7068aa9130	refactor(backend): optimize cache system and model/provider services	2025-12-15 14:30:21 +08:00
fawney19	728f9bb126	refactor(backend): remove model mappings module	2025-12-15 14:30:00 +08:00
fawney19	2f9d943647	fix(system): fix timezone handling in dashboard and stats services - Use app timezone instead of UTC for date calculations in dashboard routes - Ensure consistency between stats_daily.date and timezone-aware comparisons - Fix date calculations in cleanup scheduler to handle DST correctly - Update log message in stats aggregator to use business date	2025-12-14 00:16:03 +08:00
fawney19	7d0003e61e	refactor(backend): optimize usage service and database helpers	2025-12-14 00:16:03 +08:00
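fawney19	53bf74429e	refactor: restructure the stream processing module, extracting StreamContext/Processor/Telemetry - Split the streaming logic in chat_handler_base.py into three independent modules: - StreamContext: type-safe streaming context dataclass, replacing the former ctx dict - StreamProcessor: SSE parsing, prefetch, nested error detection - StreamTelemetryRecorder: statistics recording (Usage/Audit/Candidate) - Move hardcoded configuration into settings.py, overridable via environment variables: - HTTP timeout settings (connect/write/pool) - Stream processing settings (prefetch line count, statistics delay) - Concurrency control settings (slot TTL, cache reservation ratio)	2025-12-12 15:42:45 +08:00
fawney19	39defce71c	fix: fix timezone issue in stats aggregation and automatically backfill missing data on startup - Stats aggregation computes dates in the business timezone (APP_TIMEZONE) instead of UTC - Add _get_business_day_range() to convert a business date to a UTC time range - On startup, check for and automatically backfill stats missing due to container restarts or similar causes - Fix timezone calculations in aggregate_daily_stats/update_summary/get_today_realtime_stats and related methods	2025-12-12 10:06:07 +08:00
fawney19	5722b1422e	fix: automatically backfill missing stats data on startup	2025-12-12 09:48:17 +08:00
fawney19	0e8bf0a23b	feat: color the request-interval scatter plot by model - Backend get_interval_timeline endpoint adds a model field to its response data - Frontend scatter plot groups data points by model and shows each group in a different color - Horizontal-line statistics can be shown separately per model - Admin view stays grouped by user; user view is grouped by model - Update mock data to support the model field	2025-12-11 21:33:39 +08:00
fawney19	6e8107e340	fix: fix admin scatter plot showing only some users - Switch to proportional sampling so each user's share of the data is preserved - Change the scatter plot's default time range from 7 days to the current day (24 hours) - Raise limit from 2000 to 10000	2025-12-11 19:34:56 +08:00
fawney19	abc41c7d3c	feat: add cache monitoring and usage statistics API endpoints	2025-12-11 17:47:59 +08:00
fawney19	323a514f77	refactor: optimize active request status query logic - Rename get_active_requests to get_active_requests_status - Support reading the timeout from endpoint configuration - Add content_length_limit error type	2025-12-11 10:45:06 +08:00
fawney19	913a87d7f3	refactor: move active request query logic into UsageService - Add a get_active_requests method to UsageService to handle active request queries in one place - Support automatic cleanup of timed-out pending requests (default 5 minutes) - Both admin and user endpoints reuse this method, reducing duplicated code - Support querying by ID list or querying all active requests	2025-12-11 10:04:15 +08:00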
fawney19	f784106826	Initial commit	2025-12-10 20:52:44 +08:00

47 Commits
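
Commit 903b182fdf describes replacing truthiness checks with 'is not None' so that an empty whitelist is no longer treated as "no restriction". The sketch below illustrates that distinction only; the helper names is_allowed and is_allowed_buggy are hypothetical and are not taken from the Aether source, and the real scheduler code may be structured differently.

```python
from typing import Optional, Sequence


def is_allowed(value: str, whitelist: Optional[Sequence[str]]) -> bool:
    """Hypothetical helper (not from the Aether source).

    None means "no restriction"; an empty list means "nothing is allowed".
    """
    if whitelist is not None:
        return value in whitelist
    return True


def is_allowed_buggy(value: str, whitelist: Optional[Sequence[str]]) -> bool:
    """Truthiness check: an empty list is falsy, so it behaves like None."""
    if whitelist:
        return value in whitelist
    return True


if __name__ == "__main__":
    # With the 'is not None' check, an empty whitelist rejects everything.
    print(is_allowed("model-a", []))
    # With the truthiness check, an empty whitelist lets everything through.
    print(is_allowed_buggy("model-a", []))
```

The same pattern would apply to each of the fields the commit names (allowed_api_formats, allowed_models, allowed_providers, allowed_endpoints), assuming each is either None or a list.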