Aether

i/Aether

mirror of https://github.com/fawney19/Aether.git synced 2026-01-11 20:18:30 +08:00

Author	SHA1	Message	Date
fawney19	d378630b38	perf: 添加多层缓存优化减少数据库查询 - 新增 ProviderCacheService 缓存 Provider 和 ProviderAPIKey 数据 - SystemConfigService 添加进程内缓存（TTL 60秒） - API Key last_used_at 更新添加节流策略（60秒间隔） - HTTP 连接池配置改为可配置，支持根据 Worker 数量自动计算 - 前端优先级管理改用 health_score 显示健康度	2026-01-08 02:34:59 +08:00
fawney19	a12b43ce5c	refactor: 清理数据库字段命名歧义 - users 表：重命名 allowed_endpoints 为 allowed_api_formats（修正历史命名错误） - api_keys 表：删除 allowed_endpoints 字段（未使用的功能） - providers 表：删除 rate_limit 字段（与 rpm_limit 重复） - usage 表：重命名 provider 为 provider_name（避免与 provider_id 外键混淆）同步更新前后端所有相关代码	2026-01-07 19:53:32 +08:00
fawney19	d0ce798881	fix: TTL=0时启用Key随机轮换模式 - 当所有Key的cache_ttl_minutes都为0时，使用随机排序代替确定性哈希 - 将hashlib和random的import移到文件顶部 - 简化单Key场景的处理逻辑 Closes #57	2025-12-28 19:07:25 +08:00
fawney19	4e1aed9976	feat: add daily model statistics aggregation with stats_daily_model table	2025-12-20 02:39:10 +08:00
fawney19	21587449c8	fix: improve error classification and logging system - Enhance error classifier to properly handle API key failures with fallback support - Add error reason/code parsing for better AWS and multi-provider compatibility - Improve error message structure detection for non-standard formats - Refactor file logging with size-based rotation (100MB) instead of daily - Optimize production logging by disabling backtrace and diagnose - Clean up model validation and remove redundant configurations	2025-12-18 10:57:31 +08:00
fawney19	b2a857c164	refactor: consolidate transaction management and remove legacy modules - Remove unused context.py module (replaced by request.state) - Remove provider_cache.py (no longer needed) - Unify environment loading in config/settings.py instead of __init__.py - Add deprecation warning for get_async_db() (consolidating on sync Session) - Enhance database.py documentation with comprehensive transaction strategy - Simplify audit logging to reuse request-level Session (no separate connections) - Extract UsageService._build_usage_params() helper to reduce code duplication - Update model and user cache implementations with refined transaction handling - Remove unnecessary sessionmaker from pipeline - Clean up audit service exception handling	2025-12-18 01:59:40 +08:00
fawney19	9d5c84f9d3	refactor: add scheduling mode support and optimize system settings UI - Add fixed_order and cache_affinity scheduling modes to CacheAwareScheduler - Only apply cache affinity in cache_affinity mode; use fixed order otherwise - Simplify Dialog components with title/description props - Remove unnecessary button shadows in SystemSettings - Optimize import dialog UI structure - Update ModelAliasesTab shadow styling - Fix fallback orchestrator type hints - Add scheduling_mode configuration in system config	2025-12-17 19:15:08 +08:00
fawney19	33265b4b13	refactor(global-model): migrate model metadata to flexible config structure 将模型配置从多个固定字段（description, official_url, icon_url, default_supports_* 等）统一为灵活的 config JSON 字段，提高扩展性。同时优化前端模型创建表单，支持从 models-dev 列表直接选择模型快速填充。主要变更： - 后端：模型表迁移，支持 config JSON 存储模型能力和元信息 - 前端：GlobalModelFormDialog 支持两种创建方式（列表选择/手动填写） - API 类型更新，对齐新的数据结构	2025-12-16 12:21:21 +08:00
fawney19	718f56ba75	refactor(cache): optimize cache service architecture and provider transport	2025-12-15 23:12:34 +08:00
fawney19	a0ffc2c406	refactor(metrics): rename model_alias_* to model_mapping_* for clarity	2025-12-15 20:39:32 +08:00
fawney19	a7bfab1475	debug: add logging for model support checking and refactor cache resolution priority - 在 aware_scheduler.py 中添加调试日志，用于跟踪模型支持检查过程 - 重构 model_cache.py 的别名解析逻辑：调整优先级为 alias > provider_model_name > direct_match - 优化缓存命中路径，将直接匹配逻辑移到别名匹配失败后执行	2025-12-15 18:52:34 +08:00
fawney19	903b182fdf	fix(scheduler): correct whitelist validation logic - Use 'is not None' instead of truthiness check for allowed_api_formats - Use 'is not None' instead of truthiness check for allowed_models - Use 'is not None' instead of truthiness check for allowed_providers - Use 'is not None' check for allowed_endpoints to distinguish empty list from None - Fixes issue where empty whitelist (empty list) was incorrectly treated as no restriction	2025-12-15 18:27:41 +08:00
fawney19	d9bd0790fe	feat(cache): improve model cache invalidation for alias resolution - Add provider_model_name and provider_model_aliases to invalidate_model_cache() - Clear resolve cache keys for both model name and aliases when invalidating - Also clear resolve cache in invalidate_global_model_cache() for GlobalModel names - Handle SQLite gracefully by catching OperationalError and ProgrammingError - Optimize fallback query to pre-filter by provider_model_name when JSONB fails	2025-12-15 18:27:31 +08:00
fawney19	8f0a0cbdb1	refactor(scheduler): integrate model alias resolution - Use ModelCacheService.resolve_global_model_by_name_or_alias() for model lookups - Support both requested model name and resolved GlobalModel name in validation - Track resolved_model_name for proper allow_models checking - Improve model availability checks to handle alias resolution - Fix transient/detached object handling in global_model merge - Add more descriptive debug logs for alias resolution mismatches - Clean up code formatting (line length, imports organization)	2025-12-15 18:13:35 +08:00
fawney19	51b85915d2	feat(cache): implement model alias resolution with caching - Add resolve_global_model_by_name_or_alias() supporting direct match and alias lookup - Support both provider_model_name and provider_model_aliases matching - Implement caching for resolved models with TTL - Add conflict detection when alias maps to multiple GlobalModels - Record resolution metrics: method, cache hits, duration, conflicts - Fallback to Python-level filtering for non-PostgreSQL databases - Add cache invalidation methods for GlobalModel	2025-12-15 18:13:28 +08:00
fawney19	7068aa9130	refactor(backend): optimize cache system and model/provider services	2025-12-15 14:30:21 +08:00
fawney19	abc41c7d3c	feat: 添加缓存监控和使用量统计 API 端点	2025-12-11 17:47:59 +08:00
fawney19	f784106826	Initial commit	2025-12-10 20:52:44 +08:00

18 Commits