next-ai-draw-io

mirror of https://github.com/DayuanJiang/next-ai-draw-io.git synced 2026-01-02 22:32:27 +08:00

Author	SHA1	Message	Date
Dayuan Jiang	82f4deb23a	fix: quota daily reset bug and add timezone support (#390 ) - Fixed bug where daily quota counts weren't resetting on new day (if_not_exists only works for missing attributes, not day changes) - Changed to two-phase approach: reset if new day, then increment - Added QUOTA_TIMEZONE env var for local midnight reset (e.g., Asia/Tokyo) - Added timezone validation with UTC fallback	2025-12-24 10:34:54 +09:00
Dayuan Jiang	0d2e7a7ad6	fix: escape HTML in XML attribute values to prevent parse errors (#386 ) - Add HTML escaping (<, >) in convertToLegalXml for attribute values - Update isMxCellXmlComplete to handle any LLM provider's wrapper tags - Add wrapper tag stripping in wrapWithMxFile for DeepSeek/Anthropic tags - Update autoFixXml to escape both < and > in attribute values Fixes 'Malformed XML detected in final output' error when AI generates diagrams with HTML content in value attributes like <b>Title</b>.	2025-12-24 09:31:54 +09:00
Dayuan Jiang	b2dfd5b890	fix: display correct quota values in limit toast (#383 ) - Parse JSON error response from server to get actual used/limit values - Previously showed 0/0 due to race condition (config fetch vs error) - AI SDK puts full response body in error.message for non-OK responses - Updated all quota toasts (request, token, TPM) to use server values	2025-12-23 21:08:21 +09:00
Dayuan Jiang	72d647de7a	fix: use Chat Completions API for OpenAI-compatible proxies (#382 ) Third-party OpenAI-compatible proxies typically don't support the /responses endpoint. Use .chat() for custom baseURLs while keeping Responses API for official OpenAI to preserve reasoning model support. Fixes #377	2025-12-23 20:29:48 +09:00
Dayuan Jiang	7de192e1fa	fix: enable progressive diagram rendering during streaming (#380 ) - Add extractCompleteMxCells() to extract only complete mxCell elements from partial XML - Remove useEffect cleanup that was killing debounce timeouts on every re-render - Wrap XML in <root> tags for proper DOMParser validation Previously, diagrams only rendered after ALL XML finished streaming because: 1. useEffect cleanup cleared the 150ms debounce timeout on every message change 2. DOMParser rejected partial XML like '<mxCell id="2" value="...' (incomplete) Now each complete mxCell renders progressively as it finishes streaming.	2025-12-23 18:54:03 +09:00
Dayuan Jiang	97ae9395cd	feat: add server-side quota tracking with DynamoDB (#379 ) - Add dynamo-quota-manager.ts for atomic quota checks using ConditionExpression - Enforce daily request limit, daily token limit, and TPM limit - Return 429 with quota details (type, used, limit) when exceeded - Quota is opt-in: only enabled when DYNAMODB_QUOTA_TABLE env var is set - Remove client-side quota enforcement (server is now source of truth) - Simplify use-quota-manager.tsx to only display toasts - Add @aws-sdk/client-dynamodb dependency	2025-12-23 18:36:27 +09:00
Dayuan Jiang	5ec05eb100	refactor: simplify Langfuse integration with AI SDK 6 (#375 ) - Remove manual token attribute setting (AI SDK 6 telemetry auto-reports) - Use totalTokens directly instead of inputTokens + outputTokens calculation - Fix sessionId bug in log-save/log-feedback (prevents wrong trace attachment) - Hash IP addresses for privacy instead of storing raw IPs - Fix isLangfuseEnabled() to check both keys for consistency	2025-12-23 16:26:45 +09:00
Dayuan Jiang	0385c45a10	fix: OpenAI reasoning/thinking blocks not showing (#370 ) - Use Responses API instead of Chat Completions API for OpenAI (.chat() -> default call) to support reasoning events - Add o4 to reasoning model detection - Change default reasoningSummary from 'detailed' to 'auto' (not all models support 'detailed') - Update types to match AI SDK: 'auto' \| 'detailed'	2025-12-23 13:38:50 +09:00
Dayuan Jiang	8cb7494d16	feat(i18n): add translations for model configuration UI (#368 ) - Add ~40 new translation keys for model-config-dialog and model-selector - Support English, Chinese, and Japanese translations - Replace all hardcoded strings with dictionary lookups	2025-12-23 11:42:27 +09:00
Biki Kalita	84959637db	Support subdirectory deployment and fix API path handling (#311 ) * feat: support subdirectory deployment (NEXT_PUBLIC_BASE_PATH) * removed unwanted check and fix favicon issue * Use getAssetUrl for manifest assets to avoid undefined NEXT_PUBLIC_BASE_PATH * Add validation warning for NEXT_PUBLIC_BASE_PATH format --------- Co-authored-by: dayuan.jiang <jdy.toh@gmail.com>	2025-12-22 23:28:55 +09:00
pointerhacker	9e9ea10beb	fix:feature/sglang-provider (#302 ) Co-authored-by: zhaochaojin <zhaochaojin@didiglobal.com> Co-authored-by: dayuan.jiang <jdy.toh@gmail.com>	2025-12-22 23:13:45 +09:00
Biki Kalita	deae5c2c38	Fix: Localize TPM rate-limit toast via i18n (#353 ) * TMP error toast hardcoded english fixed * fix: correct JA/ZH translations to use tokens instead of requests --------- Co-authored-by: dayuan.jiang <jdy.toh@gmail.com>	2025-12-22 23:00:20 +09:00
Twelveeee	6e2d98e52d	move Language Selector into SettingDialog (#352 ) * fix:custom model setting bug * refactor: consolidate aiProvider checks for cleaner code * fix:Integrated the language selection option into the `SettingsDialog` * fix:useSearchParams() should be wrapped in a suspense boundary at page * fix: improve semantic HTML and maintainability - Replace nested button>a with proper anchor element for GitHub link - Use i18n.locales.map() with LANGUAGE_LABELS for language options --------- Co-authored-by: dayuan.jiang <jdy.toh@gmail.com>	2025-12-22 22:54:25 +09:00
Dayuan Jiang	85cb441e26	feat: multi-provider model configuration with UI/UX improvements (#355 ) * feat: add multi-provider model configuration - Add model config dialog for managing multiple AI providers - Support for OpenAI, Anthropic, Google, Azure, Bedrock, OpenRouter, DeepSeek, SiliconFlow, Ollama, and AI Gateway - Add model selector dropdown in chat panel header - Add API key validation endpoint - Add custom model ID input with keyboard navigation - Fix hover highlight in Command component - Add suggested models for each provider including latest Claude 4.5 series - Store configuration locally in browser * feat: improve model config UI and move selector to chat input - Move model selector from header to chat input (left of send button) - Add per-model validation status (queued, running, valid, invalid) - Filter model selector to only show verified models - Add editable model IDs in config dialog - Add custom model input field alongside suggested models dropdown - Fix hover states on provider buttons and select triggers - Update OpenAI suggested models with GPT-5 series - Add alert-dialog component for delete confirmation * refactor: revert shadcn component changes, apply hover fix at usage site * feat: add AWS credentials support for Bedrock provider - Add AWS Access Key ID, Secret Access Key, Region fields for Bedrock - Show different credential fields based on provider type - Update validation API to handle Bedrock with AWS credentials - Add region selector with common AWS regions * fix: reset Test button after validation completes * fix: reset validation button to Test after success * fix: complete bedrock support and UI/UX improvements - Add bedrock to ALLOWED_CLIENT_PROVIDERS for client credentials - Pass AWS credentials through full chain (headers → API → provider) - Replace non-existent GPT-5 models with real ones (o1, o3-mini) - Add accessibility: aria-labels, focus-visible rings, inline errors - Add more AWS regions (Ohio, London, Paris, Mumbai, Seoul, São Paulo) - Fix setTimeout cleanup with useRef on component unmount - Fix TypeScript type consistency in getSelectedAIConfig fallback * chore: remove unused code - Remove unused setAccessCodeRequired state in chat-panel.tsx - Remove unused getSelectedModel export in model-config.ts * fix: UI/UX improvements for model configuration dialog - Add gradient header styling with icon badge - Change Configuration section icon from Key to Settings2 - Add duplicate model detection with warning banner and inline removal - Filter out already-added models from suggestions dropdown - Add type-to-confirm for deleting providers with 3+ models - Enhance delete confirmation dialog with warning icon - Improve model selector discoverability (show model name + chevron) - Add truncation for long model names with title tooltip - Remove AI provider settings from Settings dialog (now in Model Config) - Extract ValidationButton into reusable component * fix: prevent duplicate model IDs within same provider - Block adding model if ID already exists in provider - Block editing model ID to match existing model in provider * fix: improve duplicate model ID notifications - Add toast notification when trying to add duplicate model - Allow free typing when editing model ID, validate on blur - Show warning toast instead of blocking input * fix: improve duplicate model validation UX in config dialog - Add inline error display for duplicate model IDs - Show red border on input when error exists - Validate on blur with shake animation for edit errors - Prevent saving empty model names - Clear errors when user starts typing - Simplify error styling (small red text, no heavy chips)	2025-12-22 22:36:36 +09:00
Dayuan Jiang	938faff6b2	feat(mcp): add XML validation and auto-fix to MCP server (#336 ) * feat(mcp): add XML validation and auto-fix to MCP server - Add xml-validation.ts with validateAndFixXml function - Integrate validation into display_diagram tool (fails if unfixable) - Integrate validation into edit_diagram tool (auto-fix each operation) - Fix bug: typo fixes now run before foreign tag removal - Fix bug: use before/after comparison instead of regex .test() * style: auto-format with Biome * chore(mcp): bump version to 0.1.3 --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-12-21 00:32:51 +09:00
Biki Kalita	378bef435e	Add i18n support, language toggle UI, and translate Settings dialog (#334 ) * i18n support added * fix: align i18n implementation with Next.js 16 guide - Rename middleware.ts to proxy.ts (Next.js 16 convention) - Fix params type to Promise<{lang: string}> for layout/metadata - Add 'server-only' directive and dynamic imports to dictionaries.ts - Add hasLocale type guard and notFound() for invalid locales - Wrap LanguageToggle in Suspense for useSearchParams - Fix dictionary key mismatch (learnmore -> learnMore) - Improve Chinese translations per Gemini review: - loading ellipsis, new -> 新建, styledMode -> 精致 - goodResponse/badResponse -> 有帮助/无帮助 - closeProtection -> 关闭确认, fileExceeds phrasing - Improve Japanese translations per Gemini review: - closeProtection -> ページ離脱確認 - invalidAccessCode phrasing, appendDiagram -> に追加 - styledMode -> スタイル付き --------- Co-authored-by: dayuan.jiang <jdy.toh@gmail.com>	2025-12-20 14:48:54 +00:00
Dayuan Jiang	f087b54ee4	feat: add get_shape_library tool for AI icon discovery (#335 ) * feat: add get_shape_library tool for AI icon discovery - Add server-side tool that returns shape library documentation - AI can fetch icon/shape names on-demand before generating diagrams - Includes path traversal protection and input sanitization - Library index embedded in tool description for discoverability - Supports 33 libraries: AWS, Azure, GCP, Kubernetes, Cisco, etc. * fix: improve get_shape_library error handling and imports - Move fs/path imports to top of file (avoid dynamic imports per call) - Distinguish file-not-found vs other errors in catch block - Include invalid input in validation error message - Log unexpected errors for debugging * docs: add get_shape_library to system prompt tool list - Add Tool4 (get_shape_library) to available tools section - Add usage guidance in 'Choose the right tool' section - Update AWS icons note to reference get_shape_library for icon discovery * fix: display get_shape_library tool output in chat UI * fix: correct state check for get_shape_library output display * fix: make get_shape_library output respect fold state * style: auto-format with Biome --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-12-20 23:19:49 +09:00
RainX	a91bd9d1e8	feat: add support for custom AI Gateway base URL (#315 ) * feat: add support for custom AI Gateway base URL - Add createGateway support with configurable baseURL - Allow AI_GATEWAY_BASE_URL environment variable for: * Local development with custom Gateway * Self-hosted AI Gateway deployments * Enterprise proxy configurations - Maintain backward compatibility: defaults to Vercel Gateway when not set - Update documentation with usage examples and configuration notes 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * fix: remove errant character in error message --------- Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com> Co-authored-by: dayuan.jiang <jdy.toh@gmail.com>	2025-12-18 22:00:21 +09:00
Ted Cao	98b890bb06	feat: add Vercel AI Gateway support (#274 ) * feat: add Vercel AI Gateway support - Updated environment configuration to include AI_GATEWAY_API_KEY for unified access to multiple AI providers. - Added gateway provider to the list of supported AI providers in the codebase. - Enhanced documentation to explain the usage of Vercel AI Gateway and its model format. This change simplifies authentication and allows users to switch between providers seamlessly. * Update package @ai-sdk/gateway to latest version 2.0.21	2025-12-17 12:43:33 +09:00
Dayuan Jiang	cd76fa615e	fix: edit_diagram streaming and JSON repair improvements (#271 ) - Add shared editDiagramOriginalXmlRef between streaming preview and tool handler to avoid conflicts when applying operations (fixes "cell already exists" errors) - Add JSON repair preprocessing to fix LLM-generated malformed JSON like `:=` - Filter out tool calls with invalid/undefined inputs from interrupted streaming - Remove perf console logs	2025-12-15 21:28:31 +09:00
Dayuan Jiang	44840d27b3	fix: prevent SSRF attack via custom base URL (GHSA-9qf7-mprq-9qgm) Require API key when custom base URL is provided to prevent attackers from redirecting server API keys to malicious endpoints. CVSS: 9.3 (Critical)	2025-12-15 15:02:18 +09:00
Dayuan Jiang	f175276872	refactor: replace text-based edit_diagram with ID-based operations (#267 ) * refactor: replace text-based edit_diagram with ID-based operations - Add applyDiagramOperations() function using DOMParser for ID lookup - New schema: operations array with type (update/add/delete), cell_id, new_xml - Update chat-panel.tsx handler for new operations format - Update OperationsDisplay component to show operation type and cell_id - Simplify system prompts with new ID-based examples - Add ID validation for add operations - Add warning for edges referencing deleted cells * fix: add ID validation to update operation and remove dead code - Add ID mismatch validation to update operation (consistency with add) - Remove orphaned replaceXMLParts function (~300 lines of dead code) - Update cell_id schema description for clarity - Add unit tests for applyDiagramOperations (11 tests)	2025-12-15 14:22:56 +09:00
Dayuan Jiang	78a77e102d	fix: prevent browser crash during long streaming sessions (#262 ) - Debounce streaming diagram updates (150ms) to reduce handleDisplayChart calls by 93% - Debounce localStorage writes (1s) to prevent blocking main thread - Limit diagramHistory to 20 entries to prevent unbounded memory growth - Clean up debounce timeout on component unmount to prevent memory leaks - Add console timing markers for performance profiling Fixes #78	2025-12-14 21:23:14 +09:00
Dayuan Jiang	f743219c03	feat: add minimal style mode toggle for faster diagram generation (#260 ) * feat: add minimal style mode toggle for faster diagram generation - Add Minimal/Styled toggle switch in chat input UI - When enabled, removes color/style instructions from system prompt - Faster generation with plain black/white diagrams - Improves XML auto-fix: handle foreign tags, extra closing tags, trailing garbage - Fix isMxCellXmlComplete to strip Anthropic function-calling wrappers - Add debug logging for truncation detection diagnosis * fix: prevent false XML parse errors during streaming - Escape unescaped & characters in convertToLegalXml() before DOMParser validation - Only log console.error for final output, not during streaming updates - Prevents Next.js dev mode error overlay from showing for expected streaming states	2025-12-14 19:38:40 +09:00
Dayuan Jiang	0851b32b67	refactor: simplify LLM XML format to output bare mxCells only (#254 ) * refactor: simplify LLM XML format to output bare mxCells only - Update wrapWithMxFile() to always add root cells (id=0, id=1) automatically - LLM now generates only mxCell elements starting from id=2 (no wrapper tags) - Update system prompts and tool descriptions with new format instructions - Update cached responses to remove root cells and wrapper tags - Update truncation detection to check for complete mxCell endings - Update documentation in xml_guide.md * fix: address PR review issues for XML format refactor - Fix critical bug: inconsistent truncation check using old </root> pattern - Fix stale error message referencing </root> tag - Add isMxCellXmlComplete() helper for consistent truncation detection - Improve regex patterns to handle any attribute order in root cells - Update wrapWithMxFile JSDoc to document root cell removal behavior * fix: handle non-self-closing root cells in wrapWithMxFile regex	2025-12-14 14:04:44 +09:00
Dayuan Jiang	66bd0e5493	feat: add append_diagram tool and improve truncation handling (#252 ) * feat: add append_diagram tool for truncation continuation When LLM output hits maxOutputTokens mid-generation, instead of failing with an error loop, the system now: 1. Detects truncation (missing </root> in XML) 2. Stores partial XML and tells LLM to use new append_diagram tool 3. LLM continues generating from where it stopped 4. Fragments are accumulated until XML is complete 5. Server limits to 5 steps via stepCountIs(5) Key changes: - Add append_diagram tool definition in route.ts - Add append_diagram handler in chat-panel.tsx - Track continuation mode separately from error mode - Continuation mode has unlimited retries (not counted against limit) - Error mode still limited to MAX_AUTO_RETRY_COUNT (1) - Update system prompts to document append_diagram tool * fix: show friendly message and yellow badge for truncated output - Add yellow 'Truncated' badge in UI instead of red 'Error' when XML is incomplete - Show friendly error message for toolUse.input is invalid errors - Built on top of append_diagram continuation feature * refactor: remove debug logs and simplify truncation state - Remove all debug console.log statements - Remove isContinuationModeRef, derive from partialXmlRef.current.length > 0 * docs: fix append_diagram instructions for consistency - Change 'Do NOT include' to 'Do NOT start with' (clearer intent) - Add <mxCell id="0"> to prohibited start patterns - Change 'closing tags </root></mxGraphModel>' to just '</root>' (wrapWithMxFile handles the rest)	2025-12-14 12:34:34 +09:00
Dayuan Jiang	b33e09be05	feat: add XML auto-fix with refined validation logic (#247 ) * feat: add XML auto-fix and improve validator accuracy - Add autoFixXml() to automatically repair common XML issues: - CDATA wrapper removal - Duplicate attribute removal - Unescaped & and < character escaping - Invalid entity reference fixing - Unclosed tag completion - Nested mxCell flattening - Duplicate ID renaming - Improve validateMxCellStructure() with DOM + regex approach: - Use DOMParser for syntax error detection (94% recall) - Add regex checks for edge cases - Stateful parser for handling > in attribute values - Integrate validateAndFixXml() in chat-message-display and diagram-context - Auto-repair invalid XML before loading - Log fixes applied for debugging Metrics: 99.77% accuracy, 94.06% recall, 94.4% auto-fix success rate * fix: improve XML auto-fix from 58.7% to 99% fix rate Key improvements: - Reorder CDATA removal to run before text-before-root check (+35 cases) - Implement Gemini's backslash-quote fix with regex backreference Handles attr="value", value="text\"inner\"more", and mixed patterns - Add aggressive drop-broken-cells fix for unfixable mxCell elements Iteratively removes cells causing DOM parse errors (up to 50) Results on 9,411 XML dataset: - 206 invalid XMLs detected - 204 successfully fixed (99.0% fix rate) - 2 unfixable (completely broken, need regeneration) * refactor: extract XML validation/fix helpers and add constants - Add constants: MAX_XML_SIZE (1MB), MAX_DROP_ITERATIONS (10), STRUCTURAL_ATTRS, VALID_ENTITIES - Extract parseXmlTags helper for shared tag parsing logic - Extract validation helpers: checkDuplicateAttributes, checkDuplicateIds, checkTagMismatches, checkCharacterReferences, checkEntityReferences, checkNestedMxCells - Simplify validateMxCellStructure from ~200 lines to ~55 lines - Add logging to empty catch block in DOMParser section - Add size warning for large XML documents - Remove unused variables (isSelfClose, duplicate idPattern) * fix: improve XML auto-fix with malformed quote pattern - Fix ="..." pattern where " was used as delimiter instead of actual quotes - Common in dashPattern attributes like dashPattern="1 1;"	2025-12-13 23:31:01 +09:00
dayuan.jiang	6024443816	fix: improve XML auto-fix from 58.7% to 99% fix rate Key improvements: - Reorder CDATA removal to run before text-before-root check (+35 cases) - Implement Gemini's backslash-quote fix with regex backreference Handles attr="value", value="text\"inner\"more", and mixed patterns - Add aggressive drop-broken-cells fix for unfixable mxCell elements Iteratively removes cells causing DOM parse errors (up to 50) Results on 9,411 XML dataset: - 206 invalid XMLs detected - 204 successfully fixed (99.0% fix rate) - 2 unfixable (completely broken, need regeneration)	2025-12-13 16:11:48 +09:00
dayuan.jiang	4b838fd6d5	feat: add XML auto-fix and improve validator accuracy - Add autoFixXml() to automatically repair common XML issues: - CDATA wrapper removal - Duplicate attribute removal - Unescaped & and < character escaping - Invalid entity reference fixing - Unclosed tag completion - Nested mxCell flattening - Duplicate ID renaming - Improve validateMxCellStructure() with DOM + regex approach: - Use DOMParser for syntax error detection (94% recall) - Add regex checks for edge cases - Stateful parser for handling > in attribute values - Integrate validateAndFixXml() in chat-message-display and diagram-context - Auto-repair invalid XML before loading - Log fixes applied for debugging Metrics: 99.77% accuracy, 94.06% recall, 94.4% auto-fix success rate	2025-12-13 16:11:47 +09:00
Dayuan Jiang	a0f163fe9e	fix: improve Azure provider auto-detection and validation (#223 ) (#225 ) * fix: improve Azure provider auto-detection and validation (#223) - Fix detectProvider() to only detect Azure when it has complete config (both AZURE_API_KEY and AZURE_RESOURCE_NAME or AZURE_BASE_URL) - Add validation in validateProviderCredentials() for Azure to provide clear error messages when configuration is incomplete - Update docs/ai-providers.md to clarify Azure requires resource name * docs: add Azure reasoning options to documentation	2025-12-11 21:49:50 +09:00
Dayuan Jiang	869391a029	refactor: eliminate code duplication (DRY principle) (#211 ) ## Problem Solved Previous refactoring added 105 lines (1476→1581) by extracting code into separate files without eliminating duplication. This refactor focuses on reducing code size through deduplication while maintaining file separation for maintainability. ## Summary - Reduced total lines from 1581 to 1519 (-62 lines, 3.9% reduction) - Eliminated duplicate patterns using generic helpers and factory functions - Maintained file structure for maintainability - Zero functional changes - same behavior ### Phase 1: DRY use-quota-manager.tsx - Created parseStorageCount() helper (eliminates 6x localStorage read duplication) - Created createQuotaChecker() factory (consolidates 3 check function bodies) - Created createQuotaIncrementer() factory (consolidates 3 increment function bodies) - Result: 242→247 lines (+5 lines, but fully DRY with eliminated duplication) ### Phase 2: DRY chat-panel.tsx (1176→1109 lines, -67 lines) #### 2.1: Extract checkAllQuotaLimits helper - Replaced 3 occurrences of 18-line quota check blocks - Saved 36 lines #### 2.2: Extract sendChatMessage helper - Replaced 3 occurrences of 21-line sendMessage+headers blocks - Saved 42 lines #### 2.3: Extract processFilesAndAppendContent helper - Replaced 2 occurrences of file processing loops - Handles PDF, text, and image files uniformly - Async helper with optional image parts parameter	2025-12-11 14:28:02 +09:00
Dayuan Jiang	8b9336466f	feat: make PDF/text extraction char limit configurable via env (#214 ) Add NEXT_PUBLIC_MAX_EXTRACTED_CHARS environment variable to allow configuring the maximum characters extracted from PDF and text files. Defaults to 150000 (150k chars) if not set.	2025-12-11 14:14:31 +09:00
Dayuan Jiang	ee514efa9e	fix: implement AZURE_RESOURCE_NAME config for Azure OpenAI (#213 ) Previously AZURE_RESOURCE_NAME was documented in env.example but not actually used in the code. This caused Azure OpenAI configuration to fail when users set AZURE_RESOURCE_NAME instead of AZURE_BASE_URL. Changes: - Read AZURE_RESOURCE_NAME from environment and pass to createAzure() - resourceName constructs endpoint: https://{name}.openai.azure.com/openai/v1 - baseURL takes precedence over resourceName when both are set - Updated env.example with clearer documentation Fixes #208	2025-12-11 13:32:33 +09:00
Biki Kalita	a047a6ff97	feat: Display AI reasoning/thinking blocks in chat interface (#152 ) * feat: Add reasoning/thinking blocks display in chat interface * feat: add multi-provider options support and replace custom reasoning UI with AI Elements * resolve conflicting reasoning configs and correct provider-specific reasoning parameters * try to solve conflict * fix: simplify reasoning display and remove unnecessary dependencies - Remove Streamdown dependency (~5MB) - reasoning is plain text only - Fix Bedrock providerOptions merging for Claude reasoning configs - Remove unsupported DeepSeek reasoning configuration - Clean up unused environment variables (REASONING_BUDGET_TOKENS, REASONING_EFFORT, DEEPSEEK_REASONING_) - Remove dead commented code from route.ts Reasoning blocks contain plain thinking text and don't need markdown/diagram/code rendering. feat: comprehensive reasoning support improvements Major improvements: - Auto-enable reasoning display for all supported models - Fix provider-specific reasoning configurations - Remove unnecessary Streamdown dependency (~5MB) - Clean up debug logging Provider changes: - OpenAI: Auto-enable reasoningSummary for o1/o3/gpt-5 models - Google: Auto-enable includeThoughts for Gemini 2.5/3 models - Bedrock: Restrict reasoningConfig to only Claude/Nova (fixes MiniMax error) - Ollama: Add thinking support for qwen3-like models Other improvements: - Remove ENABLE_REASONING toggle (always enabled) - Fix Bedrock providerOptions merging for Claude - Simplify reasoning component (plain text rendering) - Clean up unused environment variables * fix: critical bugs and documentation gaps in reasoning support Critical fixes: - Fix Bedrock shallow merge bug (deep merge preserves anthropicBeta + reasoningConfig) - Add parseInt validation with parseIntSafe helper (prevents NaN errors) - Validate all numeric env vars with min/max ranges Documentation improvements: - Add BEDROCK_REASONING_BUDGET_TOKENS and BEDROCK_REASONING_EFFORT to env.example - Add OLLAMA_ENABLE_THINKING to env.example - Update JSDoc with accurate env var list and ranges Code cleanup: - Remove debug console.log statements from route.ts - Refactor duplicate providerOptions assignments --------- Co-authored-by: Dayuan Jiang <34411969+DayuanJiang@users.noreply.github.com> Co-authored-by: Dayuan Jiang <jdy.toh@gmail.com>	2025-12-11 00:24:43 +09:00
Dayuan Jiang	d2ba133eaf	feat: add PDF and text file upload support (#205 ) - Add client-side PDF text extraction using unpdf library - Support text files (.txt, .md, .json, .csv, .py, .js, .ts, etc.) - Add file preview with character count for PDF/text files - Add 150k character limit for extracted content - Highlight Paper to Diagram example with NEW badge - Fix React hydration error by adding explicit IDs to ResizablePanelGroup - Remove code duplication by centralizing file utilities in pdf-utils.ts	2025-12-10 21:32:35 +09:00
Dayuan Jiang	97ab82e027	feat: add bring-your-own-API-key support (#186 ) - Add AI provider settings to config panel (provider, model, API key, base URL) - Support 7 providers: OpenAI, Anthropic, Google, Azure, OpenRouter, DeepSeek, SiliconFlow - Client API keys stored in localStorage, never stored on server - Client settings override server env vars when provided - Skip server credential validation when client provides API key - Bypass usage limits (request/token/TPM) when using own API key - Add /api/config endpoint for fetching usage limits - Add privacy notices to settings dialog, about pages, and quota toast - Add clear settings button to reset saved API keys - Update README files (EN/CN/JA) with BYOK documentation Co-authored-by: dayuan.jiang <jiangdy@amazon.co.jp>	2025-12-09 17:50:07 +09:00
Dayuan Jiang	967d63c57e	feat: support minimax model (#185 ) * feat: support minimax model with XML wrapping fix - Add wrapWithMxFile utility to properly wrap XML for draw.io - Fix 'Not a diagram file' error when model generates raw <root> XML - Add supportsPromptCaching check for conditional caching - Only enable Bedrock prompt caching for Claude models * docs: update model mention to minimax-m2 across About pages and READMEs - Update tooltip in chat-panel.tsx to mention minimax-m2 model change - Update English, Chinese, and Japanese About pages with model change info - Update English, Chinese, and Japanese READMEs with demo site model note --------- Co-authored-by: dayuan.jiang <jiangdy@amazon.co.jp>	2025-12-09 15:53:59 +09:00
Dayuan Jiang	622829b903	feat: add daily token limit with actual usage tracking (#171 ) * feat: add daily token limit with actual usage tracking - Add DAILY_TOKEN_LIMIT env var for configurable daily token limit - Track actual tokens from Bedrock API response metadata (not estimates) - Server sends inputTokens + cachedInputTokens + outputTokens via messageMetadata - Client increments token count in onFinish callback with actual usage - Add NaN guards to prevent corrupted localStorage values - Add token limit toast notification with quota display - Remove client-side token estimation (was blocking legitimate requests) - Switch to js-tiktoken for client compatibility (pure JS, no WASM) * feat: add TPM (tokens per minute) rate limiting - Add 50k tokens/min client-side rate limit - Track tokens per minute with automatic minute rollover - Check TPM limit after daily limits pass - Show toast when rate limit reached - NaN guards for localStorage values * feat: make TPM limit configurable via TPM_LIMIT env var * chore: restore cache debug logs * fix: prevent race condition in TPM tracking checkTPMLimit was resetting TPM count to 0 when checking, which overwrote the count saved by incrementTPMCount. Now checkTPMLimit only reads and incrementTPMCount handles all writes. * chore: improve TPM limit error message clarity	2025-12-08 18:56:34 +09:00
Dayuan Jiang	95aa4b8a56	chore: remove Amplify integration (#164 ) Co-authored-by: dayuan.jiang <jiangdy@amazon.co.jp>	2025-12-08 11:39:32 +09:00
dayuan.jiang	167f5ed36a	feat: enable recordInputs in Langfuse telemetry Enable full message history recording including XML tool calls for better observability.	2025-12-07 20:58:44 +09:00
Dayuan Jiang	cd8e0e2263	feat: add token counting utility for system prompts (#153 ) Co-authored-by: dayuan.jiang <jiangdy@amazon.co.jp>	2025-12-07 20:33:43 +09:00
QiyuanChen	d8cdd049d1	feat: add SiliconFlow as a supported AI provider (#137 ) * feat: add SiliconFlow as a supported AI provider in documentation and configuration * fix: update SiliconFlow configuration comment to English	2025-12-07 10:22:57 +09:00
Dayuan Jiang	b1bc1a6dc6	feat: auto-save and restore session state (#135 ) - Save and restore chat messages, XML snapshots, session ID, and diagram XML to localStorage - Restore diagram when DrawIO becomes ready (using new onLoad callback) - Change close protection default to false since auto-save handles persistence - Clear localStorage when clearing chat - Handle edge cases: undefined edit fields, empty chartXML, missing access code header	2025-12-07 01:39:09 +09:00
Dayuan Jiang	4be64317b3	feat: enhance system prompts with JSON escaping and edge routing rules (#132 ) - Add JSON escaping warnings to help model generate valid tool calls - Add comprehensive edge routing rules to prevent overlapping lines - Add planning guidance for diagram creation - Update token count estimates in comments Co-authored-by: dayuan.jiang <jiangdy@amazon.co.jp>	2025-12-07 00:40:23 +09:00
Dayuan Jiang	2fac6323f0	fix: add orphaned mxPoint validation and cleanup (#130 ) - Add validation for orphaned mxPoint elements in validateMxCellStructure() - Add cleanup of orphaned mxPoint elements in convertToLegalXml() - Orphaned mxPoints cause 'Could not add object mxPoint' errors in draw.io - mxPoint elements must have 'as' attribute or be inside <Array as="points"> Co-authored-by: dayuan.jiang <jiangdy@amazon.co.jp>	2025-12-07 00:40:19 +09:00
Dayuan Jiang	a415c46b66	feat: improve XML search/replace matching strategies (#129 ) - Add 6th strategy: match by value attribute (label text) - Add 7th strategy: normalized whitespace match - Remove lastProcessedIndex tracking - always search from beginning - Pairs may not be in document order, so sequential tracking was unreliable Co-authored-by: dayuan.jiang <jiangdy@amazon.co.jp>	2025-12-07 00:40:16 +09:00
Dayuan Jiang	e893bd60f9	fix: resolve biome lint errors and memory leak in file preview (#118 ) - Disable noisy biome rules (noExplicitAny, useExhaustiveDependencies, etc.) - Fix memory leak in file-preview-list.tsx with useRef pattern - Separate unmount cleanup into dedicated useEffect - Add ToolPartLike interface for type safety in chat-message-display - Add accessibility attributes (role, tabIndex, onKeyDown) - Replace autoFocus with useEffect focus pattern - Minor syntax improvements (optional chaining, key fixes)	2025-12-06 16:18:26 +09:00
Dayuan Jiang	9aaf9bf31f	refactor: deduplicate system prompts with two-phase composition (#117 )	2025-12-06 12:58:53 +09:00
Dayuan Jiang	150eb1ff63	chore: add Biome for formatting and linting (#116 ) - Add Biome as formatter and linter (replaces Prettier) - Configure Husky + lint-staged for pre-commit hooks - Add VS Code settings for format on save - Ignore components/ui/ (shadcn generated code) - Remove semicolons, use 4-space indent - Reformat all files to new style	2025-12-06 12:46:40 +09:00
Dayuan Jiang	e00938d9d3	feat: enhance system prompt with app context and dynamic model name (#114 ) - Add App Context section describing the left/right panel layout - Add App Features section with icon locations (history, theme, upload, export, clear) - Dynamically inject model name into system prompt via {{MODEL_NAME}} placeholder - Expand edit_diagram tool description with usage guidelines	2025-12-06 12:37:37 +09:00

1 2

76 Commits