next-ai-draw-io

mirror of https://github.com/DayuanJiang/next-ai-draw-io.git synced 2026-01-02 22:32:27 +08:00

Author	SHA1	Message	Date
Dayuan Jiang	967d63c57e	feat: support minimax model (#185) * feat: support minimax model with XML wrapping fix - Add wrapWithMxFile utility to properly wrap XML for draw.io - Fix 'Not a diagram file' error when model generates raw <root> XML - Add supportsPromptCaching check for conditional caching - Only enable Bedrock prompt caching for Claude models * docs: update model mention to minimax-m2 across About pages and READMEs - Update tooltip in chat-panel.tsx to mention minimax-m2 model change - Update English, Chinese, and Japanese About pages with model change info - Update English, Chinese, and Japanese READMEs with demo site model note --------- Co-authored-by: dayuan.jiang <jiangdy@amazon.co.jp>	2025-12-09 15:53:59 +09:00
Dayuan Jiang	622829b903	feat: add daily token limit with actual usage tracking (#171) * feat: add daily token limit with actual usage tracking - Add DAILY_TOKEN_LIMIT env var for configurable daily token limit - Track actual tokens from Bedrock API response metadata (not estimates) - Server sends inputTokens + cachedInputTokens + outputTokens via messageMetadata - Client increments token count in onFinish callback with actual usage - Add NaN guards to prevent corrupted localStorage values - Add token limit toast notification with quota display - Remove client-side token estimation (was blocking legitimate requests) - Switch to js-tiktoken for client compatibility (pure JS, no WASM) * feat: add TPM (tokens per minute) rate limiting - Add 50k tokens/min client-side rate limit - Track tokens per minute with automatic minute rollover - Check TPM limit after daily limits pass - Show toast when rate limit reached - NaN guards for localStorage values * feat: make TPM limit configurable via TPM_LIMIT env var * chore: restore cache debug logs * fix: prevent race condition in TPM tracking checkTPMLimit was resetting TPM count to 0 when checking, which overwrote the count saved by incrementTPMCount. Now checkTPMLimit only reads and incrementTPMCount handles all writes. * chore: improve TPM limit error message clarity	2025-12-08 18:56:34 +09:00
Dayuan Jiang	95aa4b8a56	chore: remove Amplify integration (#164) Co-authored-by: dayuan.jiang <jiangdy@amazon.co.jp>	2025-12-08 11:39:32 +09:00
dayuan.jiang	167f5ed36a	feat: enable recordInputs in Langfuse telemetry Enable full message history recording including XML tool calls for better observability.	2025-12-07 20:58:44 +09:00
Dayuan Jiang	cd8e0e2263	feat: add token counting utility for system prompts (#153) Co-authored-by: dayuan.jiang <jiangdy@amazon.co.jp>	2025-12-07 20:33:43 +09:00
QiyuanChen	d8cdd049d1	feat: add SiliconFlow as a supported AI provider (#137) * feat: add SiliconFlow as a supported AI provider in documentation and configuration * fix: update SiliconFlow configuration comment to English	2025-12-07 10:22:57 +09:00
Dayuan Jiang	b1bc1a6dc6	feat: auto-save and restore session state (#135) - Save and restore chat messages, XML snapshots, session ID, and diagram XML to localStorage - Restore diagram when DrawIO becomes ready (using new onLoad callback) - Change close protection default to false since auto-save handles persistence - Clear localStorage when clearing chat - Handle edge cases: undefined edit fields, empty chartXML, missing access code header	2025-12-07 01:39:09 +09:00
Dayuan Jiang	4be64317b3	feat: enhance system prompts with JSON escaping and edge routing rules (#132) - Add JSON escaping warnings to help model generate valid tool calls - Add comprehensive edge routing rules to prevent overlapping lines - Add planning guidance for diagram creation - Update token count estimates in comments Co-authored-by: dayuan.jiang <jiangdy@amazon.co.jp>	2025-12-07 00:40:23 +09:00
Dayuan Jiang	2fac6323f0	fix: add orphaned mxPoint validation and cleanup (#130) - Add validation for orphaned mxPoint elements in validateMxCellStructure() - Add cleanup of orphaned mxPoint elements in convertToLegalXml() - Orphaned mxPoints cause 'Could not add object mxPoint' errors in draw.io - mxPoint elements must have 'as' attribute or be inside <Array as="points"> Co-authored-by: dayuan.jiang <jiangdy@amazon.co.jp>	2025-12-07 00:40:19 +09:00
Dayuan Jiang	a415c46b66	feat: improve XML search/replace matching strategies (#129) - Add 6th strategy: match by value attribute (label text) - Add 7th strategy: normalized whitespace match - Remove lastProcessedIndex tracking - always search from beginning - Pairs may not be in document order, so sequential tracking was unreliable Co-authored-by: dayuan.jiang <jiangdy@amazon.co.jp>	2025-12-07 00:40:16 +09:00
Dayuan Jiang	e893bd60f9	fix: resolve biome lint errors and memory leak in file preview (#118) - Disable noisy biome rules (noExplicitAny, useExhaustiveDependencies, etc.) - Fix memory leak in file-preview-list.tsx with useRef pattern - Separate unmount cleanup into dedicated useEffect - Add ToolPartLike interface for type safety in chat-message-display - Add accessibility attributes (role, tabIndex, onKeyDown) - Replace autoFocus with useEffect focus pattern - Minor syntax improvements (optional chaining, key fixes)	2025-12-06 16:18:26 +09:00
Dayuan Jiang	9aaf9bf31f	refactor: deduplicate system prompts with two-phase composition (#117)	2025-12-06 12:58:53 +09:00
Dayuan Jiang	150eb1ff63	chore: add Biome for formatting and linting (#116) - Add Biome as formatter and linter (replaces Prettier) - Configure Husky + lint-staged for pre-commit hooks - Add VS Code settings for format on save - Ignore components/ui/ (shadcn generated code) - Remove semicolons, use 4-space indent - Reformat all files to new style	2025-12-06 12:46:40 +09:00
Dayuan Jiang	e00938d9d3	feat: enhance system prompt with app context and dynamic model name (#114) - Add App Context section describing the left/right panel layout - Add App Features section with icon locations (history, theme, upload, export, clear) - Dynamically inject model name into system prompt via {{MODEL_NAME}} placeholder - Expand edit_diagram tool description with usage guidelines	2025-12-06 12:37:37 +09:00
Dayuan Jiang	ed29e32ba3	feat: restore Langfuse observability integration (#103) - Add lib/langfuse.ts with client, trace input/output, telemetry config - Add instrumentation.ts for OpenTelemetry setup with Langfuse span processor - Add /api/log-save endpoint for logging diagram saves - Add /api/log-feedback endpoint for thumbs up/down feedback - Update chat route with sessionId tracking and telemetry - Add feedback buttons (thumbs up/down) to chat messages - Add sessionId tracking throughout the app - Update env.example with Langfuse configuration - Add @langfuse/client, @langfuse/otel, @langfuse/tracing, @opentelemetry/sdk-trace-node	2025-12-05 21:15:02 +09:00
dayuan.jiang	2366255e8f	fix: use credential provider chain for bedrock IAM role support	2025-12-05 09:19:26 +09:00
dayuan.jiang	255308f829	fix: make bedrock credentials optional for IAM role support	2025-12-05 09:11:10 +09:00
dayuan.jiang	ff6f130f8a	refactor: remove Langfuse observability integration - Delete lib/langfuse.ts, instrumentation.ts - Remove API routes: log-save, log-feedback - Remove feedback buttons (thumbs up/down) from chat - Remove sessionId tracking throughout codebase - Remove @langfuse/*, @opentelemetry dependencies - Clean up env.example	2025-12-05 01:30:02 +09:00
dayuan.jiang	562751c913	fix: disable recordInputs to prevent Langfuse media upload timeout When images are included in chat messages, the AI SDK telemetry with recordInputs: true sends base64 image data to Langfuse. Langfuse then attempts to upload these images to media storage, causing 1m31s timeouts. Setting recordInputs: false prevents this while still capturing user text input via setTraceInput().	2025-12-05 01:14:01 +09:00
dayuan.jiang	46cbc3354c	fix: add manual token usage reporting to Langfuse for Bedrock streaming Bedrock streaming responses don't auto-report token usage to OpenTelemetry. This fix manually sets span attributes (ai.usage.promptTokens, gen_ai.usage.input_tokens) from the AI SDK onFinish callback to ensure Langfuse captures token counts.	2025-12-05 00:26:02 +09:00
dayuan.jiang	46d2d4e078	refactor: add input validation and singleton pattern for Langfuse API routes - Add Zod schema validation for log-feedback and log-save endpoints - Create singleton LangfuseClient to avoid per-request instantiation - Simplify log-save to only flag trace (no XML content sent) - Use generic error messages to prevent info leakage	2025-12-04 23:44:00 +09:00
dayuan.jiang	d8f2c85dab	feat: link user feedback and diagram saves to chat traces in Langfuse - Update log-feedback API to find existing chat trace by sessionId and attach score to it - Update log-save API to create span on existing chat trace instead of standalone trace - Add thumbs up/down feedback buttons on assistant messages - Add message regeneration and edit functionality - Add save dialog with format selection (drawio, png, svg) - Pass sessionId through components for Langfuse linking	2025-12-04 22:56:59 +09:00
Dayuan Jiang	5f4d31e708	fix: auto-detect AI provider from configured API keys (#74) - Remove default bedrock provider requirement - Auto-detect provider when only one API key is configured - Show helpful error when no keys or multiple keys without AI_PROVIDER - Fixes #73	2025-12-04 14:13:10 +09:00
Dayuan Jiang	3534cb13f7	refactor: extract system prompts and add extended prompt for Opus/Haiku 4.5 (#71) - Extract system prompts to dedicated lib/system-prompts.ts module - Add extended system prompt (~4000 tokens) for models with higher cache minimums (Opus 4.5, Haiku 4.5) - Clean up debug logs while preserving informational and cache-related logs - Improve code formatting and organization in chat route	2025-12-04 13:26:06 +09:00
Dayuan Jiang	9d9613a8d1	feat: add trace-level input/output to Langfuse observability (#69) * feat: add trace-level input/output to Langfuse observability - Add @langfuse/client and @langfuse/tracing dependencies - Wrap POST handler with observe() for proper tracing - Use updateActiveTrace() to set trace input, output, sessionId, userId - Filter Next.js HTTP spans in shouldExportSpan so AI SDK spans become root traces - Enable recordInputs/recordOutputs in experimental_telemetry * refactor: extract Langfuse logic to separate lib/langfuse.ts module	2025-12-04 11:24:26 +09:00
Dayuan Jiang	a8e627f1f8	feat: add XML structure guide to system prompt for smaller models (#51) - Add essential draw.io XML structure rules to system prompt - Include critical rules about mxCell nesting (all must be direct children of root) - Add shape/vertex and connector/edge examples with proper structure - Improve tool description for display_diagram with validation rules - Update xml_guide.md with better swimlane examples showing flat structure - Add client-side XML validation to catch nested mxCell errors early Helps address issues #40 (local Ollama models not working) and #39 (mxCell nesting errors)	2025-12-03 16:14:53 +09:00
dayuan.jiang	45ab934288	feat: add DeepSeek as AI provider - Install @ai-sdk/deepseek package - Add DeepSeek provider support to lib/ai-providers.ts - Add DeepSeek configuration to env.example - Update README.md with DeepSeek in provider list - Support both default and custom base URL for DeepSeek	2025-12-02 11:52:09 +09:00
Dan Zheng	d4fb635d98	fix: add customize anthropic baseURL (#28) * fix: add custom anthropic baseURL * feat: add baseURL support for all AI providers - Add GOOGLE_BASE_URL for Google Generative AI - Add AZURE_BASE_URL for Azure OpenAI - Add OLLAMA_BASE_URL support (was documented but not implemented) - Add OPENROUTER_BASE_URL for OpenRouter - Fix missing semicolon in Anthropic case - Update env.example with new environment variables Closes #20 --------- Co-authored-by: dayuan.jiang <jdy.toh@gmail.com>	2025-12-02 01:08:06 +09:00
Dayuan Jiang	5b31216917	feat: cache example prompt responses to save tokens (#34) - Add lib/cached-responses.ts with pre-generated XML for 4 example prompts - Modify chat API route to check cache before calling AI - Cache returns instant response (~0.26s) vs AI generation (~20-25s) - Add "(cached for instant response)" text to example panel - Cache only activates for first message with empty diagram	2025-12-01 14:07:50 +09:00
Dayuan Jiang	0d0d553e23	fix: correct anthropic beta header config for fine-grained tool streaming (#27) * fix: correct anthropic beta header config for fine-grained tool streaming - Use bedrock.anthropicBeta for Bedrock provider (not additionalModelRequestFields) - Use top-level headers for direct Anthropic API - Update @ai-sdk/amazon-bedrock to 3.0.62 - Add headers support to ModelConfig interface * fix: update @ai-sdk/amazon-bedrock to 3.0.62 for tool streaming support	2025-11-30 16:34:42 +09:00
ylxmf	d2dd501f3f	feat: support OpenAI compatible llm	2025-11-21 17:03:47 +08:00
dayuan.jiang	58dcb3c41a	feat: add OpenRouter support and fix input disabling - Add OpenRouter provider support with @openrouter/ai-sdk-provider - Fix input not disabling during 'submitted' state for fast providers - Apply disable logic to all interactive elements (textarea, buttons, handlers) - Clean up env.example by removing model examples and separator blocks - Upgrade zod to v4.1.12 for compatibility with ollama-ai-provider-v2 - Add debug logging for status changes in chat components	2025-11-15 14:29:18 +09:00
dayuan.jiang	4a3abc2e39	add multiple provider	2025-11-15 13:36:42 +09:00
dayuan.jiang	6940a5156d	refactor: improve diagram handling and error messaging in chat components	2025-11-10 11:27:25 +09:00
dayuan.jiang	de2a6938b1	feat: improve XML handling and edit_diagram tool - Add formatXML function to format single-line XML with proper indentation - Format chartXml after fetching to ensure consistency - Update replaceXMLParts to handle single-line XML with substring fallback - Improve edit_diagram tool guidance with SEARCH/REPLACE best practices - Add concrete examples to help AI use minimal, targeted edits	2025-08-31 20:52:04 +09:00
dayuan.jiang	13ace596d2	refactor: move extractDiagramXML function to utils and remove unused file	2025-03-27 06:45:38 +00:00
dayuan.jiang	5d152c66d5	fix: flash problem	2025-03-25 08:56:24 +00:00
dayuan.jiang	d2a630929b	refactor: chat-example-panel.tsx	2025-03-25 02:24:12 +00:00
dayuan.jiang	a27c94f798	feat: add XML handling in ChatPanel and utility function for legal XML conversion	2025-03-22 15:45:49 +00:00
dayuan.jiang	e26ef731e9	initialize project with Next.js, Tailwind CSS, and essential configurations	2025-03-19 06:04:06 +00:00
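
Note on commit 622829b903: the message describes a race in which checkTPMLimit reset the per-minute count while checking, overwriting the value saved by incrementTPMCount, and a fix in which the check only reads and the increment owns every write. The sketch below illustrates that read/write split under stated assumptions; it is not the project's code. The two function names come from the commit message, but their signatures, the storage key names, the KeyValueStore interface, and the MemoryStore class are hypothetical, introduced only so the example runs outside a browser.

```typescript
// Hypothetical sketch of the read/write split described in commit 622829b903.
// Signatures, key names, and the store abstraction are assumptions for illustration.

interface KeyValueStore {
    getItem(key: string): string | null
    setItem(key: string, value: string): void
}

const TPM_COUNT_KEY = "tpm-count" // assumed key name
const TPM_MINUTE_KEY = "tpm-minute" // assumed key name

// Minute bucket for a timestamp given in milliseconds.
function minuteOf(nowMs: number): number {
    return Math.floor(nowMs / 60_000)
}

// Parse a stored integer; missing or corrupted values fall back to 0 (NaN guard).
function readInt(store: KeyValueStore, key: string): number {
    const parsed = Number.parseInt(store.getItem(key) ?? "", 10)
    return Number.isNaN(parsed) ? 0 : parsed
}

// Read-only check: true while usage in the current minute is below the limit.
// It never writes, so it cannot clobber a count saved by incrementTPMCount.
function checkTPMLimit(
    store: KeyValueStore,
    limit: number,
    nowMs: number,
): boolean {
    const sameMinute = readInt(store, TPM_MINUTE_KEY) === minuteOf(nowMs)
    const used = sameMinute ? readInt(store, TPM_COUNT_KEY) : 0
    return used < limit
}

// The only writer: handles minute rollover, then adds the actual token usage.
function incrementTPMCount(
    store: KeyValueStore,
    tokens: number,
    nowMs: number,
): void {
    const minute = minuteOf(nowMs)
    const sameMinute = readInt(store, TPM_MINUTE_KEY) === minute
    const base = sameMinute ? readInt(store, TPM_COUNT_KEY) : 0
    store.setItem(TPM_MINUTE_KEY, String(minute))
    store.setItem(TPM_COUNT_KEY, String(base + tokens))
}

// In-memory stand-in for localStorage so the sketch is runnable anywhere.
class MemoryStore implements KeyValueStore {
    private data = new Map<string, string>()

    getItem(key: string): string | null {
        return this.data.get(key) ?? null
    }

    setItem(key: string, value: string): void {
        this.data.set(key, value)
    }
}
```

In a browser, window.localStorage satisfies the same two-method interface, so it could be passed in place of MemoryStore. Passing the clock in as nowMs is a choice made here to keep the rollover logic testable; the commit message does not say how the project reads the time.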


90 Commits