* feat: add daily token limit with actual usage tracking
- Add DAILY_TOKEN_LIMIT env var for configurable daily token limit
- Track actual tokens from Bedrock API response metadata (not estimates)
- Server sends inputTokens + cachedInputTokens + outputTokens via messageMetadata
- Client increments token count in onFinish callback with actual usage
- Add NaN guards to prevent corrupted localStorage values
- Add token limit toast notification with quota display
- Remove client-side token estimation (was blocking legitimate requests)
- Switch to js-tiktoken for client compatibility (pure JS, no WASM)
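A minimal sketch of the client-side tracking described above, with hypothetical localStorage keys (`tokenDate`, `tokenCount`) and a simplified usage shape:

```ts
// Hypothetical sketch: a daily token counter in localStorage, incremented
// in onFinish with the actual usage reported by the server
type Usage = { inputTokens?: number; cachedInputTokens?: number; outputTokens?: number }

function readDailyTokens(): number {
    const today = new Date().toISOString().slice(0, 10)
    if (localStorage.getItem("tokenDate") !== today) {
        // New day: reset the counter
        localStorage.setItem("tokenDate", today)
        localStorage.setItem("tokenCount", "0")
        return 0
    }
    const count = Number(localStorage.getItem("tokenCount"))
    // NaN guard against corrupted localStorage values
    return Number.isFinite(count) ? count : 0
}

function incrementDailyTokens(usage: Usage) {
    const used = (usage.inputTokens ?? 0) + (usage.cachedInputTokens ?? 0) + (usage.outputTokens ?? 0)
    localStorage.setItem("tokenCount", String(readDailyTokens() + used))
}
```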
* feat: add TPM (tokens per minute) rate limiting
- Add 50k tokens/min client-side rate limit
- Track tokens per minute with automatic minute rollover
- Check TPM limit after daily limits pass
- Show toast when rate limit reached
- NaN guards for localStorage values
* feat: make TPM limit configurable via TPM_LIMIT env var
* chore: restore cache debug logs
* fix: prevent race condition in TPM tracking
checkTPMLimit was resetting the TPM count to 0 when checking, which
overwrote the count saved by incrementTPMCount. Now checkTPMLimit
only reads, and incrementTPMCount handles all writes (sketched below).
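A rough sketch of the fixed split, assuming illustrative localStorage keys and a configurable limit:

```ts
// Hypothetical sketch: checkTPMLimit is read-only; incrementTPMCount owns
// all writes, including the minute rollover
function currentMinute(): number {
    return Math.floor(Date.now() / 60_000)
}

function checkTPMLimit(limit: number): boolean {
    // A stale minute simply means the effective count is 0; no write needed
    if (Number(localStorage.getItem("tpmMinute")) !== currentMinute()) return true
    const count = Number(localStorage.getItem("tpmCount"))
    return !Number.isFinite(count) || count < limit // NaN guard
}

function incrementTPMCount(tokens: number) {
    const minute = currentMinute()
    const prev = Number(localStorage.getItem("tpmMinute")) === minute
        ? Number(localStorage.getItem("tpmCount"))
        : 0 // minute rolled over, start fresh
    const safePrev = Number.isFinite(prev) ? prev : 0
    localStorage.setItem("tpmMinute", String(minute))
    localStorage.setItem("tpmCount", String(safePrev + tokens))
}
```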
* chore: improve TPM limit error message clarity
- Add DAILY_REQUEST_LIMIT env var support in config API
- Track request count in localStorage (resets daily)
- Show friendly quota limit toast with self-host/sponsor links
- Apply limit to send, regenerate, and edit message actions
- Fix bug where text after tool calls was merged with initial text
- Group consecutive text/file parts into bubbles while keeping tools in order
- Parts now display as: plan -> tool_result -> additional text
- Remove debug logs from fixToolCallInputs function
Co-authored-by: dayuan.jiang <jiangdy@amazon.co.jp>
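A sketch of the grouping pass, assuming a simplified part shape:

```ts
// Hypothetical sketch: merge consecutive text/file parts into one bubble,
// while tool parts stay as their own groups so ordering is preserved
type Part = { type: string }

function groupParts(parts: Part[]): Part[][] {
    const isBubble = (p: Part) => p.type === "text" || p.type === "file"
    const groups: Part[][] = []
    for (const part of parts) {
        const last = groups.at(-1)
        if (isBubble(part) && last && isBubble(last[0])) {
            last.push(part) // extend the current text/file bubble
        } else {
            groups.push([part]) // a tool part or a new bubble starts a group
        }
    }
    return groups
}
```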
- Add client-side cache check in onFormSubmit to bypass API calls for example prompts
- Use findCachedResponse to match input against cached examples
- Directly set messages with cached tool response when example matches
- Hide regenerate button for cached example responses (toolCallId starts with 'cached-')
- Prevents unnecessary API calls when using example buttons
Co-authored-by: dayuan.jiang <jiangdy@amazon.co.jp>
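A simplified sketch of the short-circuit; the message shape is illustrative, and `findCachedResponse`/`setMessages`/`sendMessage` stand in for the real helpers:

```ts
// Assumed helpers: cache lookup and useChat setters from the surrounding component
declare function findCachedResponse(input: string): { xml: string } | undefined
declare function setMessages(messages: unknown[]): void
declare function sendMessage(message: { text: string }): void

// Hypothetical sketch: serve a cached tool response directly instead of
// calling the chat API when the input matches an example prompt
function onFormSubmit(input: string) {
    const cached = findCachedResponse(input)
    if (!cached) {
        sendMessage({ text: input }) // normal API path
        return
    }
    setMessages([
        { id: crypto.randomUUID(), role: "user", parts: [{ type: "text", text: input }] },
        {
            id: crypto.randomUUID(),
            role: "assistant",
            parts: [{
                type: "tool-display_diagram",
                toolCallId: `cached-${Date.now()}`, // "cached-" prefix hides regenerate
                state: "output-available",
                input: { xml: cached.xml },
            }],
        },
    ])
}
```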
- Add validation to loadDiagram in diagram-context; it returns an error message or null
- display_diagram and edit_diagram tools now check validation result
- Return error to AI agent with state: output-error so it can retry
- Skip validation for trusted sources (localStorage, history, internal templates)
- Add debug logging for tool call inputs to diagnose Bedrock API issues
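A sketch of the flow under assumed helper names (`validateDiagramXml`, `applyDiagram`):

```ts
// Assumed helpers from the surrounding module
declare function validateDiagramXml(xml: string): string | null
declare function applyDiagram(xml: string): void

// Hypothetical sketch: validate untrusted XML before loading; the caller
// maps a returned error to a tool result with state: "output-error"
function loadDiagram(xml: string, trusted = false): string | null {
    if (!trusted) {
        const error = validateDiagramXml(xml) // returns string | null
        if (error) return error // surfaced to the AI agent so it can retry
    }
    applyDiagram(xml) // load into the draw.io renderer
    return null // null means the diagram loaded cleanly
}
```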
* fix: remove hardcoded temperature parameter to support reasoning models
* feat: make temperature configurable via AI_TEMPERATURE env var
- Instead of removing temperature entirely, make it optional via env var
- Set AI_TEMPERATURE=0 for deterministic output (recommended for diagrams)
- Leave unset for models that don't support temperature (e.g., GPT-5.1 reasoning)
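A sketch of the route-side handling, assuming `model` and `messages` are already in scope in the chat route:

```ts
import { streamText } from "ai"

// Hypothetical sketch: only pass temperature when the env var is set, so
// models that reject the parameter (e.g. reasoning models) never receive it
const raw = process.env.AI_TEMPERATURE
const temperature = raw ? Number(raw) : undefined // "0" is truthy as a string

const result = streamText({
    // model and messages assumed to be resolved elsewhere in the route
    model,
    messages,
    ...(temperature !== undefined && Number.isFinite(temperature) ? { temperature } : {}),
})
```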
* docs: add AI_TEMPERATURE env var documentation
- Update env.example with AI_TEMPERATURE option
- Update README.md configuration section
- Add Temperature Setting section in ai-providers.md
* docs: add TEMPERATURE env var documentation
- Update env.example with TEMPERATURE option
- Update README.md, README_CN.md, README_JA.md configuration sections
- Add Temperature Setting section in ai-providers.md
- Update route.ts to use TEMPERATURE env var
---------
Co-authored-by: dayuan.jiang <jiangdy@amazon.co.jp>
- Add fixToolCallInputs() to satisfy the Bedrock API requirement that tool inputs be JSON objects, not strings (sketched below)
- Add experimental_repairToolCall for malformed JSON from model
- Add stepCountIs(5) limit to prevent infinite loops
- Update edit_diagram tool description with JSON escaping warning
Co-authored-by: dayuan.jiang <jiangdy@amazon.co.jp>
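A minimal sketch of the repair, assuming a simplified tool-call shape:

```ts
// Hypothetical sketch: re-parse tool inputs that arrived as JSON strings
// into real objects before forwarding them to Bedrock
type ToolCall = { toolName: string; input: unknown }

function fixToolCallInputs<T extends ToolCall>(toolCalls: T[]): T[] {
    return toolCalls.map((call) => {
        if (typeof call.input !== "string") return call
        try {
            return { ...call, input: JSON.parse(call.input) }
        } catch {
            // Leave malformed JSON for experimental_repairToolCall to handle
            return call
        }
    })
}
```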
- Add Biome as formatter and linter (replaces Prettier)
- Configure Husky + lint-staged for pre-commit hooks
- Add VS Code settings for format on save
- Ignore components/ui/ (shadcn generated code)
- Remove semicolons, use 4-space indent
- Reformat all files to new style
- Add App Context section describing the left/right panel layout
- Add App Features section with icon locations (history, theme, upload, export, clear)
- Dynamically inject model name into system prompt via {{MODEL_NAME}} placeholder
- Expand edit_diagram tool description with usage guidelines
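The injection itself is a simple template substitution; a sketch with assumed names:

```ts
declare const systemPromptTemplate: string // assumed prompt containing {{MODEL_NAME}}
declare const modelName: string // assumed active model id

// Hypothetical sketch: substitute the active model id into the prompt template
const systemPrompt = systemPromptTemplate.replaceAll("{{MODEL_NAME}}", modelName)
```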
- Remove 15s streaming timeout detection (too slow, added complexity)
- Remove status indicator (issue resolved by switching model)
- Remove streamingError state and related refs
- Simplify onFinish callback (remove 503 detection logging)
- Remove errorHandler function (use default AI SDK errors)
The real fix was switching from global.* to us.* Bedrock model.
This removes ~134 lines of unnecessary complexity.
Bedrock streaming responses don't auto-report token usage to OpenTelemetry.
This fix manually sets span attributes (ai.usage.promptTokens, gen_ai.usage.input_tokens)
from the AI SDK onFinish callback to ensure Langfuse captures token counts.
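A sketch of the workaround in the streamText options (attribute names follow the AI SDK and gen_ai OpenTelemetry conventions; which span is active inside onFinish is an assumption):

```ts
import { trace } from "@opentelemetry/api"
import { streamText } from "ai"

const result = streamText({
    // ...model, messages, and tools as in the real route
    onFinish: ({ usage }) => {
        // Mirror the AI SDK usage onto the active span so Langfuse sees token
        // counts that Bedrock streaming doesn't report automatically
        trace.getActiveSpan()?.setAttributes({
            "ai.usage.promptTokens": usage.inputTokens ?? 0,
            "ai.usage.completionTokens": usage.outputTokens ?? 0,
            "gen_ai.usage.input_tokens": usage.inputTokens ?? 0,
            "gen_ai.usage.output_tokens": usage.outputTokens ?? 0,
        })
    },
})
```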
- Add Zod schema validation for log-feedback and log-save endpoints
- Create singleton LangfuseClient to avoid per-request instantiation
- Simplify log-save to only flag trace (no XML content sent)
- Use generic error messages to prevent info leakage
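A sketch of the singleton, assuming credentials come from the standard Langfuse env vars:

```ts
import { LangfuseClient } from "@langfuse/client"

// Hypothetical sketch: one shared client per server process instead of a
// new instance on every request
let client: LangfuseClient | undefined

export function getLangfuseClient(): LangfuseClient {
    client ??= new LangfuseClient() // picks up LANGFUSE_* env vars
    return client
}
```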
- Update log-feedback API to find existing chat trace by sessionId and attach score to it
- Update log-save API to create span on existing chat trace instead of standalone trace
- Add thumbs up/down feedback buttons on assistant messages
- Add message regeneration and edit functionality
- Add save dialog with format selection (drawio, png, svg)
- Pass sessionId through components for Langfuse linking
- Extract system prompts to dedicated lib/system-prompts.ts module
- Add extended system prompt (~4000 tokens) for models with higher cache minimums (Opus 4.5, Haiku 4.5)
- Clean up debug logs while preserving informational and cache-related logs
- Improve code formatting and organization in chat route
* feat: add trace-level input/output to Langfuse observability
- Add @langfuse/client and @langfuse/tracing dependencies
- Wrap POST handler with observe() for proper tracing
- Use updateActiveTrace() to set trace input, output, sessionId, userId
- Filter Next.js HTTP spans in shouldExportSpan so AI SDK spans become root traces
- Enable recordInputs/recordOutputs in experimental_telemetry
* refactor: extract Langfuse logic to separate lib/langfuse.ts module
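A rough sketch of the wrapper shape (the handler body and helper names are illustrative):

```ts
import { observe, updateActiveTrace } from "@langfuse/tracing"

declare function runChat(messages: unknown): Promise<unknown> // assumed wrapper around streamText

// Hypothetical sketch: observe() makes the handler a traced root, and
// updateActiveTrace() attaches input/output plus session metadata
async function handler(req: Request): Promise<Response> {
    const { messages, sessionId, userId } = await req.json()
    updateActiveTrace({ input: messages, sessionId, userId })
    const output = await runChat(messages)
    updateActiveTrace({ output })
    return Response.json(output)
}

export const POST = observe(handler, { name: "chat" })
```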
When models like DeepSeek (deepseek-chat, deepseek-reasoner) receive image
inputs, they return a cryptic error about 'unknown variant image_url'.
This change detects such errors and shows a clear message asking users
to remove the image or switch to a vision-capable model.
Fixes #42
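A sketch of the detection, matching on the provider's error text:

```ts
// Hypothetical sketch: map the provider's cryptic error to a clear message
function mapImageSupportError(error: unknown): string | null {
    const message = error instanceof Error ? error.message : String(error)
    if (message.includes("unknown variant") && message.includes("image_url")) {
        return "This model can't process images. Remove the image or switch to a vision-capable model."
    }
    return null // not an image-support error; fall back to default handling
}
```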
- Add essential draw.io XML structure rules to system prompt
- Include critical rules about mxCell nesting (all must be direct children of root)
- Add shape/vertex and connector/edge examples with proper structure
- Improve tool description for display_diagram with validation rules
- Update xml_guide.md with better swimlane examples showing flat structure
- Add client-side XML validation to catch nested mxCell errors early
Helps address issues #40 (local Ollama models not working) and #39 (mxCell nesting errors)
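A sketch of the client-side structural check using the browser's DOMParser:

```ts
// Hypothetical sketch: every mxCell must be a direct child of <root>, so a
// parent mxCell element means the model produced invalid nesting
function findNestedMxCell(xml: string): string | null {
    const doc = new DOMParser().parseFromString(xml, "text/xml")
    for (const cell of Array.from(doc.getElementsByTagName("mxCell"))) {
        if (cell.parentElement?.tagName === "mxCell") {
            const id = cell.getAttribute("id") ?? "?"
            return `mxCell "${id}" is nested inside another mxCell; all cells must be direct children of <root>`
        }
    }
    return null // structure is valid
}
```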
- Add lib/cached-responses.ts with pre-generated XML for 4 example prompts
- Modify chat API route to check cache before calling AI
- Cache returns instant response (~0.26s) vs AI generation (~20-25s)
- Add "(cached for instant response)" text to example panel
- Cache only activates for first message with empty diagram
* feat: add Bedrock prompt caching for system and conversation messages
- Add cache point to system message (2558+ tokens cached)
- Add cache point to last assistant message in conversation history
- This caches the entire conversation prefix for subsequent requests
- Reduces latency and costs for multi-turn conversations
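A sketch using the AI SDK's Bedrock cache points via providerOptions (the exact placement in the real route may differ):

```ts
import type { ModelMessage } from "ai"

// Hypothetical sketch: mark the system prompt and the last assistant turn
// as Bedrock cache points so the whole prefix is cached between requests
const cachePoint = { bedrock: { cachePoint: { type: "default" as const } } }

function addCachePoints(systemPrompt: string, history: ModelMessage[]): ModelMessage[] {
    const system: ModelMessage = {
        role: "system",
        content: systemPrompt,
        providerOptions: cachePoint,
    }
    const lastAssistant = history.findLast((m) => m.role === "assistant")
    if (lastAssistant) {
        lastAssistant.providerOptions = { ...lastAssistant.providerOptions, ...cachePoint }
    }
    return [system, ...history]
}
```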
* refactor: remove duplicated system prompt
* fix: filter out messages with empty content arrays for Bedrock API
The convertToModelMessages function from the AI SDK can produce messages with
empty content arrays when assistant messages have only tool call parts or
when tool results aren't properly converted. Bedrock API rejects these with
400 errors. This fix filters out invalid messages before sending to the API.
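A sketch of the filter in the chat route:

```ts
import { convertToModelMessages, type UIMessage } from "ai"

declare const uiMessages: UIMessage[] // assumed request body messages

// Hypothetical sketch: drop messages whose content array ends up empty,
// since Bedrock rejects them with a 400 error
const modelMessages = convertToModelMessages(uiMessages).filter(
    (message) => !(Array.isArray(message.content) && message.content.length === 0),
)
```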
* fix: add diagnostic logging for empty message content
Added logging to capture the original UI message structure when empty content
is detected after conversion. This helps debug the root cause while the
filter provides a safety net for Bedrock API compatibility.
* fix: correct anthropic beta header config for fine-grained tool streaming
- Use bedrock.anthropicBeta for Bedrock provider (not additionalModelRequestFields)
- Use top-level headers for direct Anthropic API
- Update @ai-sdk/amazon-bedrock to 3.0.62
- Add headers support to ModelConfig interface
* fix: update @ai-sdk/amazon-bedrock to 3.0.62 for tool streaming support
- Add Examples section to README with 2-column grid layout
- Include demo images for GCP, AWS, Azure, animated connectors, and a cat
- Update example panel buttons with clearer labels
- Add animated connector example button
- Add instruction for animated connectors in chat route
- Update system prompt to allow up to 3 retry attempts with adjusted search patterns
- Simplify error response to provide the current diagram XML and reference the retry policy
- AI model self-manages retries based on system instructions
- Add formatXML function to format single-line XML with proper indentation
- Format chartXml after fetching to ensure consistency
- Update replaceXMLParts to handle single-line XML with substring fallback
- Improve edit_diagram tool guidance with SEARCH/REPLACE best practices
- Add concrete examples to help AI use minimal, targeted edits
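A sketch of a formatXML along these lines (the real implementation may handle more edge cases):

```ts
// Hypothetical sketch: break single-line XML on tag boundaries and re-indent,
// so SEARCH/REPLACE edits can match against stable, formatted lines
function formatXML(xml: string): string {
    const tokens = xml.replace(/>\s*</g, ">\n<").split("\n")
    const lines: string[] = []
    let depth = 0
    for (const token of tokens) {
        const isClosing = token.startsWith("</")
        const isLeaf = token.endsWith("/>") || token.startsWith("<?") || token.startsWith("<!")
        const closesInline = token.includes("</") && !isClosing // e.g. <tag>text</tag>
        if (isClosing) depth = Math.max(0, depth - 1)
        lines.push("    ".repeat(depth) + token.trim())
        if (!isClosing && !isLeaf && !closesInline) depth += 1
    }
    return lines.join("\n")
}
```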