Memoh

mirror of https://github.com/memohai/Memoh.git synced 2026-04-25 07:00:48 +09:00

Author	SHA1	Message	Date
Acbox	8136ef6ed6	feat(usage): add per-call token usage records table Expose a paginated endpoint and UI table that lists individual LLM call records (assistant messages with usage) per bot, showing time, session type, model, provider, and token counts. Respects existing date / model / session-type filters and adds full-height loaders plus a max-width layout to keep the usage page consistent with other top-level pages.	2026-04-24 15:05:53 +08:00
Acbox	473d559042	feat(channel): structured tool-call IM display with edit-in-place Introduce a new `show_tool_calls_in_im` bot setting plus a full overhaul of how tool calls are surfaced in IM channels: - Add per-bot setting + migration (0072) and expose through settings API / handlers / frontend SDK. - Introduce a `toolCallDroppingStream` wrapper that filters tool_call_* events when the setting is off, keeping the rest of the stream intact. - Add a shared `ToolCallPresentation` model (Header / Body blocks / Footer) with plain and Markdown renderers, and a per-tool formatter registry that produces rich output (e.g. `web_search` link lists, `list` directory previews, `exec` stdout/stderr tails) instead of raw JSON dumps. - High-capability adapters (Telegram, Feishu, Matrix, Slack, Discord) now flush pre-text and then send ONE tool-call message per call, editing it in-place from `running` to `completed` / `failed`; mapping from callID to platform message ID is tracked per stream, with a fallback to a new message if the edit fails. Low-capability adapters (WeCom, QQ, DingTalk) keep posting a single final message, but now benefit from the same rich per-tool formatting. - Suppress the early duplicate `EventToolCallStart` (from `sdk.ToolInputStartPart`) so that the SDK's final `StreamToolCallPart` remains the single source of truth for tool call start, preventing duplicated "running" bubbles in IM. - Stop auto-populating `InputSummary` / `ResultSummary` after a per-tool formatter runs, which previously leaked the raw JSON result as a fallback footer underneath the formatted body. Add regression tests for the formatters, the Markdown renderer, the edit-in-place flow on Telegram/Matrix, and the JSON-leak guard on `list`.	2026-04-23 20:49:44 +08:00
Acbox	925fdee478	feat: transcription support (#394 ) * feat: expand speech provider support with new client types and configuration schema * feat: add icon support for speech providers and update related configurations * feat: add SVG support for Deepgram and Elevenlabs with Vue components * feat: except -speech client type in llm provider feat: enhance speech provider functionality with advanced settings and model import capabilities * chore: remove go.mod replace * feat: enhance speech provider functionality with advanced settings and model import capabilities * chore: update go module dependencies * feat: Ear and Mouth * fix: separate ear/mouth page * fix: separate audio domain and restore transcription templates Move speech and transcription internals into the audio domain, restore template-driven transcription providers, and regenerate Swagger/SDK so the frontend can stop hand-calling /transcription-* APIs. --------- Co-authored-by: aki <arisu@ieee.org>	2026-04-22 00:12:01 +08:00
Acbox	fd8f1ec078	Revert "Feat/speech support (#392 )" (#393 ) This reverts commit `c9dcfe287f`.	2026-04-22 00:11:16 +08:00
Acbox	c9dcfe287f	Feat/speech support (#392 ) * feat: expand speech provider support with new client types and configuration schema * feat: add icon support for speech providers and update related configurations * feat: add SVG support for Deepgram and Elevenlabs with Vue components * feat: except -speech client type in llm provider feat: enhance speech provider functionality with advanced settings and model import capabilities * chore: remove go.mod replace * feat: enhance speech provider functionality with advanced settings and model import capabilities * chore: update go module dependencies * feat: Ear and Mouth * fix: separate ear/mouth page * fix: separate audio domain and restore transcription templates Move speech and transcription internals into the audio domain, restore template-driven transcription providers, and regenerate Swagger/SDK so the frontend can stop hand-calling /transcription-* APIs. --------- Co-authored-by: aki <arisu@ieee.org>	2026-04-22 00:09:46 +08:00
Yiming Qi	8d78925a23	feat: expand speech provider support with new client types and config… (#389 ) * feat: expand speech provider support with new client types and configuration schema * feat: add icon support for speech providers and update related configurations * feat: add SVG support for Deepgram and Elevenlabs with Vue components * feat: except -speech client type in llm provider feat: enhance speech provider functionality with advanced settings and model import capabilities * chore: remove go.mod replace * feat: enhance speech provider functionality with advanced settings and model import capabilities * chore: update go module dependencies --------- Co-authored-by: Acbox <acbox0328@gmail.com>	2026-04-19 21:58:16 +08:00
Acbox	6328281fc2	fix: enforce speech/LLM isolation in providers and models SQL queries (CountProviders, CountModels, ListModels, ListEnabledModels, ListModelsByProviderID) now exclude speech types. Added IsLLMClientType guard to prevent cross-domain queries via /models?client_type and /providers/:id/import-models. Frontend provider forms no longer offer edge-speech as a client type option. Also fixed pre-existing SA5011 staticcheck warnings in proxy_test.go and executor_test.go.	2026-04-14 21:07:27 +08:00
Acbox	84f1d0612a	refactor: replace context_token_budget with model context_window for context trimming The per-bot context_token_budget column was unused (no frontend UI) and has been removed. Context trimming now derives the budget from the chat model's context_window setting, which is already configured per model.	2026-04-14 21:04:42 +08:00
BBQ	60517bc2a6	feat(acl): add bot security policy presets Initialize new bots with preset ACL templates and an allow-by-default fallback so common access setups can be selected during bot creation instead of being configured manually afterward.	2026-04-14 05:51:38 +08:00
BBQ	447d647aef	fix(sqlc): cast optional scalar settings args (#370 ) Add explicit scalar casts for optional timezone and context token budget fields so sqlc can parse UpsertBotSettings again. Regenerating sqlc also syncs the stale generated bot/query structs that drifted from the schema.	2026-04-14 04:36:44 +08:00
LiBr	df8fbd8859	feat(provider): add github copilot device flow provider (#364 )	2026-04-13 19:38:33 +08:00
KasuganoSora	a40207ab6d	feat: Misskey channel adapter, agent reliability hardening & stream error resilience (#359 )	2026-04-13 17:10:50 +08:00
Acbox Liu	8d5c38f0e5	refactor: unify providers and models tables (#338 ) * refactor: unify providers and models tables - Rename `llm_providers` → `providers`, `llm_provider_oauth_tokens` → `provider_oauth_tokens` - Remove `tts_providers` and `tts_models` tables; speech models now live in the unified `models` table with `type = 'speech'` - Replace top-level `api_key`/`base_url` columns with a JSONB `config` field on `providers` - Rename `llm_provider_id` → `provider_id` across all references - Add `edge-speech` client type and `conf/providers/edge.yaml` default provider - Create new read-only speech endpoints (`/speech-providers`, `/speech-models`) backed by filtered views of the unified tables - Remove old TTS CRUD handlers; simplify speech page to read-only + test - Update registry loader to skip malformed YAML files instead of failing entirely - Fix YAML quoting for model names containing colons in openrouter.yaml - Regenerate sqlc, swagger, and TypeScript SDK * fix: exclude speech providers from providers list endpoint ListProviders now filters out client_type matching '%-speech' so Edge and future speech providers no longer appear on the Providers page. ListSpeechProviders uses the same pattern match instead of hard-coding 'edge-speech'. * fix: use explicit client_type list instead of LIKE pattern Replace '%-speech' pattern with explicit IN ('edge-speech') for both ListProviders (exclusion) and ListSpeechProviders (inclusion). New speech client types must be added to both queries. * fix: use EXECUTE for dynamic SQL in migrations referencing old schema PL/pgSQL pre-validates column/table references in static SQL statements inside DO blocks before evaluating IF/RETURN guards. This caused migrations 0010-0061 to fail on fresh databases where the canonical schema uses `providers`/`provider_id` instead of `llm_providers`/ `llm_provider_id`. Wrap all SQL that references potentially non-existent old schema objects (llm_providers, llm_provider_id, tts_providers, tts_models, etc.) in EXECUTE strings so they are only parsed at runtime when actually reached. * fix: revert canonical schema to use llm_providers for migration compatibility The CI migrations workflow (up → down → up) failed because 0061 down renames `providers` back to `llm_providers`, but 0001 down only dropped `providers` — leaving `llm_providers` as a remnant. On the second migrate up, 0010 found the stale `llm_providers` and tried to reference `models.llm_provider_id` which no longer existed. Revert 0001 canonical schema to use original names (llm_providers, tts_providers, tts_models) so incremental migrations work naturally and 0061 handles the final rename. Remove EXECUTE wrappers and unnecessary guards from migrations that now always operate on llm_providers. * fix: icons * fix: sync canonical schema with 0061 migration to fix sqlc column mismatch 0001_init.up.sql still used old names (llm_providers, llm_provider_id) and included dropped tts_providers/tts_models tables. sqlc could not parse the PL/pgSQL EXECUTE in migration 0061, so generated code retained stale columns (input_modalities, supports_reasoning) causing runtime "column does not exist" errors when adding models. - Update 0001_init.up.sql to current schema (providers, provider_id, no tts tables, add provider_oauth_tokens) - Use ALTER TABLE IF EXISTS in 0010/0041/0042 for backward compat - Regenerate sqlc * fix: guard all legacy migrations against fresh schema for CI compat On fresh databases, 0001_init.up.sql creates providers/provider_id (not llm_providers/llm_provider_id). Migrations 0013, 0041, 0046, 0047 referenced the old names without guards, causing CI migration failures. - 0013: check llm_provider_id column exists before adding old constraint - 0041: check llm_providers table exists before backfill/constraint DDL - 0046: wrap CREATE TABLE in DO block with llm_providers existence check - 0047: use ALTER TABLE IF EXISTS + DO block guard	2026-04-08 01:03:44 +08:00
Acbox Liu	43c4153938	feat: introduce DCP pipeline layer for unified context assembly (#329 ) * refactor: introduce DCP pipeline layer for unified context assembly Introduce a Deterministic Context Pipeline (DCP) inspired by Cahciua, providing event-driven context assembly for LLM conversations. - Add `internal/pipeline/` package with Canonical Event types, Projection (reduce), Rendering (XML RC), Pipeline manager, and EventStore persistence - Change user message format from YAML front-matter to XML `<message>` tags with self-contained attributes (sender, channel, conversation, type) - Merge CLI/Web dual API into single `/local/` endpoint, remove CLI handler - Add `bot_session_events` table for event persistence and cold-start replay - Add `discuss` session type (reserved for future Cahciua-style mode) - Wire pipeline into HandleInbound: adapt → persist → push on every message - Lazy cold-start replay: load events from DB on first session access * feat: implement discuss mode with reactive driver and probe gate Add discuss session mode where the bot autonomously decides when to speak in group chats via tool-gated output (send tool only, no direct text reply). - Add discuss driver (per-session goroutine, RC watch, step loop via agent.Generate, TR persistence, late-binding prompt with mention hints) - Add system_discuss.md prompt template ("text = inner monologue, send = speak") - Add context composition (MergeContext, ComposeContext, TrimContext) for RC + assistant/tool message interleaving by timestamp - Add probe gate: when discuss_probe_model_id is set, cheap model pre-filters group messages; no tool calls = silence, tool calls = activate primary - Add /new [chat\|discuss] command: explicit mode selection, defaults to discuss in groups, chat in DMs, chat-only for WebUI - Add ResolveRunConfig on flow.Resolver for discuss driver to reuse model/tools/system-prompt resolution without reimplementing - Fix send tool for discuss mode: same-conversation sends now go through SendDirect (channel adapter) instead of the local emitter shortcut - Add target attribute to XML message format (reply_target for routing) - Add discuss_probe_model_id to bots table settings - Remove pipeline compaction (SetCompactCursor) — reuse existing compaction.Service - Persist full SDK messages (including tool calls) in discuss mode * refactor: unify DCP event layer, fix persistence and local channel - Fix bot_session_events dedup index to include event_kind so that message + edit events for the same external_message_id coexist. - Change CreateSessionEvent from :one to :exec so ON CONFLICT DO NOTHING does not produce spurious errors on duplicate delivery. - Move ACL evaluation before event ingest; denied messages no longer enter bot_session_events or the in-memory pipeline. - Let chat mode consume RenderedContext from the DCP pipeline when available, sharing the same event-driven context assembly as discuss. - Collapse local WebSocket handler to route through HandleInbound instead of directly calling StreamChatWS, eliminating the dual business entry point. - Extract buildBaseRunConfig shared builder so resolve() and ResolveRunConfig() no longer duplicate model/credentials/skills setup. - Add StoreRound to RunConfigResolver interface so discuss driver persists assistant output with full metadata, usage, and memory extraction (same quality as chat mode). - Fix discuss driver context: use context.Background() instead of the short-lived HTTP request context that was getting cancelled. - Fix model ID passed to StoreRound: return database UUID from ResolveRunConfig instead of SDK model name. - Remove dead CLIAdapter/CLIType and update legacy web/cli references in tests and comments. * fix: stop idle discuss goroutines after 10min timeout Discuss session goroutines were never cleaned up when a session became inactive (e.g. after /new). Add a 10-minute idle timer that auto-exits the goroutine and removes it from the sessions map when no new RC arrives. * refactor: pipeline details — event types, structured reply, display content - Remove [User sent N attachments] placeholder text from buildInboundQuery; attachment info is now expressed via pipeline <attachment> tags. - Unify in-reply-to as structured ReplyRef (Sender/Preview fields) across Telegram, Discord, Feishu, and Matrix adapters instead of prepending [Reply to ...] text into the message body. Remove now-unused buildTelegramQuotedText, buildDiscordQuotedText, buildMatrixQuotedText. - Make AdaptInbound return CanonicalEvent interface and dispatch to adaptMessage/adaptEdit/adaptService based on metadata["event_type"]. - Add event_id column to bot_history_messages (migration 0059) so user messages can reference their canonical pipeline event. - PersistEvent now returns the event UUID; HandleInbound passes it through to both persistPassiveMessage and ChatRequest.EventID for storeRound. - Add FillDisplayContent to message service: extracts plain text from event_data for clean frontend display. - Frontend extractMessageText prefers display_content when available, falling back to legacy strip logic for old messages. - Fix: always generate headerifiedQuery for storage even when usePipeline is true, so user messages are persisted via storeRound in chat mode. * fix: use json.Marshal for pipeline context content serialization The manual string escaping in buildMessagesFromPipeline only handled double quotes but not newlines, backslashes, and other JSON special characters, producing invalid json.RawMessage values. The LLM then received empty/malformed context and complained about having no history. * fix: restore WebSocket handler to use StreamChatWS directly The previous refactoring replaced the WS handler with HandleInbound + RouteHub subscription, which broke streaming because RouteHub events use a different format (channel.StreamEvent) than what the frontend expects (flow.WSStreamEvent with text_delta, tool_call_start, etc.). Restore the original direct StreamChatWS call path so WebUI streaming works again. The WS handler now matches the pre-refactoring behavior while all other changes (pipeline, ACL, event types, etc.) are kept. * feat: store display_text directly in bot_history_messages Instead of computing display content at API response time by querying bot_session_events via event_id, store the raw user text in a dedicated display_text column at write time. This works for all paths including the WebSocket handler which does not go through the pipeline/event layer. - Migration 0060: add display_text TEXT column - PersistInput gains DisplayText; filled from trimmedText (passive) and req.Query (storeRound) - toMessageFields reads display_text into DisplayContent - Remove FillDisplayContent runtime query and ListSessionEventsByEventID - Frontend already prefers display_content when available (no change) * fix: display_text should contain raw user text, not XML-wrapped query req.Query gets overwritten to headerifiedQuery (with XML <message> tags) before storeRound runs. Add RawQuery field to ChatRequest to preserve the original user text, and use it for display_text in storeMessages. * fix(web): show discuss sessions * refactor: introduce DCP pipeline layer for unified context assembly Introduce a Deterministic Context Pipeline (DCP) inspired by Cahciua, providing event-driven context assembly for LLM conversations. - Add `internal/pipeline/` package with Canonical Event types, Projection (reduce), Rendering (XML RC), Pipeline manager, and EventStore persistence - Change user message format from YAML front-matter to XML `<message>` tags with self-contained attributes (sender, channel, conversation, type) - Merge CLI/Web dual API into single `/local/` endpoint, remove CLI handler - Add `bot_session_events` table for event persistence and cold-start replay - Add `discuss` session type (reserved for future Cahciua-style mode) - Wire pipeline into HandleInbound: adapt → persist → push on every message - Lazy cold-start replay: load events from DB on first session access * feat: implement discuss mode with reactive driver and probe gate Add discuss session mode where the bot autonomously decides when to speak in group chats via tool-gated output (send tool only, no direct text reply). - Add discuss driver (per-session goroutine, RC watch, step loop via agent.Generate, TR persistence, late-binding prompt with mention hints) - Add system_discuss.md prompt template ("text = inner monologue, send = speak") - Add context composition (MergeContext, ComposeContext, TrimContext) for RC + assistant/tool message interleaving by timestamp - Add probe gate: when discuss_probe_model_id is set, cheap model pre-filters group messages; no tool calls = silence, tool calls = activate primary - Add /new [chat\|discuss] command: explicit mode selection, defaults to discuss in groups, chat in DMs, chat-only for WebUI - Add ResolveRunConfig on flow.Resolver for discuss driver to reuse model/tools/system-prompt resolution without reimplementing - Fix send tool for discuss mode: same-conversation sends now go through SendDirect (channel adapter) instead of the local emitter shortcut - Add target attribute to XML message format (reply_target for routing) - Add discuss_probe_model_id to bots table settings - Remove pipeline compaction (SetCompactCursor) — reuse existing compaction.Service - Persist full SDK messages (including tool calls) in discuss mode * refactor: unify DCP event layer, fix persistence and local channel - Fix bot_session_events dedup index to include event_kind so that message + edit events for the same external_message_id coexist. - Change CreateSessionEvent from :one to :exec so ON CONFLICT DO NOTHING does not produce spurious errors on duplicate delivery. - Move ACL evaluation before event ingest; denied messages no longer enter bot_session_events or the in-memory pipeline. - Let chat mode consume RenderedContext from the DCP pipeline when available, sharing the same event-driven context assembly as discuss. - Collapse local WebSocket handler to route through HandleInbound instead of directly calling StreamChatWS, eliminating the dual business entry point. - Extract buildBaseRunConfig shared builder so resolve() and ResolveRunConfig() no longer duplicate model/credentials/skills setup. - Add StoreRound to RunConfigResolver interface so discuss driver persists assistant output with full metadata, usage, and memory extraction (same quality as chat mode). - Fix discuss driver context: use context.Background() instead of the short-lived HTTP request context that was getting cancelled. - Fix model ID passed to StoreRound: return database UUID from ResolveRunConfig instead of SDK model name. - Remove dead CLIAdapter/CLIType and update legacy web/cli references in tests and comments. * fix: stop idle discuss goroutines after 10min timeout Discuss session goroutines were never cleaned up when a session became inactive (e.g. after /new). Add a 10-minute idle timer that auto-exits the goroutine and removes it from the sessions map when no new RC arrives. * refactor: pipeline details — event types, structured reply, display content - Remove [User sent N attachments] placeholder text from buildInboundQuery; attachment info is now expressed via pipeline <attachment> tags. - Unify in-reply-to as structured ReplyRef (Sender/Preview fields) across Telegram, Discord, Feishu, and Matrix adapters instead of prepending [Reply to ...] text into the message body. Remove now-unused buildTelegramQuotedText, buildDiscordQuotedText, buildMatrixQuotedText. - Make AdaptInbound return CanonicalEvent interface and dispatch to adaptMessage/adaptEdit/adaptService based on metadata["event_type"]. - Add event_id column to bot_history_messages (migration 0059) so user messages can reference their canonical pipeline event. - PersistEvent now returns the event UUID; HandleInbound passes it through to both persistPassiveMessage and ChatRequest.EventID for storeRound. - Add FillDisplayContent to message service: extracts plain text from event_data for clean frontend display. - Frontend extractMessageText prefers display_content when available, falling back to legacy strip logic for old messages. - Fix: always generate headerifiedQuery for storage even when usePipeline is true, so user messages are persisted via storeRound in chat mode. * fix: use json.Marshal for pipeline context content serialization The manual string escaping in buildMessagesFromPipeline only handled double quotes but not newlines, backslashes, and other JSON special characters, producing invalid json.RawMessage values. The LLM then received empty/malformed context and complained about having no history. * fix: restore WebSocket handler to use StreamChatWS directly The previous refactoring replaced the WS handler with HandleInbound + RouteHub subscription, which broke streaming because RouteHub events use a different format (channel.StreamEvent) than what the frontend expects (flow.WSStreamEvent with text_delta, tool_call_start, etc.). Restore the original direct StreamChatWS call path so WebUI streaming works again. The WS handler now matches the pre-refactoring behavior while all other changes (pipeline, ACL, event types, etc.) are kept. * feat: store display_text directly in bot_history_messages Instead of computing display content at API response time by querying bot_session_events via event_id, store the raw user text in a dedicated display_text column at write time. This works for all paths including the WebSocket handler which does not go through the pipeline/event layer. - Migration 0060: add display_text TEXT column - PersistInput gains DisplayText; filled from trimmedText (passive) and req.Query (storeRound) - toMessageFields reads display_text into DisplayContent - Remove FillDisplayContent runtime query and ListSessionEventsByEventID - Frontend already prefers display_content when available (no change) * fix: display_text should contain raw user text, not XML-wrapped query req.Query gets overwritten to headerifiedQuery (with XML <message> tags) before storeRound runs. Add RawQuery field to ChatRequest to preserve the original user text, and use it for display_text in storeMessages. * fix(web): show discuss sessions * chore(feishu): change discuss output to stream card * fix(channel): unify discuss/chat send path and card markdown delivery * feat(discuss): switch to stream execution with RouteHub broadcasting * refactor(pipeline): remove context trimming from ComposeContext The pipeline path should not trim context by token budget — the upstream IC/RC already bounds the event window. Remove TrimContext, FindWorkingWindowCursor, EstimateTokens, FormatLastProcessedMs (all unused or only used for trimming), the maxTokens parameter from ComposeContext, and MaxContextTokens from DiscussSessionConfig. --------- Co-authored-by: 晨苒 <16112591+chen-ran@users.noreply.github.com>	2026-04-06 21:56:25 +08:00
Acbox	a9a9f7e955	feat: add image generation model and generate_image agent tool Bots can now be configured with an image generation model (must have image-output compatibility). When set, the agent exposes a generate_image tool that calls the model via Twilight AI SDK, saves the result to the bot container filesystem, and returns the file path. - Add image_model_id column to bots table (migration 0053) - Update settings SQL queries, service, and types - New ImageGenProvider tool provider in internal/agent/tools/ - Wire provider in both cmd/agent and cmd/memoh entry points - Add image model selector to frontend bot settings with compat filtering - Regenerate swagger, SDK types, and sqlc code	2026-04-03 01:17:34 +08:00
Acbox	33b57ee345	feat: rename info to status, add /status slash command Rename session info endpoint from /sessions/:id/info to /sessions/:id/status and update frontend tab label accordingly. Add /status slash command that displays current session metrics (message count, context usage, cache hit rate, used skills) as formatted text in any channel.	2026-04-03 01:17:33 +08:00
Acbox	b3c783fb0b	feat: add session info panel with message count, context usage, cache stats, and skills Add GET /bots/:bot_id/sessions/:session_id/info API endpoint that returns per-session message count, latest input token usage with model context window, aggregated KV cache hit rate, and skills invoked via use_skill tool calls. Frontend Info tab in the right sidebar now displays this data in a compact key-value layout with a context usage progress bar and clickable skill links.	2026-04-03 01:17:33 +08:00
Acbox	0e646625bf	feat: add compaction ratio setting to control partial context compaction Allow users to configure what percentage of older messages to compact, keeping the most recent portion intact. Default ratio is 80%, meaning the oldest 80% of uncompacted messages are summarized while the newest 20% remain as-is for full-fidelity context.	2026-03-29 19:14:43 +08:00
Acbox	bcda6f6fe6	refactor: replace Load More with Pagination across frontend and backend - Replace all "Load More" / "Show More" buttons with Pagination components in model-list, bot-compaction, and bot-heartbeat views - Convert backend log APIs (compaction, heartbeat, schedule) from cursor-based (before+limit) to offset+limit pagination with total_count - Update SQL queries to use OFFSET+LIMIT and add COUNT queries - Add shared parseOffsetLimit helper in handler_helpers.go - Regenerate sqlc, Swagger docs, and TypeScript SDK - Clean up unused i18n keys (loadMore, showMore, history.loadMore)	2026-03-29 18:49:30 +08:00
Acbox	0730ff2945	refactor: remove max_context_load_time and max_context_tokens from bot settings These two fields controlled history context window (time-based) and token-based trimming. They are no longer needed — the resolver now always uses the hardcoded 24-hour default and skips token-based history trimming.	2026-03-29 00:00:10 +08:00
Acbox	90ac222bc9	feat: auto-create search/tts providers at startup with enable toggle - Add `enable` column (default false) to search_providers and tts_providers tables - Auto-create default entries for all provider types on startup (disabled by default) - Add enable/disable Switch toggle in frontend for both search and TTS providers - Show green status dot in sidebar for enabled providers, sort enabled first - Filter bot settings dropdowns to only show enabled providers	2026-03-28 23:47:09 +08:00
BBQ	7f9d6e4aba	feat(acl): redesign ACL with conversation scope selector (#297 ) Backend - New subject kinds: all / channel_identity / channel_type - Source scope fields on bot_acl_rules: source_channel, source_conversation_type, source_conversation_id, source_thread_id - Fix source_scope_check constraint: resolve source_channel server-side (channel_type → subject_channel_type; channel_identity → DB lookup) - Add GET /bots/:id/acl/channel-types/:type/conversations to list observed conversations by platform type - ListObservedConversations: include private/DM chats, normalise conversation_type; COALESCE(name, handle) for display name - enrichConversationAvatar: persist entry.Name → conversation_name (keeps Telegram group titles current on every message) - Unify Priority type to int32 across Go types to match DB INTEGER; remove all int/int32 casts in service layer - Fix duplicate nil guard in Evaluate; drop dead SourceScope.Channel field - Migration 0048_acl_redesign Frontend - Drag-and-drop rule priority reordering (SortableJS/useSortable); fix reorder: compute new order from oldIndex/newIndex directly, not from the array (which useSortable syncs after onEnd) - Conversation scope selector: searchable popover backed by observed conversations (by identity or platform type); collapsible manual-ID fallback - Display: name as primary label, stable channel·type·id always shown as subtitle for verification - bot-terminal: accessibility fix on close-tab button (keyboard events) - i18n: drag-to-reorder, conversation source, manual IDs (en/zh) Tests: update fakeChatACL to Evaluate interface; fix SourceScope literals. SDK/spec regenerated.	2026-03-28 01:06:13 +08:00
Yiming Qi	64378d29ed	feat: openai codex support (#292 ) * feat(web): add provider oauth management ui * feat: add OAuth callback support on port 1455 * feat: enhance reasoning effort options and support for OpenAI Codex OAuth * feat: update twilight-ai dependency to v0.3.4 * refactor: promote openai-codex to first-class client_type, remove auth_type Replace the previous openai-responses + metadata auth_type=openai-codex-oauth combo with a dedicated openai-codex client_type. OAuth requirement is now determined solely by client_type, eliminating the auth_type concept from the LLM provider domain entirely. - Add openai-codex to DB CHECK constraint (migration 0047) with data migration - Add ClientTypeOpenAICodex constant and dedicated SDK/probe branches - Remove AuthType from SDKModelConfig, ModelCredentials, TriggerConfig, etc. - Simplify supportsOAuth to check client_type == openai-codex - Add conf/providers/codex.yaml preset with Codex catalog models - Frontend: replace auth_type selector with client_type-driven OAuth UI --------- Co-authored-by: Acbox <acbox0328@gmail.com>	2026-03-27 19:30:45 +08:00
Yiming Qi	03ba13e7e5	feat: add timezone support for schedule and user runtime (#282 )	2026-03-26 01:32:02 +08:00
AlexMa233	609ca49cf5	feat: matrix support (part 1) (#242 ) * feat(channel): add Matrix adapter support * fix(channel): prevent reasoning leaks in Matrix replies * fix(channel): persist Matrix sync cursors * fix(channel): improve Matrix markdown rendering * fix(channel): support Matrix attachments and multimodal history * fix(channel): expand Matrix reply media context * fix(handlers): allow media downloads for chat-access bots * fix(channel): classify Matrix DMs as direct chats * fix(channel): auto-join Matrix room invites * fix(channel): resolve Matrix room aliases for outbound send * fix(web): use Matrix brand icon in channel badges Replace the generic Matrix hashtag badge with the official brand asset so channel badges feel recognizable and fit the circular mask cleanly. * fix(channel): add Matrix room whitelist controls Let Matrix bots decide whether to auto-join invites and restrict inbound activity to allowed rooms or aliases. Expose the new controls in the web settings UI with line-based whitelist input so access rules stay explicit. * fix(channel): stabilize Matrix multimodal follow-ups and settings * fix(flow): avoid gosec panic on byte decoding * fix: fix golangci-lint * fix(channel): remove Matrix built-in ACL * fix(channel): preserve Matrix image captions * fix(channel): validate Matrix homeserver and sync access Fail Matrix connections early when the homeserver, access token, or /sync capability is misconfigured so bot health checks surface actionable errors. * fix(channel): preserve optional toggles and relax Matrix startup validation * fix(channel): tighten Matrix mention fallback parsing * fix(flow): skip structured assistant tool-call outputs * fix(flow): resolve merged resolver duplication Keep the internal agent resolver implementation after merging main so split helper files do not redeclare flow symbols. Restore user message normalization in sanitize and persistence paths to keep flow tests and command packages building. * fix(flow): remove unused merged resolver helper Drop the leftover truncate helper and import from the resolver merge fix so golangci-lint passes again without affecting flow behavior. --------- Co-authored-by: Acbox Liu <acbox0328@gmail.com>	2026-03-22 21:55:34 +08:00
Acbox Liu	b3a39ad93d	refactor: replace persistent subagents with ephemeral spawn tool (#280 ) * refactor: replace persistent subagents with ephemeral spawn tool (#subagent) - Drop subagents table, remove all persistent subagent infrastructure - Add 'subagent' session type with parent_session_id on bot_sessions - Rewrite subagent tool as single 'spawn' tool with parallel execution - Create system_subagent.md prompt, add _subagent.md include for chat - Limit subagent tools to file, exec, web_search, web_fetch only - Merge subagent token usage into parent chat session in reporting - Remove frontend subagent management page, update chat UI for spawn - Fix UTF-8 truncation in session title, fix query not passed to agent * refactor: remove history message page	2026-03-22 19:03:28 +08:00
Acbox Liu	b88ca96064	refactor: provider & models (#277 ) * refactor: move client_type to provider, replace model fields with config JSONB - Move `client_type` from `models` to `llm_providers` table - Add `icon` field to `llm_providers` - Replace `dimensions`, `input_modalities`, `supports_reasoning` on `models` with a single `config` JSONB column containing `dimensions`, `compatibilities` (vision, tool-call, image-output, reasoning), and `context_window` - Auto-imported models default to vision + tool-call + reasoning - Update all backend consumers (agent, flow resolver, handlers, memory) - Regenerate sqlc, swagger, and TypeScript SDK - Update frontend forms, display, and i18n for new schema * ui: show provider icon avatar in sidebar and detail header, remove icon input * feat: add built-in provider registry with YAML definitions and enable toggle - Add `enable` column to llm_providers (default true, backward-compatible) - Create internal/registry package to load YAML provider/model definitions on startup and upsert into database (new providers disabled by default) - Add conf/providers/ with OpenAI, Anthropic, Google YAML definitions - Add RegistryConfig to TOML config (providers_dir, default conf/providers) - Model listing APIs and conversation flow now filter by enabled providers - Frontend: enable switch in provider form, green status dot in sidebar, enabled providers sorted to top * fix: make 0041 migration idempotent for fresh databases Guard data migration steps with column-existence checks so the migration succeeds on databases created from the updated init schema.	2026-03-22 17:24:45 +08:00
Acbox Liu	de62f94315	feat: add context compaction to automatically summarize old messages (#compaction) (#276 ) When input tokens exceed a configurable threshold after a conversation round, the system asynchronously compacts older messages into a summary. Cascading compactions reference prior summaries via <prior_context> tags to maintain conversational continuity without duplicating content. - Add bot_history_message_compacts table and compact_id on messages - Add compaction_enabled, compaction_threshold, compaction_model_id to bots - Implement compaction service (internal/compaction) with LLM summarization - Integrate into conversation flow: replace compacted messages with summaries wrapped in <summary> tags during context loading - Add REST API endpoints (GET/DELETE /bots/:bot_id/compaction/logs) - Add frontend Compaction tab with settings and log viewer - Wire compaction service into both dev (cmd/agent) and prod (cmd/memoh) entry points - Update test mocks to include new GetBotByID columns	2026-03-22 14:26:00 +08:00
Acbox Liu	80b36f79f3	refactor: unify token usage stats across all session types (#274 ) - Rewrite SQL queries to join bot_history_messages with bot_sessions, supporting chat/heartbeat/schedule usage from a single source - Update Go handler and CLI command to use unified queries - Fix daily chart stacking: each session type gets its own bar group - Add total input/output trend lines to the daily token chart - Fix summary cards reactivity by restricting aggregation to allDays range - Fix cache chart reactive dependency tracking by inlining data access - Add i18n keys for schedule, totalInput, totalOutput - Default time range changed to 7 days - Regenerate sqlc, swagger, and SDK	2026-03-21 19:14:37 +08:00
Acbox Liu	7d7d0e4b51	refactor: introduce multi-session chat support (#session) (#267 ) * refactor: introduce multi-session chat support (#session) Replace the single-context-per-bot model with multiple chat sessions. Database: - Add bot_sessions table (route_id, channel_type, title, metadata, soft delete) - Migrate bot_history_messages from (route_id, channel_type) to session_id - Add active_session_id to bot_channel_routes - Migration 0036 handles data migration from existing messages Backend: - New internal/session service for session CRUD - Update message service/types to use session_id instead of route_id - Update conversation flow (resolver, history, store) for session context - Channel inbound auto-creates/retrieves active session via SessionEnsurer - New REST endpoints: /bots/:bot_id/sessions (CRUD) - WebSocket and message handlers accept optional session_id - Wire session service into FX dependency graph (agent + memoh) Frontend: - Refactor chat store: sessions replaces chats, sessionId replaces chatId - Session-aware message loading, sending, and pagination - WebSocket sends include session_id - New session sidebar component with select/delete - Chat area header shows active session title + new session button - API layer updated: fetchSessions, createSession, deleteSession - i18n strings for session management (en + zh) SDK: - Regenerated TypeScript SDK and Swagger docs with session endpoints * fix: update tests for session refactoring (RouteID → SessionID) Remove references to removed RouteID and Platform fields from PersistInput/Message in channel_test.go and service_integration_test.go. * fix: restore accidentally deleted SDK files and guard migration 0032 - Restore packages/sdk/src/container-stream.ts and extra/index.ts that were accidentally removed during SDK regeneration - Wrap migration 0032 route_id index creation in a column existence check to avoid failure on fresh databases where 0001_init.up.sql no longer has route_id * fix: guard migration 0036 data steps for fresh databases Wrap steps 3-7 (which reference route_id/channel_type on bot_history_messages) in a column existence check so the migration is safe on fresh databases where 0001_init.up.sql already reflects the final schema without those columns. * feat: add title model setting and auto-generate session titles on user input - Add title_model_id to bots table (migration 0037) and bot settings API - Implement async title generation triggered at user message time (not after assistant response) for faster title availability - Publish session_title_updated events via SSE event hub for real-time frontend updates without page refresh - Fix SSE message event parsing: use direct JSON.parse instead of normalizeStreamEvent which silently dropped non-chat-stream event types - Add title model selector in bot settings UI with i18n support * fix: session-scoped message filtering and URL-based chat routing - Filter realtime SSE messages by session_id to prevent cross-session message leakage after page refresh - Add /chat/:sessionId? route with bidirectional URL ↔ store sync - Visiting /chat shows a clean state with no bot or session pre-selected - Visiting /chat/:sessionId loads the specific session directly - Session switches from sidebar automatically update the URL - Fix stale RouteID field in dedupe test (removed during session refactor) * fix: skip cross-channel stream events to prevent session leakage The bot-level web stream pushes events from all channels (Telegram, Discord, etc.) without session_id context. Previously these were rendered inline in the current chat view regardless of session. Now cross-channel events are ignored in handleLocalStreamEvent; persisted messages arrive via the SSE message events stream with proper session_id filtering through appendRealtimeMessage. * feat: show IM avatars and platform badges on session sidebar - Add sender_avatar_url to route metadata from identity resolution - Resolve group avatar and handle via directory adapter for group chats - JOIN bot_channel_routes in ListSessionsByBot to return route metadata - Display avatar with ChannelBadge on IM session items (group avatar for groups, sender avatar for private chats) - Show @groupname or @username as session sub-label * fix: clean up RunConfig unused fields, fix skill system and copy bug - Remove unused RunConfig fields: Tools, Channels, CurrentChannel, ActiveContextTime - Remove unused SessionContext fields: DisplayName, ConversationType - Fix EnabledSkillNames copy bug: make([]string, 0, n) + copy copies zero elements; changed to make([]string, n) - Fix prepareRunConfig dead code: remove no-op loop over CurrentPlatform runes; compute supportsImageInput from model's InputModalities - Fix EnabledSkills always nil in system prompt: resolve enabled skill entries from EnabledSkillNames + Skills - Fix use_skill tool returning empty response: now returns full skill content (description + instructions) so LLM gets it in the same turn - Skip use_skill tool registration when no skills are available - Conditionally render Skills section in system prompt (hidden when no skills exist) * feat: add session type field and bind sessions to heartbeat/schedule executions - Add `type` column to `bot_sessions` (chat \| heartbeat \| schedule) - Add `session_id` to `bot_heartbeat_logs` for per-execution session tracking - Create `schedule_logs` table binding schedule_id + session_id - Heartbeat and schedule runs now create independent sessions and persist agent messages via storeRound, enabling full conversation replay - Add schedule logs API endpoints (list by bot, list by schedule, delete) - Update Triggerer interfaces to return TriggerResult with status/usage/model * refactor: modular system prompts per session type (chat/heartbeat/schedule) Split the monolithic system.md into three type-specific system prompts with shared fragments via {{include:_xxx}} syntax, so each session type gets a focused prompt without irrelevant instructions. * fix: prevent message duplication after task completion message_created events from Persist() had an empty platform field because toMessageFromCreate() didn't extract it from the session. This caused appendRealtimeMessage to fail the platform === 'web' guard, and hasMessageWithId to fail because local IDs differ from server UUIDs, resulting in all messages being appended as duplicates. - Extract platform from metadata in toMessageFromCreate so published events carry the correct value - Pass channel_type: 'web' when creating sessions from the web frontend so List queries return the correct platform via the session JOIN * fix: use per-message usage from SDK instead of misaligned step-level usages Previously, token usage was stored via a separate per-step usages array that didn't align with messages (off-by-one from prepending user message, step count != message count). This caused: - User messages incorrectly receiving usage data - Usage values shifted across messages in multi-step rounds - Last assistant message getting the accumulated total instead of its own step usage - InputTokenDetails/OutputTokenDetails lost during manual accumulation Now each sdk.Message carries its own per-step Usage (set by the SDK in buildStepMessages), which is extracted in sdkMessagesToModelMessages and stored directly via ModelMessage.Usage. The storeRound/storeMessages path no longer needs external usage/usages parameters. Also fixes the totalUsage accumulation in runStream to include all detail fields (InputTokenDetails, OutputTokenDetails). * feat: add /new slash command to create a new active session from IM channels Users in Telegram/Discord/Feishu can now send /new to start a fresh conversation, resetting the session context for the current chat thread. The command resolves the channel route, creates a new session, sets it as the active session on the route, and replies with a confirmation message. * feat: distinguish heartbeat and schedule sessions with dedicated icons in sidebar Heartbeat sessions show a heart-pulse icon (rose), schedule sessions show a clock icon (amber), and both display a type label beneath the session title. * refactor: remove enabledSkills system prompt injection, keep sorted skill listing use_skill now returns skill content directly as tool output, so there is no need to inject enabled skill body text into the system prompt. Remove the entire enabledSkills tracking chain (RunConfig.EnabledSkillNames, StreamEvent.Skills, GenerateResult.Skills, ChatRequest/Response.Skills, enableSkill closures in runStream/runGenerate, prepareRunConfig matching). Keep a lightweight skills listing (name + description only) in the system prompt so the model knows which skills are available. Sort entries by name to guarantee deterministic ordering and maximize KV cache reuse. * refactor: remove inbox system, persist passive messages directly to history Replace the bot_inbox table and service with direct writes to bot_history_messages for group conversations where the bot is not @mentioned. Trigger-path messages continue to be persisted after the agent responds (unchanged). - Drop bot_inbox table and max_inbox_items column (migration 0039) - Delete internal/inbox/, handlers/inbox.go, command/inbox.go, agent/tools/inbox.go and the MCP message provider - Add persistPassiveMessage() in channel inbound to write user messages into the active session immediately - Rewrite ListObservedConversationsByChannelIdentity to query bot_history_messages + bot_sessions instead of bot_inbox - Extract shared send/react logic into internal/messaging/executor.go; agent/tools/message.go is now a thin SDK adapter - Clean up all inbox references from agent prompts, flow resolver, email trigger, settings, commands, DI wiring, and frontend - Regenerate sqlc, swagger, and SDK * feat: add list_sessions and search_messages agent tools Provide agents with the ability to query session metadata and search message history across all sessions. search_messages supports filtering by time range, keyword (JSONB-aware ILIKE), session, contact, and role, with a default 7-day lookback when no start_time is given. * feat: inject last_heartbeat time and improve heartbeat search guidance Query the previous heartbeat's started_at timestamp and pass it through TriggerPayload into the heartbeat prompt template. Update system prompt and HEARTBEAT.md checklist to guide agents to use search_messages with start_time=last_heartbeat for efficient cross-session message review. * fix: pass BridgeProvider to FSClient and store full heartbeat prompt FSClient was always created with nil provider, causing all container file reads (IDENTITY.md, SOUL.md, MEMORY.md, HEARTBEAT.md, etc.) to silently return empty strings. Expose Agent.BridgeProvider() and wire it into Resolver. Also fix heartbeat trigger to store the full prompt template as the user message instead of the literal "heartbeat" string. * feat: add line numbers to container file read output Move line-number formatting from the bridge gRPC server to the agent tool layer so that the raw content stored and transmitted via gRPC remains clean, while the read_file tool output includes numbered lines for easier reference by the agent. * chore(deps): update twilight-ai to v0.3.2 * fix: lint, test	2026-03-21 15:57:22 +08:00
Acbox Liu	1680316c7f	refactor(agent): remove agent gateway instead of twilight sdk (#264 ) * refactor(agent): replace TypeScript agent gateway with in-process Go agent using twilight-ai SDK - Remove apps/agent (Bun/Elysia gateway), packages/agent (@memoh/agent), internal/bun runtime manager, and all embedded agent/bun assets - Add internal/agent package powered by twilight-ai SDK for LLM calls, tool execution, streaming, sential logic, tag extraction, and prompts - Integrate ToolGatewayService in-process for both built-in and user MCP tools, eliminating HTTP round-trips to the old gateway - Update resolver to convert between sdk.Message and ModelMessage at the boundary (resolver_messages.go), keeping agent package free of persistence concerns - Prepend user message before storeRound since SDK only returns output messages (assistant + tool) - Clean up all Docker configs, TOML configs, nginx proxy, Dockerfile.agent, and Go config structs related to the removed agent gateway - Update cmd/agent and cmd/memoh entry points with setter-based ToolGateway injection to avoid FX dependency cycles * fix(web): move form declaration before computed properties that reference it The `form` reactive object was declared after computed properties like `selectedMemoryProvider` and `isSelectedMemoryProviderPersisted` that reference it, causing a TDZ ReferenceError during setup. * fix: prevent UTF-8 character corruption in streaming text output StreamTagExtractor.Push() used byte-level string slicing to hold back buffer tails for tag detection, which could split multi-byte UTF-8 characters. After json.Marshal replaced invalid bytes with U+FFFD, the corruption became permanent — causing garbled CJK characters (�) in agent responses. Add safeUTF8SplitIndex() to back up split points to valid character boundaries. Also fix byte-level truncation in command/formatter.go and command/fs.go to use rune-aware slicing. * fix: add agent error logging and fix Gemini tool schema validation - Log agent stream errors in both SSE and WebSocket paths with bot/model context - Fix send tool `attachments` parameter: empty `items` schema rejected by Google Gemini API (INVALID_ARGUMENT), now specifies `{"type": "string"}` - Upgrade twilight-ai to d898f0b (includes raw body in API error messages) * chore(ci): remove agent gateway from Docker build and release pipelines Agent gateway has been replaced by in-process Go agent; remove the obsolete Docker image matrix entry, Bun/UPX CI steps, and agent-binary build logic from the release script. * fix: preserve attachment filename, metadata, and container path through persistence - Add `name` column to `bot_history_message_assets` (migration 0034) to persist original filenames across page refreshes. - Add `metadata` JSONB column (migration 0035) to store source_path, source_url, and other context alongside each asset. - Update SQL queries, sqlc-generated code, and all Go types (MessageAsset, AssetRef, OutboundAssetRef, FileAttachment) to carry name and metadata through the full lifecycle. - Extract filenames from path/URL in AttachmentsResolver before clearing raw paths; enrich streaming event metadata with name, source_path, and source_url in both the WebSocket and channel inbound ingestion paths. - Implement `LinkAssets` on message service and `LinkOutboundAssets` on flow resolver so WebSocket-streamed bot attachments are persisted to the correct assistant message after streaming completes. - Frontend: update MessageAsset type with metadata field, pass metadata through to attachment items, and reorder attachment-block.vue template so container files (identified by metadata.source_path) open in the sidebar file manager instead of triggering a download. * refactor(agent): decouple built-in tools from MCP, load via ToolProvider interface Migrate all 13 built-in tool providers from internal/mcp/providers/ to internal/agent/tools/ using the twilight-ai sdk.Tool structure. The agent now loads tools through a ToolProvider interface instead of the MCP ToolGatewayService, which is simplified to only manage external federation sources. This enables selective tool loading and removes the coupling between business tools and the MCP protocol layer. * refactor(flow): split monolithic resolver.go into focused modules Break the 1959-line resolver.go into 12 files organized by concern: - resolver.go: core orchestration (Resolver struct, resolve, Chat, prepareRunConfig) - resolver_stream.go: streaming (StreamChat, StreamChatWS, tryStoreStream) - resolver_trigger.go: schedule/heartbeat triggers - resolver_attachments.go: attachment routing, inlining, encoding - resolver_history.go: message loading, deduplication, token trimming - resolver_store.go: persistence (storeRound, storeMessages, asset linking) - resolver_memory.go: memory provider integration - resolver_model_selection.go: model selection and candidate matching - resolver_identity.go: display name and channel identity resolution - resolver_settings.go: bot settings, loop detection, inbox - user_header.go: YAML front-matter formatting - resolver_util.go: shared utilities (sanitize, normalize, dedup, UUID) * fix(agent): enable Anthropic extended thinking by passing ReasoningConfig to provider Anthropic's thinking requires WithThinking() at provider creation time, unlike OpenAI which uses per-request ReasoningEffort. The config was never wired through, so Claude models could not trigger thinking. * refactor(agent): extract prompts into embedded markdown templates Move inline prompt strings from prompt.go into separate .md files under internal/agent/prompts/, using {{key}} placeholders and a simple render engine. Remove obsolete SystemPromptParams fields (Language, MaxContextLoadTime, Channels, CurrentChannel) and their call-site usage. * fix: lint	2026-03-19 13:31:54 +08:00
BBQ	1c19ec1022	feat(acl): source-aware chat trigger ACL (#252 )	2026-03-16 11:06:50 +08:00
Acbox	ac8a935545	refactor: remove bot type	2026-03-15 00:42:09 +08:00
BBQ	839e63acda	feat(access): add guest chat ACL (#235 )	2026-03-14 17:15:41 +08:00
Fodesu	b46e494d3a	feat(tts): introduce `TTS` system (#195 )	2026-03-13 02:49:52 +08:00
Ran	3ddb4f361c	fix(migrations): sync email oauth tokens with init schema	2026-03-11 02:46:08 +08:00
Yiming Qi	a5c364911e	feat(email/oauth): implement OAuth2 support for Gmail provider (#212 )	2026-03-09 23:37:43 +08:00
Acbox Liu	bafd327b6b	feat: agent browser (#200 ) * feat: agent browser * chore: complete docker and action config * feat: more actions * feat: browser tab switch * fix: browser build * fix: lint * fix: migrations	2026-03-07 15:06:00 +08:00
BBQ	21999b49f4	feat(container): add explicit data workflows and snapshot rollback (#193 ) * feat(container): add explicit data workflows and snapshot rollback Make container upgrades and recreation data-safe by adding explicit preserve, export, import, restore, and rollback flows across the backend, SDK, and web UI. * fix(container): resolve go lint issues Fix formatting and lint violations introduced by the container data workflow changes so the Go CI lint job passes cleanly.	2026-03-06 17:57:48 +08:00
Acbox	707e04fd38	fix(migration): repair migration version	2026-03-05 00:44:14 +08:00
Acbox	674e8c6ce9	fix: make `query` parameter of tool `search_inbox` optional	2026-03-04 22:26:24 +08:00
BBQ	9ceabf68c4	feat(mcp): replace bind-mount+exec with in-container gRPC service (#179 ) Replace the host bind-mount + containerd exec approach with a per-bot in-container gRPC server (ContainerService, port 9090). All file I/O, exec, and MCP stdio sessions now go through gRPC instead of running shell commands or reading host-mounted directories. Architecture changes: - cmd/mcp: rewritten as a gRPC server (ContainerService) with full file and exec API (ReadFile, WriteFile, ListDir, ReadRaw, WriteRaw, Exec, Stat, Mkdir, Rename, DeleteFile) - internal/mcp/mcpcontainer: protobuf definitions and generated stubs - internal/mcp/mcpclient: gRPC client wrapper with connection pool (Pool) and Provider interface for dependency injection - mcp.Manager: add per-bot IP cache, gRPC connection pool, and SetContainerIP/MCPClient methods; remove DataDir/Exec helpers - containerd.Service: remove ExecTask/ExecTaskStreaming; network setup now returns NetworkResult{IP} for pool routing - internal/fs/service.go: deleted (replaced by mcpclient) - handlers/fs.go: deleted; MCP stdio session logic moved to mcp_stdio.go - container provider Executor: all tools (read/write/list/edit/exec) now call gRPC client instead of running shell via exec - storefs, containerfs, media, skills, memory: all I/O ported to mcpclient.Provider Database: - migration 0022: drop host_path column from containers table One-time data migration: - migrateBindMountData: on first Start() after upgrade, copies old bind-mount data into the container via gRPC, then renames src dir to prevent re-migration; runs in background goroutine Bug fixes: - mcp_stdio: callRaw now returns full JSON-RPC envelope {"jsonrpc","id","result"\|"error"} matching protocol spec; explicit "initialize" call now advances session init state to prevent duplicate handshake on next non-initialize call - mcpclient Pool: properly evict stale gRPC connection after snapshot replace (container process recreated); use SetContainerIP instead of direct map write so IP changes always evict pool entry - migrateBindMountData: walkErr on directories now counted as failure so partially-walked trees don't get incorrectly marked as migrated - cmd/mcp/Dockerfile: removed dead file (docker/Dockerfile.mcp is the canonical production build) Tests: - provider_test.go: restored with bufconn in-process gRPC mock (fakeContainerService + staticProvider), 14 cases covering all 5 tools plus edge cases - mcp_session_test.go: new, covers JSON-RPC envelope, init state machine, pending cleanup on cancel/close, readLoop cancel - storefs/service_test.go: restored (pure function roundtrip tests)	2026-03-04 21:50:08 +08:00
Acbox Liu	64609c2101	feat: MCP OAuth (#178 ) * feat: MCP OAuth * fix: redirect url and oauth	2026-03-04 00:41:05 +08:00
Acbox Liu	ea719f7ca7	refactor: memory provider (#140 ) * refactor: memory provider * fix: migrations * feat: divide collection from different built-in memory * feat: add `MEMORY.md` and `PROFILES.md` * use .env for docker compose. fix #142 (#143) * feat(web): add brand icons for search providers (#144) Add custom FontAwesome icon definitions for all 9 search providers: - Yandex: uses existing faYandex from FA free brands - Tavily, Jina, Exa, Bocha, Serper: custom icons from brand SVGs - DuckDuckGo, SearXNG, Sogou: custom icons from Simple Icons Icons are registered with a custom 'fac' prefix and rendered as monochrome (currentColor) via FontAwesome's standard rendering. * fix: resolve multiple UI bugs (#147) * feat: add email service with multi-adapter support (#146) * feat: add email service with multi-adapter support Implement a full-stack email service with global provider management, per-bot bindings with granular read/write permissions, outbox audit storage, and MCP tool integration for direct mailbox access. Backend: - Email providers: CRUD with dynamic config schema (generic SMTP/IMAP, Mailgun) - Generic adapter: go-mail (SMTP) + go-imap/v2 (IMAP IDLE real-time push via UnilateralDataHandler + UID-based tracking + periodic check fallback) - Mailgun adapter: mailgun-go/v5 with dual inbound mode (webhook + poll) - Bot email bindings: per-bot provider binding with independent r/w permissions - Outbox: outbound email audit log with status tracking - Trigger: inbound emails push notification to bot_inbox (from/subject only, LLM reads full content on demand via MCP tools) - MailboxReader interface: on-demand IMAP queries for listing/reading emails - MCP tools: email_accounts, email_send, email_list (paginated mailbox), email_read (by UID) — all with multi-binding and provider_id selection - Webhook: /email/mailgun/webhook/:config_id (JWT-skipped, signature-verified) - DB migration: 0019_add_email (email_providers, bot_email_bindings, email_outbox) Frontend: - Email Providers page: /email-providers with MasterDetailSidebarLayout - Dynamic config form rendered from ordered provider meta schema with i18n keys - Bot detail: Email tab with bindings management + outbox audit table - Sidebar navigation entry - Full i18n support (en + zh) - Auto-generated SDK from Swagger Closes #17 * feat(email): trigger bot conversation immediately on inbound email Instead of only storing an inbox item and waiting for the next chat, the email trigger now proactively invokes the conversation resolver so the bot processes new emails right away — aligned with the schedule/heartbeat trigger pattern. * fix: lint --------- Co-authored-by: Acbox <acbox0328@gmail.com> * chore: update AGENTS.md * feat: files preview * feat(web): improve MCP details page * refactor(skills): import skill with pure markdown string * merge main into refactor/memory * fix: migration * refactor: temp delete qdrant and bm25 index * fix: clean merge code * fix: update memory handler --------- Co-authored-by: Leohearts <leohearts@leohearts.com> Co-authored-by: Menci <mencici@msn.com> Co-authored-by: Quincy <69751197+dqygit@users.noreply.github.com> Co-authored-by: BBQ <35603386+HoneyBBQ@users.noreply.github.com> Co-authored-by: Ran <16112591+chen-ran@users.noreply.github.com>	2026-03-03 15:33:50 +08:00
Acbox Liu	0cdf822603	feat: token usage state (#153 ) * feat: token usage state * fix: typo	2026-03-01 02:19:07 +08:00
BBQ	cc5f00355f	feat: add email service with multi-adapter support (#146 ) * feat: add email service with multi-adapter support Implement a full-stack email service with global provider management, per-bot bindings with granular read/write permissions, outbox audit storage, and MCP tool integration for direct mailbox access. Backend: - Email providers: CRUD with dynamic config schema (generic SMTP/IMAP, Mailgun) - Generic adapter: go-mail (SMTP) + go-imap/v2 (IMAP IDLE real-time push via UnilateralDataHandler + UID-based tracking + periodic check fallback) - Mailgun adapter: mailgun-go/v5 with dual inbound mode (webhook + poll) - Bot email bindings: per-bot provider binding with independent r/w permissions - Outbox: outbound email audit log with status tracking - Trigger: inbound emails push notification to bot_inbox (from/subject only, LLM reads full content on demand via MCP tools) - MailboxReader interface: on-demand IMAP queries for listing/reading emails - MCP tools: email_accounts, email_send, email_list (paginated mailbox), email_read (by UID) — all with multi-binding and provider_id selection - Webhook: /email/mailgun/webhook/:config_id (JWT-skipped, signature-verified) - DB migration: 0019_add_email (email_providers, bot_email_bindings, email_outbox) Frontend: - Email Providers page: /email-providers with MasterDetailSidebarLayout - Dynamic config form rendered from ordered provider meta schema with i18n keys - Bot detail: Email tab with bindings management + outbox audit table - Sidebar navigation entry - Full i18n support (en + zh) - Auto-generated SDK from Swagger Closes #17 * feat(email): trigger bot conversation immediately on inbound email Instead of only storing an inbox item and waiting for the next chat, the email trigger now proactively invokes the conversation resolver so the bot processes new emails right away — aligned with the schedule/heartbeat trigger pattern. * fix: lint --------- Co-authored-by: Acbox <acbox0328@gmail.com>	2026-02-28 21:03:59 +08:00
Acbox Liu	fe10abf3fc	refactor: inbox (#137 ) * refactor: inbox * fix: migrations * fix: migrations	2026-02-26 20:16:02 +08:00
Acbox Liu	2f38662d4d	feat: heartbeat (#108 ) * feat: heartbeat * feat: independent heartbeat model	2026-02-25 16:32:52 +08:00
Acbox	a440bf122b	feat(search): add bing and google support	2026-02-23 15:41:47 +08:00
Acbox Liu	17cd077f34	feat: add thinking support (#100 ) * feat: add thinking support * feat: improve thinking block render in web and filter thinking content in channels * fix: migrate	2026-02-23 14:41:27 +08:00

1 2

94 Commits