Memoh

mirror of https://github.com/memohai/Memoh.git synced 2026-04-25 07:00:48 +09:00

Author	SHA1	Message	Date
Acbox	0549f5cafc	feat(command): improve slash command UX Make slash commands easier to navigate in chat by splitting help into levels, compacting list output, and surfacing current selections for model, search, memory, and browser settings. Also route /status to the active conversation session and add an access inspector so users can understand their current command and ACL context.	2026-04-12 17:25:10 +08:00
Acbox Liu	7a21fd5f07	feat: ui message (#357 )	2026-04-11 13:29:41 +08:00
BBQ	f376a2abe3	fix(channel): add wechatoa webhook delivery and proxy config (#356 ) Unify webhook handling across channel adapters and add the WeChat Official Account channel so inbound routing and replies work without platform-specific handlers. Add adapter-scoped proxy support and stable config field ordering so restricted network environments can deliver WeChat and Telegram messages reliably.	2026-04-10 21:26:11 +08:00
Ming Lin	4d3f2de7e2	feat: Add GPU CDI support for workspace containers (#332 ) * feat: add CDI GPU support for workspace containers * feat: expose GPU CDI settings in bot container UI * feat: move GPU settings into advanced container options * docs: document advanced CDI device configuration	2026-04-10 14:52:17 +08:00
Fodesu	19619d73a9	fix(chat): respect override model selection (#354 )	2026-04-10 02:31:29 +08:00
BBQ	d3bf6bc90a	fix(channel,attachment): channel quality refactor & attachment pipeline fixes (#349 ) * feat(channel): add DingTalk channel adapter - Add DingTalk channel adapter (`internal/channel/adapters/dingtalk/`) using dingtalk-stream-sdk-go, supporting inbound message receiving and outbound text/markdown reply - Register DingTalk adapter in cmd/agent and cmd/memoh - Add go.mod dependency: github.com/memohai/dingtalk-stream-sdk-go - Add Dingtalk and Wecom SVG icons and Vue components to @memohai/icon - Refactor existing icon components to remove redundant inline wrappers - Add `channelTypeDisplayName` util for consistent channel label resolution - Add DingTalk/WeCom i18n entries (en/zh) for types and typesShort - Extend channel-icon, bot-channels, channel-settings-panel to support dingtalk/wecom - Use channelTypeDisplayName in profile page to replace ad-hoc i18n lookup * fix(channel,attachment): channel quality refactor & attachment pipeline fixes Channel module: - Fix RemoveAdapter not cleaning connectionMeta (stale status leak) - Fix preparedAttachmentTypeFromMime misclassifying image/gif - Fix sleepWithContext time.After goroutine/timer leak - Export IsDataURL/IsHTTPURL/IsDataPath, dedup across packages - Cache OutboundPolicy in managerOutboundStream to avoid repeated lookups - Split OutboundAttachmentStore: extract ContainerAttachmentIngester interface - Add ManagerOption funcs (WithInboundQueueSize, WithInboundWorkers, WithRefreshInterval) - Add thread-safety docs on OutboundStream / managerOutboundStream - Add debug logs on successful send/edit paths - Expand outbound_prepare_test.go with 21 new cases - Convert no-receiver adapter helpers to package-level funcs; drop unused params DingTalk adapter: - Implement AttachmentResolver: download inbound media via /v1.0/robot/messageFiles/download - Fix pure-image inbound messages failing due to missing resolver Attachment pipeline: - Fix images invisible to LLM in pipeline (DCP) path: inject InlineImages into last user message when cfg.Query is empty - Fix public_url fallback: skip direct URL-to-LLM when ContentHash is set, always prefer inlined persisted asset - Inject path: carry ImageParts through agent.InjectMessage; inline persisted attachments in resolver inject goroutine so mid-stream images reach the model - Fix ResolveMime for images: prefer content-sniffed MIME over platform-declared MIME (fixes Feishu sending image/png header for actual JPEG content → API 400)	2026-04-09 14:36:11 +08:00
zhangxx	8c4e9e218e	Merge pull request #323 from mx1700/feat/stop-command feat: add /stop command to abort agent generation on external channels	2026-04-08 01:25:09 +08:00
Acbox Liu	8d5c38f0e5	refactor: unify providers and models tables (#338 ) * refactor: unify providers and models tables - Rename `llm_providers` → `providers`, `llm_provider_oauth_tokens` → `provider_oauth_tokens` - Remove `tts_providers` and `tts_models` tables; speech models now live in the unified `models` table with `type = 'speech'` - Replace top-level `api_key`/`base_url` columns with a JSONB `config` field on `providers` - Rename `llm_provider_id` → `provider_id` across all references - Add `edge-speech` client type and `conf/providers/edge.yaml` default provider - Create new read-only speech endpoints (`/speech-providers`, `/speech-models`) backed by filtered views of the unified tables - Remove old TTS CRUD handlers; simplify speech page to read-only + test - Update registry loader to skip malformed YAML files instead of failing entirely - Fix YAML quoting for model names containing colons in openrouter.yaml - Regenerate sqlc, swagger, and TypeScript SDK * fix: exclude speech providers from providers list endpoint ListProviders now filters out client_type matching '%-speech' so Edge and future speech providers no longer appear on the Providers page. ListSpeechProviders uses the same pattern match instead of hard-coding 'edge-speech'. * fix: use explicit client_type list instead of LIKE pattern Replace '%-speech' pattern with explicit IN ('edge-speech') for both ListProviders (exclusion) and ListSpeechProviders (inclusion). New speech client types must be added to both queries. * fix: use EXECUTE for dynamic SQL in migrations referencing old schema PL/pgSQL pre-validates column/table references in static SQL statements inside DO blocks before evaluating IF/RETURN guards. This caused migrations 0010-0061 to fail on fresh databases where the canonical schema uses `providers`/`provider_id` instead of `llm_providers`/ `llm_provider_id`. Wrap all SQL that references potentially non-existent old schema objects (llm_providers, llm_provider_id, tts_providers, tts_models, etc.) in EXECUTE strings so they are only parsed at runtime when actually reached. * fix: revert canonical schema to use llm_providers for migration compatibility The CI migrations workflow (up → down → up) failed because 0061 down renames `providers` back to `llm_providers`, but 0001 down only dropped `providers` — leaving `llm_providers` as a remnant. On the second migrate up, 0010 found the stale `llm_providers` and tried to reference `models.llm_provider_id` which no longer existed. Revert 0001 canonical schema to use original names (llm_providers, tts_providers, tts_models) so incremental migrations work naturally and 0061 handles the final rename. Remove EXECUTE wrappers and unnecessary guards from migrations that now always operate on llm_providers. * fix: icons * fix: sync canonical schema with 0061 migration to fix sqlc column mismatch 0001_init.up.sql still used old names (llm_providers, llm_provider_id) and included dropped tts_providers/tts_models tables. sqlc could not parse the PL/pgSQL EXECUTE in migration 0061, so generated code retained stale columns (input_modalities, supports_reasoning) causing runtime "column does not exist" errors when adding models. - Update 0001_init.up.sql to current schema (providers, provider_id, no tts tables, add provider_oauth_tokens) - Use ALTER TABLE IF EXISTS in 0010/0041/0042 for backward compat - Regenerate sqlc * fix: guard all legacy migrations against fresh schema for CI compat On fresh databases, 0001_init.up.sql creates providers/provider_id (not llm_providers/llm_provider_id). Migrations 0013, 0041, 0046, 0047 referenced the old names without guards, causing CI migration failures. - 0013: check llm_provider_id column exists before adding old constraint - 0041: check llm_providers table exists before backfill/constraint DDL - 0046: wrap CREATE TABLE in DO block with llm_providers existence check - 0047: use ALTER TABLE IF EXISTS + DO block guard	2026-04-08 01:03:44 +08:00
Acbox Liu	43c4153938	feat: introduce DCP pipeline layer for unified context assembly (#329 ) * refactor: introduce DCP pipeline layer for unified context assembly Introduce a Deterministic Context Pipeline (DCP) inspired by Cahciua, providing event-driven context assembly for LLM conversations. - Add `internal/pipeline/` package with Canonical Event types, Projection (reduce), Rendering (XML RC), Pipeline manager, and EventStore persistence - Change user message format from YAML front-matter to XML `<message>` tags with self-contained attributes (sender, channel, conversation, type) - Merge CLI/Web dual API into single `/local/` endpoint, remove CLI handler - Add `bot_session_events` table for event persistence and cold-start replay - Add `discuss` session type (reserved for future Cahciua-style mode) - Wire pipeline into HandleInbound: adapt → persist → push on every message - Lazy cold-start replay: load events from DB on first session access * feat: implement discuss mode with reactive driver and probe gate Add discuss session mode where the bot autonomously decides when to speak in group chats via tool-gated output (send tool only, no direct text reply). - Add discuss driver (per-session goroutine, RC watch, step loop via agent.Generate, TR persistence, late-binding prompt with mention hints) - Add system_discuss.md prompt template ("text = inner monologue, send = speak") - Add context composition (MergeContext, ComposeContext, TrimContext) for RC + assistant/tool message interleaving by timestamp - Add probe gate: when discuss_probe_model_id is set, cheap model pre-filters group messages; no tool calls = silence, tool calls = activate primary - Add /new [chat\|discuss] command: explicit mode selection, defaults to discuss in groups, chat in DMs, chat-only for WebUI - Add ResolveRunConfig on flow.Resolver for discuss driver to reuse model/tools/system-prompt resolution without reimplementing - Fix send tool for discuss mode: same-conversation sends now go through SendDirect (channel adapter) instead of the local emitter shortcut - Add target attribute to XML message format (reply_target for routing) - Add discuss_probe_model_id to bots table settings - Remove pipeline compaction (SetCompactCursor) — reuse existing compaction.Service - Persist full SDK messages (including tool calls) in discuss mode * refactor: unify DCP event layer, fix persistence and local channel - Fix bot_session_events dedup index to include event_kind so that message + edit events for the same external_message_id coexist. - Change CreateSessionEvent from :one to :exec so ON CONFLICT DO NOTHING does not produce spurious errors on duplicate delivery. - Move ACL evaluation before event ingest; denied messages no longer enter bot_session_events or the in-memory pipeline. - Let chat mode consume RenderedContext from the DCP pipeline when available, sharing the same event-driven context assembly as discuss. - Collapse local WebSocket handler to route through HandleInbound instead of directly calling StreamChatWS, eliminating the dual business entry point. - Extract buildBaseRunConfig shared builder so resolve() and ResolveRunConfig() no longer duplicate model/credentials/skills setup. - Add StoreRound to RunConfigResolver interface so discuss driver persists assistant output with full metadata, usage, and memory extraction (same quality as chat mode). - Fix discuss driver context: use context.Background() instead of the short-lived HTTP request context that was getting cancelled. - Fix model ID passed to StoreRound: return database UUID from ResolveRunConfig instead of SDK model name. - Remove dead CLIAdapter/CLIType and update legacy web/cli references in tests and comments. * fix: stop idle discuss goroutines after 10min timeout Discuss session goroutines were never cleaned up when a session became inactive (e.g. after /new). Add a 10-minute idle timer that auto-exits the goroutine and removes it from the sessions map when no new RC arrives. * refactor: pipeline details — event types, structured reply, display content - Remove [User sent N attachments] placeholder text from buildInboundQuery; attachment info is now expressed via pipeline <attachment> tags. - Unify in-reply-to as structured ReplyRef (Sender/Preview fields) across Telegram, Discord, Feishu, and Matrix adapters instead of prepending [Reply to ...] text into the message body. Remove now-unused buildTelegramQuotedText, buildDiscordQuotedText, buildMatrixQuotedText. - Make AdaptInbound return CanonicalEvent interface and dispatch to adaptMessage/adaptEdit/adaptService based on metadata["event_type"]. - Add event_id column to bot_history_messages (migration 0059) so user messages can reference their canonical pipeline event. - PersistEvent now returns the event UUID; HandleInbound passes it through to both persistPassiveMessage and ChatRequest.EventID for storeRound. - Add FillDisplayContent to message service: extracts plain text from event_data for clean frontend display. - Frontend extractMessageText prefers display_content when available, falling back to legacy strip logic for old messages. - Fix: always generate headerifiedQuery for storage even when usePipeline is true, so user messages are persisted via storeRound in chat mode. * fix: use json.Marshal for pipeline context content serialization The manual string escaping in buildMessagesFromPipeline only handled double quotes but not newlines, backslashes, and other JSON special characters, producing invalid json.RawMessage values. The LLM then received empty/malformed context and complained about having no history. * fix: restore WebSocket handler to use StreamChatWS directly The previous refactoring replaced the WS handler with HandleInbound + RouteHub subscription, which broke streaming because RouteHub events use a different format (channel.StreamEvent) than what the frontend expects (flow.WSStreamEvent with text_delta, tool_call_start, etc.). Restore the original direct StreamChatWS call path so WebUI streaming works again. The WS handler now matches the pre-refactoring behavior while all other changes (pipeline, ACL, event types, etc.) are kept. * feat: store display_text directly in bot_history_messages Instead of computing display content at API response time by querying bot_session_events via event_id, store the raw user text in a dedicated display_text column at write time. This works for all paths including the WebSocket handler which does not go through the pipeline/event layer. - Migration 0060: add display_text TEXT column - PersistInput gains DisplayText; filled from trimmedText (passive) and req.Query (storeRound) - toMessageFields reads display_text into DisplayContent - Remove FillDisplayContent runtime query and ListSessionEventsByEventID - Frontend already prefers display_content when available (no change) * fix: display_text should contain raw user text, not XML-wrapped query req.Query gets overwritten to headerifiedQuery (with XML <message> tags) before storeRound runs. Add RawQuery field to ChatRequest to preserve the original user text, and use it for display_text in storeMessages. * fix(web): show discuss sessions * refactor: introduce DCP pipeline layer for unified context assembly Introduce a Deterministic Context Pipeline (DCP) inspired by Cahciua, providing event-driven context assembly for LLM conversations. - Add `internal/pipeline/` package with Canonical Event types, Projection (reduce), Rendering (XML RC), Pipeline manager, and EventStore persistence - Change user message format from YAML front-matter to XML `<message>` tags with self-contained attributes (sender, channel, conversation, type) - Merge CLI/Web dual API into single `/local/` endpoint, remove CLI handler - Add `bot_session_events` table for event persistence and cold-start replay - Add `discuss` session type (reserved for future Cahciua-style mode) - Wire pipeline into HandleInbound: adapt → persist → push on every message - Lazy cold-start replay: load events from DB on first session access * feat: implement discuss mode with reactive driver and probe gate Add discuss session mode where the bot autonomously decides when to speak in group chats via tool-gated output (send tool only, no direct text reply). - Add discuss driver (per-session goroutine, RC watch, step loop via agent.Generate, TR persistence, late-binding prompt with mention hints) - Add system_discuss.md prompt template ("text = inner monologue, send = speak") - Add context composition (MergeContext, ComposeContext, TrimContext) for RC + assistant/tool message interleaving by timestamp - Add probe gate: when discuss_probe_model_id is set, cheap model pre-filters group messages; no tool calls = silence, tool calls = activate primary - Add /new [chat\|discuss] command: explicit mode selection, defaults to discuss in groups, chat in DMs, chat-only for WebUI - Add ResolveRunConfig on flow.Resolver for discuss driver to reuse model/tools/system-prompt resolution without reimplementing - Fix send tool for discuss mode: same-conversation sends now go through SendDirect (channel adapter) instead of the local emitter shortcut - Add target attribute to XML message format (reply_target for routing) - Add discuss_probe_model_id to bots table settings - Remove pipeline compaction (SetCompactCursor) — reuse existing compaction.Service - Persist full SDK messages (including tool calls) in discuss mode * refactor: unify DCP event layer, fix persistence and local channel - Fix bot_session_events dedup index to include event_kind so that message + edit events for the same external_message_id coexist. - Change CreateSessionEvent from :one to :exec so ON CONFLICT DO NOTHING does not produce spurious errors on duplicate delivery. - Move ACL evaluation before event ingest; denied messages no longer enter bot_session_events or the in-memory pipeline. - Let chat mode consume RenderedContext from the DCP pipeline when available, sharing the same event-driven context assembly as discuss. - Collapse local WebSocket handler to route through HandleInbound instead of directly calling StreamChatWS, eliminating the dual business entry point. - Extract buildBaseRunConfig shared builder so resolve() and ResolveRunConfig() no longer duplicate model/credentials/skills setup. - Add StoreRound to RunConfigResolver interface so discuss driver persists assistant output with full metadata, usage, and memory extraction (same quality as chat mode). - Fix discuss driver context: use context.Background() instead of the short-lived HTTP request context that was getting cancelled. - Fix model ID passed to StoreRound: return database UUID from ResolveRunConfig instead of SDK model name. - Remove dead CLIAdapter/CLIType and update legacy web/cli references in tests and comments. * fix: stop idle discuss goroutines after 10min timeout Discuss session goroutines were never cleaned up when a session became inactive (e.g. after /new). Add a 10-minute idle timer that auto-exits the goroutine and removes it from the sessions map when no new RC arrives. * refactor: pipeline details — event types, structured reply, display content - Remove [User sent N attachments] placeholder text from buildInboundQuery; attachment info is now expressed via pipeline <attachment> tags. - Unify in-reply-to as structured ReplyRef (Sender/Preview fields) across Telegram, Discord, Feishu, and Matrix adapters instead of prepending [Reply to ...] text into the message body. Remove now-unused buildTelegramQuotedText, buildDiscordQuotedText, buildMatrixQuotedText. - Make AdaptInbound return CanonicalEvent interface and dispatch to adaptMessage/adaptEdit/adaptService based on metadata["event_type"]. - Add event_id column to bot_history_messages (migration 0059) so user messages can reference their canonical pipeline event. - PersistEvent now returns the event UUID; HandleInbound passes it through to both persistPassiveMessage and ChatRequest.EventID for storeRound. - Add FillDisplayContent to message service: extracts plain text from event_data for clean frontend display. - Frontend extractMessageText prefers display_content when available, falling back to legacy strip logic for old messages. - Fix: always generate headerifiedQuery for storage even when usePipeline is true, so user messages are persisted via storeRound in chat mode. * fix: use json.Marshal for pipeline context content serialization The manual string escaping in buildMessagesFromPipeline only handled double quotes but not newlines, backslashes, and other JSON special characters, producing invalid json.RawMessage values. The LLM then received empty/malformed context and complained about having no history. * fix: restore WebSocket handler to use StreamChatWS directly The previous refactoring replaced the WS handler with HandleInbound + RouteHub subscription, which broke streaming because RouteHub events use a different format (channel.StreamEvent) than what the frontend expects (flow.WSStreamEvent with text_delta, tool_call_start, etc.). Restore the original direct StreamChatWS call path so WebUI streaming works again. The WS handler now matches the pre-refactoring behavior while all other changes (pipeline, ACL, event types, etc.) are kept. * feat: store display_text directly in bot_history_messages Instead of computing display content at API response time by querying bot_session_events via event_id, store the raw user text in a dedicated display_text column at write time. This works for all paths including the WebSocket handler which does not go through the pipeline/event layer. - Migration 0060: add display_text TEXT column - PersistInput gains DisplayText; filled from trimmedText (passive) and req.Query (storeRound) - toMessageFields reads display_text into DisplayContent - Remove FillDisplayContent runtime query and ListSessionEventsByEventID - Frontend already prefers display_content when available (no change) * fix: display_text should contain raw user text, not XML-wrapped query req.Query gets overwritten to headerifiedQuery (with XML <message> tags) before storeRound runs. Add RawQuery field to ChatRequest to preserve the original user text, and use it for display_text in storeMessages. * fix(web): show discuss sessions * chore(feishu): change discuss output to stream card * fix(channel): unify discuss/chat send path and card markdown delivery * feat(discuss): switch to stream execution with RouteHub broadcasting * refactor(pipeline): remove context trimming from ComposeContext The pipeline path should not trim context by token budget — the upstream IC/RC already bounds the event window. Remove TrimContext, FindWorkingWindowCursor, EstimateTokens, FormatLastProcessedMs (all unused or only used for trimming), the maxTokens parameter from ComposeContext, and MaxContextTokens from DiscussSessionConfig. --------- Co-authored-by: 晨苒 <16112591+chen-ran@users.noreply.github.com>	2026-04-06 21:56:25 +08:00
晨苒	830c521f11	feat(feishu): keep mention(at) target	2026-04-06 06:18:03 +08:00
晨苒	aa39ea3357	fix(containerd): ctx with namespace	2026-04-06 06:03:56 +08:00
Lakr	daed345908	feat(browser): add remote Playwright session support (Tier 2) (#325 ) Add native Playwright WebSocket sessions alongside the existing curated browser tools. Agents can now create remote sessions that expose a full Playwright API over WebSocket, enabling advanced use cases like HttpOnly cookie injection, storage state management, and route interception. Key changes: - Per-bot isolated browser processes (launchServer via Node child process) - New session module with create/close/status/heartbeat endpoints - New browser_remote_session agent tool (Go) - Storage state export/import on existing browser contexts - Bot ID plumbing through context creation for process isolation - Inflight deduplication to prevent duplicate browser launches - Session janitor for automatic expiry cleanup	2026-04-04 23:49:05 +08:00
Acbox	c172699466	fix(channel): allow attachment-only messages via WebSocket (#331 ) The WebSocket handler rejected messages with empty text even when attachments were present, while the HTTP POST endpoint correctly used Message.IsEmpty(). Move the empty-check after attachment parsing so only truly empty messages are rejected.	2026-04-04 21:50:40 +08:00
Ringo.Typowriter	09c523f0b8	refactor(agent): merge read_media tool into read tool (#326 )	2026-04-04 20:56:00 +08:00
Acbox Liu	5cfbaa40e2	refactor(agent): replace XML tag extraction with tool-based send/react/speak (#330 ) * refactor(agent): replace XML tag extraction with tool-based send/react/speak Remove the <attachments>, <reactions>, and <speech> XML tag extraction system from the agent streaming pipeline. Instead, the send/react/speak tools now handle both same-conversation and cross-conversation delivery: - send: omit target to deliver attachments in the current conversation; specify target for cross-channel messaging - react: omit target to react in the current conversation - speak: omit target to speak in the current conversation Backend changes: - Add StreamEmitter callback to tools.SessionContext so tools can push attachment/reaction/speech events directly into the agent stream - Wire emitter in agent.go for both streaming and non-streaming paths - Remove StreamTagExtractor, DefaultTagResolvers, emitTagEvents, and delete internal/agent/tags.go entirely - Remove StripAgentTags calls from assistant_output.go - Add IsSameConversation detection in messaging executor; same-conv sends pass raw paths through the emitter for downstream ingestion - Auto-resolve relative paths (e.g. "IDENTITY.md" -> "/data/IDENTITY.md") - Add Metadata propagation through the full attachment chain (tools.Attachment -> agent.FileAttachment -> parseAttachmentDelta) - Update system_chat.md and _contacts.md prompts Frontend changes (apps/web): - Hide send/react/speak tool_call blocks when result indicates delivered to current conversation - Defer attachment_delta blocks to end of message (flush on stream completion) for consistent positioning with DB-loaded history * fix(agent): speak tool emits synthesized audio directly as voice attachment Instead of emitting speech_delta (which requires downstream re-synthesis), the speak tool now emits the already-synthesized audio as an attachment_delta with voice type. This avoids double TTS synthesis and eliminates dependency on ttsService being configured on the inbound processor. Also fixes speak on WebUI where ReplyTarget is empty (same fix as send).	2026-04-04 20:55:03 +08:00
Acbox	a5f59ea6a5	fix(channel): strip agent tags from content parts in outbound messages StripAgentTags was only applied to the merged content string but not to individual ContentParts. On channels that don't support RichText (e.g. Telegram), buildChannelMessage joins part texts directly, causing raw <attachments>/<reactions>/<speech> blocks to appear in the final message.	2026-04-04 17:17:35 +08:00
Acbox	a9a9f7e955	feat: add image generation model and generate_image agent tool Bots can now be configured with an image generation model (must have image-output compatibility). When set, the agent exposes a generate_image tool that calls the model via Twilight AI SDK, saves the result to the bot container filesystem, and returns the file path. - Add image_model_id column to bots table (migration 0053) - Update settings SQL queries, service, and types - New ImageGenProvider tool provider in internal/agent/tools/ - Wire provider in both cmd/agent and cmd/memoh entry points - Add image model selector to frontend bot settings with compat filtering - Regenerate swagger, SDK types, and sqlc code	2026-04-03 01:17:34 +08:00
Acbox	a73bac05fe	fix(agent): skip tool injection for models without tool-call capability Models that lack the "tool-call" compatibility flag now run without tools, preventing provider errors when the model does not support function calling.	2026-04-03 01:17:34 +08:00
Acbox	a31995424c	feat: add per-route message dispatch modes (inject/parallel/queue) Introduce three inbound message handling modes for channel adapters: - inject (default, /btw): when a route has an active agent stream, inject the new user message into the running stream via the SDK's PrepareStep hook between tool rounds. The message is interleaved at the correct position in the persisted round. - parallel (/now): start a new agent stream immediately, running concurrently with any existing stream (preserves current behavior). - queue (/next): enqueue the message and process it after the current stream completes. Key components: - RouteDispatcher: per-route state management with inject channel, task queue, and active-stream tracking. - PrepareStep integration: drains inject channel between tool rounds, records insertion position via InjectedRecorder for correct persistence ordering. - interleaveInjectedMessages: inserts injected user messages at their actual injection position within the persisted message round. - Parallel mode isolation: /now streams do not interact with the dispatcher, preventing them from clearing another stream's active state.	2026-04-03 01:17:33 +08:00
Acbox	33b57ee345	feat: rename info to status, add /status slash command Rename session info endpoint from /sessions/:id/info to /sessions/:id/status and update frontend tab label accordingly. Add /status slash command that displays current session metrics (message count, context usage, cache hit rate, used skills) as formatted text in any channel.	2026-04-03 01:17:33 +08:00
Acbox	b3c783fb0b	feat: add session info panel with message count, context usage, cache stats, and skills Add GET /bots/:bot_id/sessions/:session_id/info API endpoint that returns per-session message count, latest input token usage with model context window, aggregated KV cache hit rate, and skills invoked via use_skill tool calls. Frontend Info tab in the right sidebar now displays this data in a compact key-value layout with a context usage progress bar and clickable skill links.	2026-04-03 01:17:33 +08:00
Acbox	bb14bcb3bc	refactor: move skills directory from .skills to skills and enrich prompt - Change skills storage path from `/data/.skills` to `/data/skills` - Add usage instructions and directory location to the Skills section in the system prompt	2026-04-03 01:17:31 +08:00
Acbox Liu	faaf13a0e9	feat: add Supermarket integration (MCP & Skill marketplace) (#309 ) * feat: add Supermarket integration (MCP & Skill marketplace) Backend: - Add [supermarket] config section with base_url (default: supermarket.memoh.ai) - Add SupermarketHandler with proxy endpoints for MCPs, Skills, and Tags - Add install endpoints: POST /bots/:id/supermarket/install-mcp (creates MCP connection with env vars) and install-skill (downloads tar.gz, extracts to container via gRPC) - Register handler in FX wiring, generate Swagger docs and TypeScript SDK Frontend: - Add /settings/supermarket route with Store icon in sidebar - Create supermarket page with search, tag filtering, MCP and Skill sections - Add MCP/Skill card components with tag badges and install buttons - Add install dialogs: MCP (bot selector + env var form), Skill (bot selector) - Add i18n entries for en.json and zh.json * fix: improve supermarket install UX - Create BotSelect component with avatar + name using UI Select - Replace NativeSelect in install dialogs and usage page with BotSelect - Change MCP install flow: navigate to bot detail MCP tab with pre-filled draft instead of direct install, letting users review before saving - Move Supermarket sidebar entry between Browser and Usage * web: remove supermarket page top tag selector bar Drop the horizontal tag chips and getSupermarketTags fetch; keep search and tag filter via card tag clicks with clearable badge. * web: add homepage link to supermarket MCP and Skill cards Show an external-link icon next to the card title when homepage is available, opening in a new tab on click.	2026-04-03 01:17:31 +08:00
Acbox	3fa311c6cb	fix(media): proxy ContainerFileOpener through fallback storage provider The fallback provider introduced in `5aeb2fd3` wrapped containerfs but did not implement storage.ContainerFileOpener, causing IngestContainerFile to fail with "provider does not support container file reading". This broke outbound file attachments on all IM channels (Telegram, Discord, etc.) because container paths like /data/xxx.xlsx were passed as-is to the platform API instead of being ingested into the media store first.	2026-04-03 01:14:20 +08:00
Acbox	5aeb2fd3fc	fix(media): add local filesystem fallback and fix gallery lightbox matching - Add localfs storage provider as fallback when containerfs is unreachable - Wrap media service with fallback provider in both entry points - Fix gallery lightbox src matching by comparing pathnames only	2026-04-03 00:01:49 +08:00
Acbox	fc2b603018	fix(agent): skip tools for models without tool-call capability and parse image output - Add SupportsToolCall to RunConfig; only inject tools into SDK when set - Update twilight-ai to 497ad09 which adds SSE scanner 10MB buffer (fixes token-too-long on large image payloads) and parses the images array from OpenAI-compatible chat completions into StreamFilePart	2026-04-03 00:01:14 +08:00
Acbox	c1e6e0cc7a	feat(agent): add pagination and smart collapsing to container list tool Large directories like node_modules/.venv could return thousands of entries, wasting tokens and causing timeouts. Add offset/limit pagination to ListDir RPC and collapse heavy subdirectories (>50 items) into summaries in recursive mode. Collapsing runs at the bridge layer before pagination so the page window reflects the collapsed view.	2026-04-02 01:51:19 +08:00
Acbox	f1dd30a388	fix: strip agent tags from IM/WebUI output and fix attachment display after refresh Three independent bugs fixed: 1. IM channels were sending raw <attachments>/<reactions>/<speech> tag blocks alongside file attachments. Now ExtractAssistantOutputs strips these tags before building the outbound channel message. 2. WebUI rendered these tags as markdown after page refresh. Now extractMessageText strips agent tags for non-user messages. 3. WebUI lost attachment blocks after refresh because convertMessagesToChats did not call buildAssetBlocks when merging assistant messages into a pending tool-call group. Also made LinkOutboundAssets session-aware so assets are linked to the correct assistant message.	2026-03-31 15:11:57 +08:00
Yiming Qi	ba0569c1fa	feat(email): use popup flow for gmail oauth callback (#307 ) * feat(email): use popup flow for gmail oauth callback * fix(email): satisfy lint in oauth callback helper	2026-03-29 20:13:45 +08:00
Acbox	33f39c20ff	feat: add per-message model and reasoning effort override Allow users to select a different model and reasoning effort level directly from the chat input toolbar, overriding the bot defaults on a per-message basis. The backend accepts optional model_id and reasoning_effort parameters via both WebSocket and HTTP APIs, with request-level values taking priority over bot/session settings. - Backend: extend wsClientMessage and LocalChannelMessageRequest with model_id/reasoning_effort fields; add ReasoningEffort to ChatRequest; update resolver to prioritize request-level reasoning effort - Frontend: add ModelOptions and ReasoningEffortSelect shared components; refactor model-select to reuse ModelOptions; add model/reasoning selectors to chat input toolbar; initialize from bot settings - Regenerate swagger spec and TypeScript SDK	2026-03-29 19:45:55 +08:00
Acbox	86d83108d9	fix: use readline-capable shell for interactive terminal sessions Container terminals were echoing raw ANSI escape sequences (^[[A, ^[[B, etc.) instead of handling arrow keys because /bin/sh (dash/ash) lacks readline support. Two changes fix this: 1. Bridge execPTY now directly exec's bare paths (e.g. /bin/bash) instead of always wrapping through "/bin/sh -c", preserving readline behavior. 2. Terminal handler detects bash/zsh in the container and prefers them over /bin/sh for interactive PTY sessions.	2026-03-29 19:31:24 +08:00
Acbox	0e646625bf	feat: add compaction ratio setting to control partial context compaction Allow users to configure what percentage of older messages to compact, keeping the most recent portion intact. Default ratio is 80%, meaning the oldest 80% of uncompacted messages are summarized while the newest 20% remain as-is for full-fidelity context.	2026-03-29 19:14:43 +08:00
Acbox	bcda6f6fe6	refactor: replace Load More with Pagination across frontend and backend - Replace all "Load More" / "Show More" buttons with Pagination components in model-list, bot-compaction, and bot-heartbeat views - Convert backend log APIs (compaction, heartbeat, schedule) from cursor-based (before+limit) to offset+limit pagination with total_count - Update SQL queries to use OFFSET+LIMIT and add COUNT queries - Add shared parseOffsetLimit helper in handler_helpers.go - Regenerate sqlc, Swagger docs, and TypeScript SDK - Clean up unused i18n keys (loadMore, showMore, history.loadMore)	2026-03-29 18:49:30 +08:00
Acbox	6c2da4b2f5	feat(web,server): expose server version and commit hash in Profile page Add version and commit_hash fields to the /ping endpoint response, sourced from the existing internal/version package (ldflags or Go build info). The frontend capabilities store reads these values and displays them as badges at the bottom of the Profile page.	2026-03-29 17:38:33 +08:00
晨苒	0b56fb0bf7	fix(mail): callback URL for Gmail OAuth (#303 )	2026-03-29 16:29:34 +08:00
Acbox	0730ff2945	refactor: remove max_context_load_time and max_context_tokens from bot settings These two fields controlled history context window (time-based) and token-based trimming. They are no longer needed — the resolver now always uses the hardcoded 24-hour default and skips token-based history trimming.	2026-03-29 00:00:10 +08:00
Acbox	90ac222bc9	feat: auto-create search/tts providers at startup with enable toggle - Add `enable` column (default false) to search_providers and tts_providers tables - Auto-create default entries for all provider types on startup (disabled by default) - Add enable/disable Switch toggle in frontend for both search and TTS providers - Show green status dot in sidebar for enabled providers, sort enabled first - Filter bot settings dropdowns to only show enabled providers	2026-03-28 23:47:09 +08:00
Acbox Liu	bca13a13fa	feat(web): introduce a brand new web ui (#281 ) * feat(web): introduce a brand new web ui * refactor(ui): align chat sidebar and UI components with Figma design - Restyle chat page sidebar: header with icon/title, search input, section labels, and "new session" footer button - Simplify bot-sidebar and session-sidebar to card-based layout matching Figma session card design (58px height, 26px avatar, status dots) - Update master-detail-sidebar-layout with bg-sidebar and 53px header - Unify border-radius across UI components to rounded-lg (8px): Card, Toggle, Alert, Popover, Item; Dialog uses rounded-xl (12px) * refactor(ui): move shared theme and design tokens from web to ui package CSS variables, @theme inline mappings, @custom-variant, and base layer styles now live in @memohai/ui/style.css. The web app imports them via @import "@memohai/ui/style.css", keeping only the Tailwind entry point and web-specific imports (markstream-vue, @source). * refactor(ui): apply flat design system from Figma spec Overhaul @memohai/ui component styles to match the new "high-contrast, flat" design language defined in the Figma design spec (DESIGN.md). Theme: - --primary-foreground: pure white -> #fafafa - --ring: purple -> foreground color (focus rings no longer use brand purple) Atoms (zero shadow, monochrome): - Button: default bg-primary -> bg-foreground; add explicit "primary" variant for Send CTA - Badge: rounded-full -> rounded-sm; default bg-primary -> bg-foreground; add warning/outline/size variants - Alert: rounded-lg -> rounded-[10px]; remove shadow-sm; destructive drops bg-red-50 - Card: add shadow-lg, rounded-lg -> rounded-xl, py-6 -> p-6 - Input/Textarea: remove shadow, text-sm -> text-[16px], focus ring non-purple - Checkbox: checked bg-primary -> bg-foreground - Switch: checked bg-primary -> bg-foreground - RadioGroup: indicator fill-primary -> fill-foreground - Slider: range/thumb border-primary -> border-foreground Floating panels (shadow-md): - DropdownMenu/Combobox/Select/ContextMenu Content: shadow-lg -> shadow-md - Sheet: shadow-2xl -> shadow-lg - MenuItem destructive focus: bg-red-50 -> bg-accent Other: - Pagination active: bg-foreground text-background (black, not purple) - Item variants: bg-transparent -> bg-background/bg-accent - Tabs active: shadow-sm -> border-border - Toggle: remove shadow-xs, unify hover to accent - SelectTrigger/NativeSelect: remove shadow, unify focus ring Docs: - Add packages/ui/DESIGN.md with full design system spec - Simplify apps/web/AGENTS.md, remove duplicated design info, reference DESIGN.md * refactor(chat-ui): restructure chat page components and styles (#288) * refactor(chat-ui): restructure chat page components and styles * feat(chat): add collapsible sidebar for both sides * feat(ui): add PinInput and BadgeCount components, align styles with Figma spec New components: - PinInput (OTP input): PinInput, PinInputGroup, PinInputSlot, PinInputSeparator based on reka-ui PinInput primitives with flat border-stitching design - BadgeCount: circular numeric counter with default/destructive/secondary variants Style updates to match Figma design: - Sonner: border-radius from 1rem to var(--radius-lg) (10px) - Table: add border border-border rounded-sm to container - TagsInput: remove shadow-xs, rounded-md -> rounded-lg, ring-[3px] -> ring-2 Updated DESIGN.md with all new component specifications. * chore: move up css to ui package * refactor: change npm package from @memoh to @memohai * Feat/chat layout (#295) * refactor(chat-ui): restructure chat page components and styles * feat(chat): add collapsible sidebar for both sides * fix: update chat page icon * style: refine UI components appearance * style: refine UI components appearance * chore(ci): update lock file * refactor: new layout * chore: adjust style * fix: tauri ui size * chore: remove bot session metadata * refactor: text size and muted color * fix: indirect height of bot-details pages * feat: add 5 icons * refactor: polish chat flow and settings navigation labels Persist chat selection across pages, simplify provider/settings sidebars, and refine chat/session UX so navigation and composer behavior feel consistent without extra session/provider jumps. * docs(web): refresh AGENTS frontend architecture guide Expand and align the web AGENTS documentation with the current route structure, component inventory, chat transport flow, and store responsibilities so implementation guidance matches the codebase. --------- Co-authored-by: Quincy <69751197+dqygit@users.noreply.github.com>	2026-03-28 19:15:39 +08:00
BBQ	7f9d6e4aba	feat(acl): redesign ACL with conversation scope selector (#297 ) Backend - New subject kinds: all / channel_identity / channel_type - Source scope fields on bot_acl_rules: source_channel, source_conversation_type, source_conversation_id, source_thread_id - Fix source_scope_check constraint: resolve source_channel server-side (channel_type → subject_channel_type; channel_identity → DB lookup) - Add GET /bots/:id/acl/channel-types/:type/conversations to list observed conversations by platform type - ListObservedConversations: include private/DM chats, normalise conversation_type; COALESCE(name, handle) for display name - enrichConversationAvatar: persist entry.Name → conversation_name (keeps Telegram group titles current on every message) - Unify Priority type to int32 across Go types to match DB INTEGER; remove all int/int32 casts in service layer - Fix duplicate nil guard in Evaluate; drop dead SourceScope.Channel field - Migration 0048_acl_redesign Frontend - Drag-and-drop rule priority reordering (SortableJS/useSortable); fix reorder: compute new order from oldIndex/newIndex directly, not from the array (which useSortable syncs after onEnd) - Conversation scope selector: searchable popover backed by observed conversations (by identity or platform type); collapsible manual-ID fallback - Display: name as primary label, stable channel·type·id always shown as subtitle for verification - bot-terminal: accessibility fix on close-tab button (keyboard events) - i18n: drag-to-reorder, conversation source, manual IDs (en/zh) Tests: update fakeChatACL to Evaluate interface; fix SourceScope literals. SDK/spec regenerated.	2026-03-28 01:06:13 +08:00
Yiming Qi	64378d29ed	feat: openai codex support (#292 ) * feat(web): add provider oauth management ui * feat: add OAuth callback support on port 1455 * feat: enhance reasoning effort options and support for OpenAI Codex OAuth * feat: update twilight-ai dependency to v0.3.4 * refactor: promote openai-codex to first-class client_type, remove auth_type Replace the previous openai-responses + metadata auth_type=openai-codex-oauth combo with a dedicated openai-codex client_type. OAuth requirement is now determined solely by client_type, eliminating the auth_type concept from the LLM provider domain entirely. - Add openai-codex to DB CHECK constraint (migration 0047) with data migration - Add ClientTypeOpenAICodex constant and dedicated SDK/probe branches - Remove AuthType from SDKModelConfig, ModelCredentials, TriggerConfig, etc. - Simplify supportsOAuth to check client_type == openai-codex - Add conf/providers/codex.yaml preset with Codex catalog models - Frontend: replace auth_type selector with client_type-driven OAuth UI --------- Co-authored-by: Acbox <acbox0328@gmail.com>	2026-03-27 19:30:45 +08:00
Kathent	f61666479c	fix(server): check container running (#293 )	2026-03-27 16:12:28 +08:00
Acbox	da2e999ce3	feat: searchable timezone select & bot timezone priority - Add reusable TimezoneSelect component with search and UTC offset labels - Replace plain Select with searchable TimezoneSelect in profile settings, bot settings, and browser context settings - Move bot timezone setting from header dialog into bot settings tab - Resolve timezone with bot > user > system priority for all LLM-facing time formatting (user message header, system prompt, heartbeat, tools, memory extraction) - Format tool output timestamps (history, contacts) in resolved timezone	2026-03-26 21:00:21 +08:00
Acbox	65b2797626	refactor: unify SDK model factories into internal/models Move CreateModel, BuildReasoningOptions, ReasoningBudgetTokens and related types from internal/agent to internal/models as NewSDKChatModel, SDKModelConfig, etc. This eliminates duplicate ClientType constants and centralises all Twilight AI SDK instance creation in a single package. NewSDKEmbeddingModel now accepts a clientType parameter and dispatches to the native Google embedding provider for google-generative-ai, instead of always using the OpenAI-compatible endpoint.	2026-03-26 20:08:35 +08:00
Yiming Qi	03ba13e7e5	feat: add timezone support for schedule and user runtime (#282 )	2026-03-26 01:32:02 +08:00
Acbox	dd1b588e95	chore: remove @memohai/cli	2026-03-24 21:34:04 +08:00
Acbox	5963e787a9	fix(agent): preserve inline tags in message history Stop stripping <attachments>, <reactions>, and <speech> tags from assistant messages so the LLM retains full context across turns.	2026-03-24 19:45:40 +08:00
Acbox	e9c9ed5ab1	fix(agent): route native images into user message for vision models Images sent by users were silently dropped when the model supported vision: routeAttachmentsByCapability classified them as "Native", but extractFileRefPaths only collected "Fallback" (tool_file_ref) paths, so the image data URL was computed and then discarded — the model saw neither the image nor its container path. - Add InlineImages field to RunConfig to carry native image data - Replace extractFileRefPaths with extractAttachmentPaths that collects paths from both Native (FallbackPath) and Fallback attachments so the YAML header always lists every attachment - Add extractNativeImageParts to extract inline image data URLs - Pass InlineImages as sdk.ImagePart in prepareRunConfig so the LLM receives the actual image content alongside the text query	2026-03-24 19:14:33 +08:00
Ran	93097d50b2	refactor(memory): replace sdk to twilight	2026-03-24 06:18:16 +08:00
晨苒	e2e3b69acf	feat(channel): add WeChat (weixin) adapter with QR code (#278 ) * feat(channel): add WeChat (weixin) adapter with QR code * fix(channel): fix weixin block streaming * chore(channel): update weixin logo	2026-03-22 23:28:57 +08:00
AlexMa233	609ca49cf5	feat: matrix support (part 1) (#242 ) * feat(channel): add Matrix adapter support * fix(channel): prevent reasoning leaks in Matrix replies * fix(channel): persist Matrix sync cursors * fix(channel): improve Matrix markdown rendering * fix(channel): support Matrix attachments and multimodal history * fix(channel): expand Matrix reply media context * fix(handlers): allow media downloads for chat-access bots * fix(channel): classify Matrix DMs as direct chats * fix(channel): auto-join Matrix room invites * fix(channel): resolve Matrix room aliases for outbound send * fix(web): use Matrix brand icon in channel badges Replace the generic Matrix hashtag badge with the official brand asset so channel badges feel recognizable and fit the circular mask cleanly. * fix(channel): add Matrix room whitelist controls Let Matrix bots decide whether to auto-join invites and restrict inbound activity to allowed rooms or aliases. Expose the new controls in the web settings UI with line-based whitelist input so access rules stay explicit. * fix(channel): stabilize Matrix multimodal follow-ups and settings * fix(flow): avoid gosec panic on byte decoding * fix: fix golangci-lint * fix(channel): remove Matrix built-in ACL * fix(channel): preserve Matrix image captions * fix(channel): validate Matrix homeserver and sync access Fail Matrix connections early when the homeserver, access token, or /sync capability is misconfigured so bot health checks surface actionable errors. * fix(channel): preserve optional toggles and relax Matrix startup validation * fix(channel): tighten Matrix mention fallback parsing * fix(flow): skip structured assistant tool-call outputs * fix(flow): resolve merged resolver duplication Keep the internal agent resolver implementation after merging main so split helper files do not redeclare flow symbols. Restore user message normalization in sanitize and persistence paths to keep flow tests and command packages building. * fix(flow): remove unused merged resolver helper Drop the leftover truncate helper and import from the resolver merge fix so golangci-lint passes again without affecting flow behavior. --------- Co-authored-by: Acbox Liu <acbox0328@gmail.com>	2026-03-22 21:55:34 +08:00

1 2 3 4 5 ...

293 Commits