Memoh

mirror of https://github.com/memohai/Memoh.git synced 2026-04-27 07:16:19 +09:00

Author	SHA1	Message	Date
Acbox Liu	1680316c7f	refactor(agent): remove agent gateway instead of twilight sdk (#264 ) * refactor(agent): replace TypeScript agent gateway with in-process Go agent using twilight-ai SDK - Remove apps/agent (Bun/Elysia gateway), packages/agent (@memoh/agent), internal/bun runtime manager, and all embedded agent/bun assets - Add internal/agent package powered by twilight-ai SDK for LLM calls, tool execution, streaming, sential logic, tag extraction, and prompts - Integrate ToolGatewayService in-process for both built-in and user MCP tools, eliminating HTTP round-trips to the old gateway - Update resolver to convert between sdk.Message and ModelMessage at the boundary (resolver_messages.go), keeping agent package free of persistence concerns - Prepend user message before storeRound since SDK only returns output messages (assistant + tool) - Clean up all Docker configs, TOML configs, nginx proxy, Dockerfile.agent, and Go config structs related to the removed agent gateway - Update cmd/agent and cmd/memoh entry points with setter-based ToolGateway injection to avoid FX dependency cycles * fix(web): move form declaration before computed properties that reference it The `form` reactive object was declared after computed properties like `selectedMemoryProvider` and `isSelectedMemoryProviderPersisted` that reference it, causing a TDZ ReferenceError during setup. * fix: prevent UTF-8 character corruption in streaming text output StreamTagExtractor.Push() used byte-level string slicing to hold back buffer tails for tag detection, which could split multi-byte UTF-8 characters. After json.Marshal replaced invalid bytes with U+FFFD, the corruption became permanent — causing garbled CJK characters (�) in agent responses. Add safeUTF8SplitIndex() to back up split points to valid character boundaries. Also fix byte-level truncation in command/formatter.go and command/fs.go to use rune-aware slicing. * fix: add agent error logging and fix Gemini tool schema validation - Log agent stream errors in both SSE and WebSocket paths with bot/model context - Fix send tool `attachments` parameter: empty `items` schema rejected by Google Gemini API (INVALID_ARGUMENT), now specifies `{"type": "string"}` - Upgrade twilight-ai to d898f0b (includes raw body in API error messages) * chore(ci): remove agent gateway from Docker build and release pipelines Agent gateway has been replaced by in-process Go agent; remove the obsolete Docker image matrix entry, Bun/UPX CI steps, and agent-binary build logic from the release script. * fix: preserve attachment filename, metadata, and container path through persistence - Add `name` column to `bot_history_message_assets` (migration 0034) to persist original filenames across page refreshes. - Add `metadata` JSONB column (migration 0035) to store source_path, source_url, and other context alongside each asset. - Update SQL queries, sqlc-generated code, and all Go types (MessageAsset, AssetRef, OutboundAssetRef, FileAttachment) to carry name and metadata through the full lifecycle. - Extract filenames from path/URL in AttachmentsResolver before clearing raw paths; enrich streaming event metadata with name, source_path, and source_url in both the WebSocket and channel inbound ingestion paths. - Implement `LinkAssets` on message service and `LinkOutboundAssets` on flow resolver so WebSocket-streamed bot attachments are persisted to the correct assistant message after streaming completes. - Frontend: update MessageAsset type with metadata field, pass metadata through to attachment items, and reorder attachment-block.vue template so container files (identified by metadata.source_path) open in the sidebar file manager instead of triggering a download. * refactor(agent): decouple built-in tools from MCP, load via ToolProvider interface Migrate all 13 built-in tool providers from internal/mcp/providers/ to internal/agent/tools/ using the twilight-ai sdk.Tool structure. The agent now loads tools through a ToolProvider interface instead of the MCP ToolGatewayService, which is simplified to only manage external federation sources. This enables selective tool loading and removes the coupling between business tools and the MCP protocol layer. * refactor(flow): split monolithic resolver.go into focused modules Break the 1959-line resolver.go into 12 files organized by concern: - resolver.go: core orchestration (Resolver struct, resolve, Chat, prepareRunConfig) - resolver_stream.go: streaming (StreamChat, StreamChatWS, tryStoreStream) - resolver_trigger.go: schedule/heartbeat triggers - resolver_attachments.go: attachment routing, inlining, encoding - resolver_history.go: message loading, deduplication, token trimming - resolver_store.go: persistence (storeRound, storeMessages, asset linking) - resolver_memory.go: memory provider integration - resolver_model_selection.go: model selection and candidate matching - resolver_identity.go: display name and channel identity resolution - resolver_settings.go: bot settings, loop detection, inbox - user_header.go: YAML front-matter formatting - resolver_util.go: shared utilities (sanitize, normalize, dedup, UUID) * fix(agent): enable Anthropic extended thinking by passing ReasoningConfig to provider Anthropic's thinking requires WithThinking() at provider creation time, unlike OpenAI which uses per-request ReasoningEffort. The config was never wired through, so Claude models could not trigger thinking. * refactor(agent): extract prompts into embedded markdown templates Move inline prompt strings from prompt.go into separate .md files under internal/agent/prompts/, using {{key}} placeholders and a simple render engine. Remove obsolete SystemPromptParams fields (Language, MaxContextLoadTime, Channels, CurrentChannel) and their call-site usage. * fix: lint	2026-03-19 13:31:54 +08:00
BBQ	68745133b7	fix(inbound): use bot owner token for agent gateway callbacks (#254 ) * feat(access): add guest chat ACL and simplify bot access Unify bot chat permissions around owner and guest ACL so public access, whitelist, and blacklist share a single model. Remove unused sharing paths, add searchable platform identity controls, and normalize Feishu identities to stable open_id records. * fix(web): format access control panel Include the post-commit formatting changes applied to the access control UI so the branch stays clean and the PR reflects the final rendered layout. * fix(migrations): drop legacy bot tables before bots Ensure the init down migration removes bot_members and bot_preauth_keys before dropping bots so full rollback succeeds after the ACL refactor. * feat(acl): add source-aware chat trigger rules Support channel-, conversation-, and thread-scoped ACL rules while keeping allow_guest, whitelist, and blacklist compatible. Also expose observed conversation candidates and normalize channel identity rules to their own platform. * fix(lint): resolve golangci-lint errors after rebase - Remove unused receivers and parameters in fakeRows/Service methods - Delete unused makeNoRow helper and toParticipantFields function - Fix gci/gofumpt formatting * fix(lint): fix gci import formatting in acl types and handler * fix(acl): tighten observed group and thread selection (#245) Use inbox plus persisted messages to discover observed group and thread routes, and lock scope fields after selecting a concrete observed target. This keeps Telegram group candidates visible and prevents contradictory private/group scope edits. * chore: regenerate sqlc swagger and sdk after rebase onto main * fix(inbound): use bot owner token for agent gateway callbacks The inbound channel processor issued a JWT for the chatting user's identity. When the agent called back into container/MCP endpoints (e.g. /bots/{id}/tools, /bots/{id}/mcp-stdio), AuthorizeBotAccess rejected non-owner users with HTTP 403 "bot access denied". Resolve the bot owner via PolicyService and issue the downstream token under the owner's identity, consistent with schedule, heartbeat, and email gateways. The chatting user's identity is still tracked via SourceChannelIdentityID and identity headers.	2026-03-16 23:05:23 +08:00
Acbox	020c74c2bc	chore(fix): lint	2026-03-15 00:58:38 +08:00
Acbox	ac8a935545	refactor: remove bot type	2026-03-15 00:42:09 +08:00
BBQ	839e63acda	feat(access): add guest chat ACL (#235 )	2026-03-14 17:15:41 +08:00
Fodesu	b46e494d3a	feat(tts): introduce `TTS` system (#195 )	2026-03-13 02:49:52 +08:00
Acbox	9b771acaa8	fix: slash commands in group chats trigger all bots instead of targeted one - In group chats, only process slash commands when the message is directed at this bot (via @mention or reply-to-bot), preventing all bots from responding to the same command. - Use raw_text metadata (before quote/forward context prepending) for command detection so quoted content like "/fs" doesn't accidentally match a command. - Fix isTelegramBotMentioned text_mention entity check to verify the mentioned bot matches the current bot, not just any bot.	2026-03-12 20:08:55 +08:00
Acbox	e9059fddda	refactor: defer user message persistence to storeRound for atomic writes User messages from channel inbound (Telegram, Discord, Feishu, etc.) were previously persisted before the agent runs. Now they are written together with assistant/tool messages at the end of a conversation turn (or on abort), matching the behavior of WebSocket and sync chat paths.	2026-03-12 18:59:33 +08:00
Acbox	bb26d18757	fix(command): add missing command handler wiring and lint fixes Wire SetCommandHandler into ChannelInboundProcessor so slash commands are intercepted before reaching the LLM. Also apply lint fixes across command package (strconv.Itoa, comment formatting, unused code removal) and remove obsolete tool-call-browser.vue component.	2026-03-11 19:05:55 +08:00
Acbox	1da251885d	feat(agent): add extensible tag interception system and inline reactions Refactor the attachment tag extraction into a generic TagResolver/StreamTagExtractor system that supports multiple custom tags. Implement <reactions> tag allowing the agent to embed emoji reactions directly in text responses, dispatched as side-effects through the channel reactor interface. - Add TagResolver interface and StreamTagExtractor streaming state machine - Refactor AttachmentsStreamExtractor as backward-compatible wrapper - Add reactionsResolver and ReactionDeltaAction stream event - Wire reaction dispatch in Go channel inbound processor - Fix .gitignore to scope compiled binary patterns to repo root	2026-03-11 17:43:30 +08:00
Acbox	2debfb496c	fix(channel): resolve attachment filename and prevent duplicate sends - Derive attachment name from path basename when not explicitly set in parseAttachmentDelta, fixing the "file.bin" fallback on Telegram. - Infer correct AttachmentType (image/audio/video) from MIME in applyAssetToAttachment instead of keeping the generic "file" type. - Remove outboundAttachments re-attachment to final messages since attachments are already delivered during streaming via StreamEventAttachment, preventing duplicate file sends on platforms.	2026-03-11 17:00:07 +08:00
Acbox	30653fbdbf	fix(agent): reject send tool when targeting the same conversation Pass replyTarget through the full pipeline (ChatRequest → gateway identity → agent headers → MCP session) so the send tool can detect when the destination matches the current conversation and return an error guiding the agent to reply directly instead.	2026-03-11 16:59:42 +08:00
Menci	36d50738b5	fix(channel): consistent markdown rendering across all Telegram paths (#210 ) - Extract ContainsMarkdown to shared channel package - Auto-detect markdown in normalizeOutboundMessage and MCP send tool - Apply markdown-to-HTML conversion during streaming deltas, not just on the final message - Remove resolveTelegramParseMode which incorrectly returned Telegram's native "Markdown" mode instead of converting to HTML - Fix all 14 Telegram send/edit paths for consistent parse mode handling - Reset parseMode for plain-text error messages to avoid HTML corruption	2026-03-09 13:06:44 +08:00
Ringo.Typowriter	e6a6dbe3f6	feat(channel): add QQ channel support and image message pipeline (#199 ) * feat(channel): add qq adapter and outbound delivery * feat(channel): ingest inbound qq messages * feat(web): expose qq channel in management ui * feat(channel): support qq attachment ingestion * fix(mcp): fail read raw immediately for missing files * fix(agent): parse inline image data into native image parts * test(agent): align read_media tool tests with SDK options * fix(channel): harden qq image delivery and reconnect loop Avoid data URLs for qq channel images, reset reconnect backoff after healthy sessions, and fall back gracefully for malformed public image URLs. * fix(channel): restore qq media delivery and target resolution * fix(qq,mcp,agent): fix message/qq regressions and pass go lint * fix(qq,agent): validate inline base64 and sync heartbeat seq * fix(qq): validate remote voice mime for upload checks * fix(qq): fall back intents and restore adapter wiring * fix(qq): prevent final text leakage and dedupe persisted inbound query	2026-03-07 17:12:06 +08:00
0x24a	ac405c49e2	fix(channel): return success when bind code is re-checked by same identity (#201 ) * fix(channel): return success when bind code is re-checked by same identity * style: run gofmt	2026-03-07 15:07:32 +08:00
BBQ	3feb03aca7	ci: add go lint and race test workflow (#187 )	2026-03-05 11:25:33 +08:00
Ringo.Typowriter	0a2a17ecc8	feat(agent): add readMedia tool for model to view the image (#165 ) * feat(agent): add readMedia tool for loading local images into model context * feat(channel/inbound): include container attachment refs in inbound query * fix(agent): preserve ImagePart literal typing in buildNativeImageParts * chore: rename tool --------- Co-authored-by: 晨苒 <16112591+chen-ran@users.noreply.github.com>	2026-03-04 11:24:01 +08:00
Acbox	f0517a3a1f	fix: inbox indirectly push notidy item into context	2026-03-03 20:38:36 +08:00
Menci	b1925bf2be	feat(telegram): use sendMessageDraft for streaming in private chats (#174 ) * feat(telegram): use sendMessageDraft for streaming in private chats Use Telegram Bot API 9.3's sendMessageDraft to stream partial messages with smooth animation in private chats, replacing the sendMessage + editMessageText approach. Group/channel chats keep the existing edit-based streaming. - Add sendTelegramDraft() for the sendMessageDraft API - Detect private chats via conversation_type metadata in OpenStream - Use 300ms throttle for drafts (vs 5s for edits) - Send permanent messages at tool call boundaries and on final event - Reset buffer atomically in StreamEventFinal to prevent duplicate messages when multiple final events fire (one per assistant output) * test(telegram): improve draft mode test assertions Add sendTextForTest hook for sendTelegramTextReturnMessage to enable direct assertion of send calls. Clean up residual unused variables and replace indirect assertions with explicit mock-based verification.	2026-03-03 16:01:18 +08:00
Acbox Liu	fe10abf3fc	refactor: inbox (#137 ) * refactor: inbox * fix: migrations * fix: migrations	2026-02-26 20:16:02 +08:00
Ran	5e12b5a53f	fix: ensure unifying on hardcoded /data mount path	2026-02-24 03:35:27 +08:00
Acbox	ac929f9f44	feat: add message id in user header	2026-02-23 00:06:15 +08:00
Acbox Liu	c591af14b0	feat: bot inbox (#77 ) * feat: bot inbox * feat: unified header * fix: missing tool_call usage * feat: add group name in header	2026-02-22 01:27:24 +08:00
Acbox	7b12fb0b0c	fix: send file out of `/data/media` failed	2026-02-21 00:04:38 +08:00
Acbox	1a78ba3f53	feat: add platform metadata in contacts	2026-02-20 22:19:15 +08:00
Acbox	6b7c3db952	refactor: process user header in go side	2026-02-20 21:40:13 +08:00
Ringo.Typowriter	53f080503c	fix(telegram): aggregate media_group inbound and preserve ordering with (#69 ) attachment fallback query	2026-02-19 17:14:12 +08:00
BBQ	bc374fe8cd	refactor: content-addressed assets, cross-channel multimodal, infra simplification (#63 ) * refactor(attachment): multimodal attachment refactor with snapshot schema and storage layer - Add snapshot schema migration (0008) and update init/versions/snapshots - Add internal/attachment and internal/channel normalize for unified attachment handling - Move containerfs provider from internal/media to internal/storage - Update agent types, channel adapters (Telegram/Feishu), inbound and handlers - Add containerd snapshot lineage and local_channel tests - Regenerate sqlc, swagger and SDK * refactor(media): content-addressed asset system with unified naming - Replace asset_id foreign key with content_hash as sole identifier for bot_history_message_assets (pure soft-link model) - Remove mime, size_bytes, storage_key from DB; derive at read time via media.Resolve from actual storage - Merge migrations 0008/0009 into single 0008; keep 0001 as canonical schema - Add Docker initdb script for deterministic migration execution order - Fix cross-channel real-time image display (Telegram → WebUI SSE) - Fix message disappearing on refresh (null assets fallback) - Fix file icon instead of image preview (mime derivation from storage) - Unify AssetID → ContentHash naming across Go, Agent, and Frontend - Change storage key prefix from 4-char to 2-char for directory sharding - Add server-entrypoint.sh for Docker deployment migration handling * refactor(infra): embedded migrations, Docker simplification, and config consolidation - Embed SQL migrations into Go binary, removing shell-based migration scripts - Consolidate config files into conf/ directory (app.example.toml, app.docker.toml, app.dev.toml) - Simplify Docker setup: remove initdb.d scripts, streamline nginx config and entrypoint - Remove legacy CLI, feishu-echo commands, and obsolete incremental migration files - Update install script and docs to require sudo for one-click install - Add mise tasks for dev environment orchestration * chore: recover migrations --------- Co-authored-by: Acbox <acbox0328@gmail.com>	2026-02-19 00:20:27 +08:00
BBQ	df7876a30c	feat: add media asset system, channel lifecycle refactor, and chat attachments (#54 )	2026-02-17 19:06:46 +08:00
Acbox	38753ef054	refactor: channel tools	2026-02-15 17:48:20 +08:00
BBQ	faaadf14c5	feat(channel): pass conversation type through to agent gateway and persist in route Propagate conversation type (direct/group/thread) from channel adapters all the way to the agent prompt. Store conversation_type on bot_channel_routes so the bot knows whether a message originates from a p2p chat, group, or thread. Schema changes are folded into the 0001 init migration (destructive update).	2026-02-13 06:22:24 +08:00
BBQ	85251a2905	refactor(core): codebase quality cleanup - Remove user-level model settings (chat_model_id, memory_model_id, embedding_model_id, max_context_load_time, language) from users table - Merge migration 0002 into 0001, remove compatibility migrations - Delete dead conversation/resolver.go (1177 lines, only flow/resolver.go used) - Remove type aliases (Chat=Conversation, types_alias.go) - Fix SQL: remove AND false stub, fix UpdateChatTitle model_id, reset model IDs in DeleteSettings, add preauth expiry filter, add ListMessages limit, remove 10 dead queries - Extract shared handler helpers (RequireChannelIdentityID, AuthorizeBotAccess) - Rename internal/router to internal/channel/inbound - Fix identity confusion: remove UserID->ChannelIdentityID fallbacks - Fix all _ = var patterns with proper error logging - Fix error propagation: storeMessages, rescheduleJob, botContainerID - Fix naming: ModelId->ModelID, active->is_active, Duration semantic fix - Remove dead code: mcpService, ReplyTarget, callMCPServer, sshShellQuote, buildSessionMetadata, ChatRequest.Language, TriggerPayload.ChatID - Fix code quality: errors.Is(), remove goto, CreateHuman deprecated - Remove Enable model endpoint and user-level settings CLI commands - Regenerate sqlc, swagger, SDK	2026-02-12 23:50:48 +08:00

32 Commits