PNArchives/Memoh

Fork 0

mirror of https://github.com/memohai/Memoh.git synced 2026-04-27 07:16:19 +09:00

Files

T

Acbox 661d742750 feat: full api server

2026-01-10 21:55:39 +08:00

8.4 KiB

Raw Blame History

Agent API 文档

概述

Agent API 提供了一个智能对话代理接口，支持流式响应、记忆管理和工具调用。

端点

POST `/agent/stream` 🔒 需要认证

与 AI Agent 进行流式对话。

权限要求

需要有效的 Bearer token
自动使用当前登录用户的身份

请求

Headers:

Authorization: Bearer <your_jwt_token>
Content-Type: application/json

Body:

{
  "message": "你好，请介绍一下你自己",
  "maxContextLoadTime": 60,
  "language": "Chinese"
}

参数说明:

字段	类型	必需	默认值	说明
message	string	是	-	用户消息内容
maxContextLoadTime	number	否	从设置读取或60	加载上下文的时间范围（分钟，1-1440）
language	string	否	从设置读取或"Same as user input"	Agent 回复的首选语言

注意: maxContextLoadTime 和 language 如果在请求中未指定，将从用户设置（Settings）中读取。如果设置中也没有，则使用默认值。

响应

返回 Server-Sent Events (SSE) 流式响应。

Content-Type: text/event-stream

事件格式:

每个事件以 data: 开头，后跟 JSON 格式的数据：

data: {"type":"text-delta","text":"你"}

data: {"type":"text-delta","text":"好"}

data: {"type":"tool-call","toolName":"search_memory","args":{...}}

data: [DONE]

事件类型:

text-delta - 文本增量

{
  "type": "text-delta",
  "text": "文本片段"
}

tool-call - 工具调用

{
  "type": "tool-call",
  "toolName": "search_memory",
  "args": {
    "query": "搜索内容"
  }
}

[DONE] - 流结束标记
error - 错误事件

{
  "type": "error",
  "error": "错误消息"
}

错误响应

400 Bad Request - 模型配置未找到

{
  "success": false,
  "error": "Model configuration not found. Please configure your models in settings."
}

401 Unauthorized - 未认证

{
  "success": false,
  "error": "No bearer token provided"
}

500 Internal Server Error - 服务器错误

{
  "success": false,
  "error": "Failed to process request"
}

使用示例

JavaScript/TypeScript (EventSource)

async function streamAgentChat(message: string, token: string) {
  const response = await fetch('http://localhost:7002/agent/stream', {
    method: 'POST',
    headers: {
      'Authorization': `Bearer ${token}`,
      'Content-Type': 'application/json',
    },
    body: JSON.stringify({
      message,
      maxContextLoadTime: 60,
      language: 'Chinese',
    }),
  })

  if (!response.ok) {
    const error = await response.json()
    throw new Error(error.error)
  }

  const reader = response.body?.getReader()
  const decoder = new TextDecoder()

  if (!reader) throw new Error('No reader available')

  while (true) {
    const { done, value } = await reader.read()
    if (done) break

    const chunk = decoder.decode(value)
    const lines = chunk.split('\n')

    for (const line of lines) {
      if (line.startsWith('data: ')) {
        const data = line.slice(6)
        
        if (data === '[DONE]') {
          console.log('\n✅ Stream completed')
          return
        }

        try {
          const event = JSON.parse(data)
          
          if (event.type === 'text-delta' && event.text) {
            process.stdout.write(event.text)
          } else if (event.type === 'tool-call') {
            console.log(`\n[Tool: ${event.toolName}]`)
          } else if (event.type === 'error') {
            console.error('\n❌ Error:', event.error)
          }
        } catch (e) {
          // Skip invalid JSON
        }
      }
    }
  }
}

// 使用示例
const token = 'your_jwt_token_here'
await streamAgentChat('你好，请介绍一下你自己', token)

curl 示例

# 1. 登录获取 token
TOKEN=$(curl -X POST http://localhost:7002/auth/login \
  -H "Content-Type: application/json" \
  -d '{"username":"admin","password":"your_password"}' \
  | jq -r '.data.token')

# 2. 流式对话
curl -N -X POST http://localhost:7002/agent/stream \
  -H "Authorization: Bearer $TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "message": "你好，请介绍一下你自己",
    "maxContextLoadTime": 60,
    "language": "Chinese"
  }'

Python 示例

import requests
import json

def stream_agent_chat(message: str, token: str):
    url = 'http://localhost:7002/agent/stream'
    headers = {
        'Authorization': f'Bearer {token}',
        'Content-Type': 'application/json',
    }
    data = {
        'message': message,
        'maxContextLoadTime': 60,
        'language': 'Chinese',
    }

    with requests.post(url, headers=headers, json=data, stream=True) as response:
        response.raise_for_status()
        
        for line in response.iter_lines():
            if line:
                line_str = line.decode('utf-8')
                if line_str.startswith('data: '):
                    data = line_str[6:]
                    
                    if data == '[DONE]':
                        print('\n✅ Stream completed')
                        break
                    
                    try:
                        event = json.loads(data)
                        
                        if event.get('type') == 'text-delta' and event.get('text'):
                            print(event['text'], end='', flush=True)
                        elif event.get('type') == 'tool-call':
                            print(f"\n[Tool: {event['toolName']}]")
                        elif event.get('type') == 'error':
                            print(f"\n❌ Error: {event['error']}")
                    except json.JSONDecodeError:
                        pass

# 使用示例
token = 'your_jwt_token_here'
stream_agent_chat('你好，请介绍一下你自己', token)

工作原理

1. 认证验证

验证 JWT token
提取用户信息

2. 模型配置获取

从用户设置中获取：

Chat Model（对话模型）
Embedding Model（嵌入模型）
Summary Model（摘要模型）

3. Agent 创建

创建 Memory 实例（用于记忆管理）
创建 Agent 实例，配置回调函数：
- onReadMemory: 从数据库加载历史对话
- onSearchMemory: 搜索相关记忆
- onFinish: 保存对话到记忆

4. 流式响应

Agent 处理用户消息
实时流式返回生成的文本
报告工具调用事件
完成后自动保存到记忆

5. 记忆管理

自动加载近期对话历史作为上下文
使用向量搜索查找相关记忆
对话结束后自动保存

特性

✅ 流式响应

实时返回生成的文本
无需等待完整响应
更好的用户体验

✅ 记忆管理

自动保存对话历史
智能加载相关上下文
向量搜索相关记忆

✅ 工具调用

支持搜索记忆工具
可扩展其他工具
实时报告工具使用

✅ 个性化配置

每个用户使用自己的模型配置
可自定义语言偏好
可配置上下文加载时间

前置条件

1. 配置模型和 Agent 设置

使用 Settings API 配置默认模型和 Agent 参数：

curl -X PUT http://localhost:7002/settings \
  -H "Authorization: Bearer $TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "defaultChatModel": "chat-model-uuid",
    "defaultEmbeddingModel": "embedding-model-uuid",
    "defaultSummaryModel": "summary-model-uuid",
    "maxContextLoadTime": 60,
    "language": "Chinese"
  }'

详见 Settings API 文档

2. 创建模型配置

使用 Model API 创建模型：

curl -X POST http://localhost:7002/model \
  -H "Authorization: Bearer $TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "modelId": "gpt-4",
    "baseUrl": "https://api.openai.com/v1",
    "apiKey": "your-api-key",
    "clientType": "openai",
    "name": "GPT-4"
  }'

限制和注意事项

模型配置必需: 用户必须先配置好模型才能使用 Agent
流式连接: 需要支持 Server-Sent Events 的客户端
超时处理: 长时间无响应可能导致连接超时
并发限制: 建议每个用户同时只维护一个对话流

8.4 KiB Raw Blame History