feat: transcription support (#394)

* feat: expand speech provider support with new client types and configuration schema

* feat: add icon support for speech providers and update related configurations

* feat: add SVG support for Deepgram and Elevenlabs with Vue components

* feat: except *-speech client type in llm provider

* feat: enhance speech provider functionality with advanced settings and model import capabilities

* chore: remove go.mod replace

* feat: enhance speech provider functionality with advanced settings and model import capabilities

* chore: update go module dependencies

* feat: Ear and Mouth

* fix: separate ear/mouth page

* fix: separate audio domain and restore transcription templates

Move speech and transcription internals into the audio domain, restore template-driven transcription providers, and regenerate Swagger/SDK so the frontend can stop hand-calling /transcription-* APIs.

---------

Co-authored-by: aki <arisu@ieee.org>
This commit is contained in:
Acbox
2026-04-22 00:12:01 +08:00
committed by GitHub
parent fd8f1ec078
commit 925fdee478
70 changed files with 6612 additions and 1692 deletions
+2
View File
@@ -12,6 +12,7 @@ type Settings struct {
SearchProviderID string `json:"search_provider_id"`
MemoryProviderID string `json:"memory_provider_id"`
TtsModelID string `json:"tts_model_id"`
TranscriptionModelID string `json:"transcription_model_id"`
BrowserContextID string `json:"browser_context_id"`
Language string `json:"language"`
AclDefaultEffect string `json:"acl_default_effect"`
@@ -36,6 +37,7 @@ type UpsertRequest struct {
SearchProviderID string `json:"search_provider_id,omitempty"`
MemoryProviderID string `json:"memory_provider_id,omitempty"`
TtsModelID string `json:"tts_model_id,omitempty"`
TranscriptionModelID string `json:"transcription_model_id,omitempty"`
BrowserContextID string `json:"browser_context_id,omitempty"`
Language string `json:"language,omitempty"`
AclDefaultEffect string `json:"acl_default_effect,omitempty"`