CowAgent supports a wide range of mainstream large language models. Model interfaces live under the project’s models/ directory. Beyond text chat, several vendors also provide vision understanding, image generation, speech-to-text, text-to-speech, and embeddings — all of which can be invoked on demand in the Agent flow.
Capability Matrix
A snapshot of each vendor’s capabilities. “Text” refers to the main chat model; the remaining columns show which Agent capabilities the vendor can power.
| Vendor | Representative Models | Text | Vision | Image Gen | STT | TTS | Embedding |
|---|---|---|---|---|---|---|---|
| DeepSeek | deepseek-v4-flash / pro | ✅ | | | | | |
| MiniMax | MiniMax-M3 | ✅ | ✅ | ✅ | | ✅ | |
| Claude | claude-opus-4-8 | ✅ | ✅ | | | | |
| Gemini | gemini-3.5-flash | ✅ | ✅ | ✅ | | | |
| OpenAI | gpt-5.5, o-series | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| GLM | glm-5.1, glm-5v-turbo | ✅ | ✅ | | ✅ | | ✅ |
| Qwen | qwen3.7-plus | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Doubao | doubao-seed-2.0 series | ✅ | ✅ | ✅ | | | ✅ |
| Kimi | kimi-k2.6 | ✅ | ✅ | | | | |
| ERNIE | ernie-5.1 | ✅ | ✅ | | | | |
| MiMo | mimo-v2.5-pro / v2.5 | ✅ | ✅ | | | ✅ | |
| LinkAI | 100+ models from multiple vendors | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Custom | Local models / third-party proxies | ✅ | | | | | |
Every capability in the Web console (Vision / Image / STT / TTS / Embedding / Web Search) can be configured independently with its own vendor and model — there is no forced binding between them.
Option 1 (recommended): Manage models and capabilities online in the Web console, with no need to edit the configuration file.
Option 2: Edit config.json manually and fill in the model name and API key for the selected vendor. Every model also supports OpenAI-compatible access — just set bot_type to openai and configure open_ai_api_base and open_ai_api_key.
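As a rough illustration of the OpenAI-compatible route in Option 2, the sketch below assembles the relevant config.json entries and prints them as JSON. The keys bot_type, open_ai_api_base, and open_ai_api_key come from the text above; the "model" key name, the helper function, the base URL, and the key value are assumptions or placeholders for illustration, so check them against your own config.json before use.

```python
import json


def openai_compatible_config(model: str, api_base: str, api_key: str) -> dict:
    """Hypothetical helper: build config.json entries for OpenAI-compatible access."""
    if not api_key:
        raise ValueError("api_key must not be empty")
    return {
        "bot_type": "openai",          # named in the text: selects OpenAI-compatible access
        "model": model,                # ASSUMPTION: key name used for the model name
        "open_ai_api_base": api_base,  # named in the text: endpoint base URL
        "open_ai_api_key": api_key,    # named in the text: API key for that endpoint
    }


config = openai_compatible_config(
    model="deepseek-v4-flash",              # a model name from the capability matrix
    api_base="https://api.example.com/v1",  # placeholder URL, not a real endpoint
    api_key="YOUR_API_KEY",                 # placeholder, replace with your own key
)

# Print the fragment so it can be merged into config.json by hand.
print(json.dumps(config, indent=2))
```

The printed fragment is meant to be merged into an existing config.json rather than to replace it; any other settings already in the file stay as they are.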