diff --git a/.env.example b/.env.example index 4b95dc8fc..1fd9e9de2 100644 --- a/.env.example +++ b/.env.example @@ -55,6 +55,11 @@ LLM_SERVER_PROVIDER= LLM_SERVER_CONFIG_PATH= LLM_SERVER_LEGACY_REASONING= LLM_SERVER_PRESERVE_REASONING= +## Atlas Cloud example (OpenAI-compatible — https://www.atlascloud.ai/?utm_source=github&utm_medium=link&utm_campaign=pentagi) +# LLM_SERVER_URL=https://api.atlascloud.ai/v1 +# LLM_SERVER_KEY=your_atlascloud_api_key +# LLM_SERVER_MODEL=deepseek-ai/deepseek-v4-pro +# LLM_SERVER_PROVIDER=openai ## Ollama LLM provider (Local Server or Cloud) # Local: http://ollama-server:11434, Cloud: https://ollama.com diff --git a/README.md b/README.md index fc5ca5872..4fd703706 100644 --- a/README.md +++ b/README.md @@ -14,6 +14,14 @@ +

+ + Atlas Cloud + +

+ +> 🎁 **[Atlas Cloud](https://www.atlascloud.ai/?utm_source=github&utm_medium=link&utm_campaign=pentagi)** is a full-modal, OpenAI-compatible AI inference platform (DeepSeek, Qwen, GLM, Kimi, MiniMax, …). PentAGI can run on it via the custom LLM provider — see [Atlas Cloud (OpenAI-compatible)](#atlas-cloud-openai-compatible). + ## Table of Contents - [Overview](#overview) @@ -1392,6 +1400,36 @@ The `LLM_SERVER_PRESERVE_REASONING` setting controls whether reasoning content i This setting is required by some LLM providers (e.g., Moonshot) that return errors like "thinking is enabled but reasoning_content is missing in assistant tool call message" when reasoning content is not included in multi-turn conversations. Enable this setting if your provider requires reasoning content to be preserved. +#### Atlas Cloud (OpenAI-compatible) + +[Atlas Cloud](https://www.atlascloud.ai/?utm_source=github&utm_medium=link&utm_campaign=pentagi) is OpenAI-compatible, so it plugs into PentAGI through the `LLM_SERVER_*` custom-provider variables — a single endpoint for DeepSeek, Qwen, GLM, Kimi, MiniMax, and more. + +```bash +LLM_SERVER_URL=https://api.atlascloud.ai/v1 +LLM_SERVER_KEY=your_atlascloud_api_key +LLM_SERVER_MODEL=deepseek-ai/deepseek-v4-pro +LLM_SERVER_PROVIDER=openai +``` + +`deepseek-ai/deepseek-v4-pro` is a reasoning model and works with PentAGI's default structured reasoning format (`LLM_SERVER_LEGACY_REASONING=false`). Any other Atlas chat model id can be used the same way. + +
+All Atlas Cloud chat models (59) + +- **Anthropic (Claude):** `anthropic/claude-haiku-4.5-20251001`, `anthropic/claude-opus-4.8`, `anthropic/claude-sonnet-4.6` +- **OpenAI (GPT):** `openai/gpt-5.4`, `openai/gpt-5.5` +- **Google (Gemini):** `google/gemini-3.1-flash-lite`, `google/gemini-3.1-pro-preview`, `google/gemini-3.5-flash` +- **Qwen:** `qwen/qwen2.5-7b-instruct`, `Qwen/Qwen3-235B-A22B-Instruct-2507`, `qwen/qwen3-235b-a22b-thinking-2507`, `qwen/qwen3-30b-a3b`, `Qwen/Qwen3-30B-A3B-Instruct-2507`, `qwen/qwen3-30b-a3b-thinking-2507`, `qwen/qwen3-32b`, `qwen/qwen3-8b`, `Qwen/Qwen3-Coder`, `qwen/qwen3-coder-next`, `qwen/qwen3-max-2026-01-23`, `Qwen/Qwen3-Next-80B-A3B-Instruct`, `Qwen/Qwen3-Next-80B-A3B-Thinking`, `Qwen/Qwen3-VL-235B-A22B-Instruct`, `qwen/qwen3-vl-235b-a22b-thinking`, `qwen/qwen3-vl-30b-a3b-instruct`, `qwen/qwen3-vl-30b-a3b-thinking`, `qwen/qwen3-vl-8b-instruct`, `qwen/qwen3.5-122b-a10b`, `qwen/qwen3.5-27b`, `qwen/qwen3.5-35b-a3b`, `qwen/qwen3.5-397b-a17b`, `qwen/qwen3.6-35b-a3b`, `qwen/qwen3.6-plus` +- **DeepSeek:** `deepseek-ai/deepseek-ocr`, `deepseek-ai/deepseek-r1-0528`, `deepseek-ai/DeepSeek-V3-0324`, `deepseek-ai/DeepSeek-V3.1`, `deepseek-ai/DeepSeek-V3.1-Terminus`, `deepseek-ai/deepseek-v3.2`, `deepseek-ai/DeepSeek-V3.2-Exp`, `deepseek-ai/deepseek-v4-flash`, `deepseek-ai/deepseek-v4-pro` +- **Moonshot (Kimi):** `moonshotai/Kimi-K2-Instruct`, `moonshotai/Kimi-K2-Instruct-0905`, `moonshotai/Kimi-K2-Thinking`, `moonshotai/kimi-k2.5`, `moonshotai/kimi-k2.6` +- **Zhipu (GLM):** `zai-org/GLM-4.6`, `zai-org/glm-4.7`, `zai-org/glm-5`, `zai-org/glm-5-turbo`, `zai-org/glm-5.1`, `zai-org/glm-5v-turbo` +- **MiniMax:** `MiniMaxAI/MiniMax-M2`, `minimaxai/minimax-m2.1`, `minimaxai/minimax-m2.5`, `minimaxai/minimax-m2.7` +- **xAI:** `xai/grok-4.3` +- **Kwaipilot:** `kwaipilot/kat-coder-pro-v2` +- **Other:** `owl` + +
+ ### Ollama Provider Configuration PentAGI supports Ollama for both local LLM inference (zero-cost, enhanced privacy) and Ollama Cloud (managed service with free tier). diff --git a/assets/atlas-cloud-logo.png b/assets/atlas-cloud-logo.png new file mode 100644 index 000000000..780a47698 Binary files /dev/null and b/assets/atlas-cloud-logo.png differ