Reduce your AI costs

![manifest-gh](https://github.com/user-attachments/assets/7dd74fc2-f7d6-4558-a95a-014ed754a125)

## What is Manifest?

Manifest is a smart model router for agents and AI applications that redirects each query to the right model, saving up to 70% in AI costs.

- 🔀 Routing based on complexity, specificity and custom HTTP headers
- 🎛️ Mix your providers: API keys, subscriptions, local models, custom providers
- 📊 Track every single dollar, set up notifications and limits
- 🚑 Fall back to different models when queries fail

## Quick start

### Cloud version

Go to [app.manifest.build](https://app.manifest.build) and follow the guide.

### Self-hosted

Manifest ships as a [Docker image](https://hub.docker.com/r/manifestdotbuild/manifest). One command:

```bash
bash <(curl -sSL https://raw.githubusercontent.com/mnfst/manifest/main/docker/install.sh)
```

Open [http://localhost:2099](http://localhost:2099) and sign up. The first account you create becomes the admin.

Full self-hosting guide: [docker/DOCKER_README.md](docker/DOCKER_README.md).

> The legacy `manifest` npm package is deprecated and no longer published.

## Providers

Manifest connects to **300+ models across 17 providers** plus any custom provider (OpenAI/Anthropic compatible). Bring your own API key, reuse a paid subscription you already have, or run models locally. All of them are routed through the same `/auto` endpoint.

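Before connecting a local provider, it can help to confirm that the runtime is actually listening. The snippet here is a minimal sketch, not part of Manifest: it assumes the default ports listed in the provider table (Ollama `11434`, LM Studio `1234`, llama.cpp `8080`) and that `curl` is installed. The function names and the short keys `ollama`, `lmstudio` and `llamacpp` are hypothetical, chosen only for illustration.

```bash
#!/usr/bin/env bash

# Map a short (hypothetical) provider key to its default local URL.
# Ports are the defaults from the provider table; adjust them if your
# runtime is configured to listen elsewhere.
local_provider_url() {
  case "$1" in
    ollama)   echo "http://localhost:11434" ;;
    lmstudio) echo "http://localhost:1234" ;;
    llamacpp) echo "http://localhost:8080" ;;
    *)        echo "unknown provider: $1" >&2; return 1 ;;
  esac
}

# Report which local runtimes accept connections.
# Without -f, curl exits 0 for any HTTP response (even a 404), so this
# only checks that something answers on the port, not which server it is.
check_local_providers() {
  local name url
  for name in ollama lmstudio llamacpp; do
    url="$(local_provider_url "$name")"
    if curl -s -o /dev/null --max-time 2 "$url"; then
      echo "$name: reachable at $url"
    else
      echo "$name: not reachable at $url"
    fi
  done
}

check_local_providers
```

A runtime reported as not reachable is either not started or bound to a different port. Update the URL in `local_provider_url` to match your setup.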
| Provider | API key | Subscription | Featured models |
| --- | :---: | :--- | :--- |
| [**OpenAI**](https://platform.openai.com/) | ✅ | ✅ ChatGPT Plus / Pro / Team | gpt-5, gpt-5-mini, o4, o4-mini |
| [**Anthropic**](https://www.anthropic.com/) | ✅ | ✅ Claude Max / Pro | claude-opus-4-7, claude-sonnet-4-6, claude-haiku-4-5 |
| [**Google**](https://ai.google.dev/) | ✅ | — | gemini-2.5-pro, gemini-2.5-flash, gemini-2.0-flash |
| [**xAI**](https://x.ai/) | ✅ | — | grok-4, grok-3, grok-code-fast |
| [**DeepSeek**](https://www.deepseek.com/) | ✅ | — | deepseek-v3.2, deepseek-r1 |
| [**Mistral**](https://mistral.ai/) | ✅ | — | mistral-large, codestral, magistral |
| [**Qwen** (Alibaba Cloud)](https://www.alibabacloud.com/en/solutions/generative-ai/qwen) | ✅ | — | qwen3-max, qwen3-coder, qwq-32b |
| [**Moonshot** (Kimi)](https://kimi.ai/) | ✅ | ✅ Kimi Coding Plan | kimi-k2, kimi-for-coding, moonshot-v1-128k |
| [**MiniMax**](https://www.minimax.io/) | ✅ | ✅ MiniMax Coding Plan | minimax-m2, abab7-chat-preview |
| [**Z.ai** (Zhipu)](https://z.ai/) | ✅ | ✅ GLM Coding Plan | glm-4.6, glm-4.5-air |
| [**BytePlus**](https://www.byteplus.com/en/activity/codingplan) | — | ✅ ModelArk Coding Plan | ark-code-latest, bytedance-seed-code, deepseek-v4-flash |
| [**OpenCode**](https://opencode.ai/) | — | ✅ Go subscription | Routes via OpenCode Go catalog |
| [**Ollama**](https://ollama.com/) | 🖥️ Local | ✅ Ollama Cloud | Any GGUF model, port `11434` |
| [**LM Studio**](https://lmstudio.ai/) | 🖥️ Local | — | Any GGUF model, port `1234` |
| [**llama.cpp**](https://github.com/ggml-org/llama.cpp) | 🖥️ Local | — | Any GGUF model, port `8080` |
| [**OpenRouter**](https://openrouter.ai/) | ✅ | — | Routes to 300+ models across labs |
| [**GitHub Copilot**](https://github.com/features/copilot) | — | ✅ Copilot subscription | OAuth, no API key needed |
| **Custom** (OpenAI/Anthropic-compatible) | ✅ | — | Any `/v1/chat/completions` or `/v1/messages` endpoint |

## Quick links

- [Docs](https://manifest.build/docs)
- [Discord](https://discord.com/invite/FepAked3W7)
- [Discussions](https://github.com/mnfst/manifest/discussions)
- [Contributing](CONTRIBUTING.md)
- [GitHub](https://github.com/mnfst/manifest)

## License

[MIT](LICENSE)