# Agnes AI Model Catalog This catalog summarizes the public Agnes AI model families and the recommended API entry points. ## Catalog Status | Field | Value | | --- | --- | | Catalog version | `2026.06.22` | | Last updated | `2026-06-22 00:00 Asia/Singapore` | | Scope | Public model, endpoint, quota, and troubleshooting reference | | Change notice | Rate limits, subscription quotas, model parameters, and availability may change. Treat the values in this catalog as current reference values, not permanent contractual limits. | ## API Base URLs | Use Case | Base URL | | --- | --- | | OpenAI-compatible APIs | `https://apihub.agnes-ai.com/v1` | | Image API root | `https://apihub.agnes-ai.com` | Authentication: ```text Authorization: Bearer YOUR_API_KEY ``` ## Text and Agent Models | Model | Endpoint | Capabilities | Suggested Use Cases | | --- | --- | --- | --- | | `agnes-1.5-flash` | `POST /v1/chat/completions` | Fast chat completions, text generation, image URL input, low-latency inference | Realtime assistants, content generation, summarization, simple multimodal tasks | | `agnes-2.0-flash` | `POST /v1/chat/completions` | Chat, streaming, tool calling, coding, reasoning, image understanding, agent workflows | Developer agents, customer support, coding tasks, workflow automation, multimodal assistants | ### Text Model Notes | Model | Current Reference Specs | Notes | | --- | --- | --- | | `agnes-1.5-flash` | `256K` context, `64K` max output reference limit | Recommended for high-throughput chat and low-latency content workflows. | | `agnes-2.0-flash` | `256K` context, `64K` max output reference limit | The temporary `1M` context window was rolled back in June 2026 for stability. Use this model for coding, reasoning, agents, vision input, streaming, and tool calling. | ## Image Models | Model | Endpoint | Capabilities | Suggested Use Cases | | --- | --- | --- | --- | | `agnes-image-2.0-flash` | `POST /v1/images/generations` | Text-to-image, image-to-image, URL output, Base64 output | Creative images, product visuals, posters, image transformation | | `agnes-image-2.1-flash` | `POST /v1/images/generations` | High-density image generation, image editing, URL or Data URI input, flexible image sizes | Detailed compositions, marketing assets, character visuals, social media content | ## Video Models | Model | Endpoint | Capabilities | Suggested Use Cases | | --- | --- | --- | --- | | `agnes-video-v2.0` | `POST /v1/videos` | Text-to-video, image-to-video, multi-image video, keyframe animation, async generation | Storytelling, marketing videos, product demos, social video, app motion assets | Video result query: ```text GET https://apihub.agnes-ai.com/agnesapi?video_id= ``` Current guidance: - Use the returned `video_id` to poll video results. - Do not use `task_id` for current result polling unless a legacy integration specifically documents that workflow. - If a video task stays queued for more than a few minutes, verify that the polling request is using `video_id`. Legacy task query format: ```text GET https://apihub.agnes-ai.com/v1/videos/{task_id} ``` ## Current Rate Limits These values are current public reference values as of `2026-06-22`. Use the official platform console as the final source of truth for production traffic planning. ### Text Model RPM | Model Type | User Type | Public Request RPM | Actual Executable RPM | | --- | --- | ---: | ---: | | Text models | Free / default | 30 | 20 | | Text models | Enterprise | 60 | 40 | | Text models | Token Plan | 1,000 | 1,000 | ### Image Model RPM | Model Type | User Type | Resolution | Public Request RPM | Actual Executable RPM | | --- | --- | --- | ---: | ---: | | Image models | Free / default | 1K | 30 | 20 | | Image models | Free / default | 2K | 20 | 10 | | Image models | Free / default | 3K | 2 | 1 | | Image models | Free / default | 4K | 1 | 1 | | Image models | Enterprise | 1K | 60 | 40 | | Image models | Enterprise | 2K | 40 | 20 | | Image models | Enterprise | 3K | 2 | 1 | | Image models | Enterprise | 4K | 2 | 1 | | Image models | Token Plan | 1K | 120 | 100 | | Image models | Token Plan | 2K | 120 | 80 | | Image models | Token Plan | 3K | 2 | 1 | | Image models | Token Plan | 4K | 2 | 1 | ### Video Model RPM | Model Type | User Type | Public Request RPM | Actual Executable RPM | | --- | --- | ---: | ---: | | Video models | Free / default | 30 | 20 | | Video models | Enterprise | 60 | 40 | | Video models | Token Plan | 100 | 100 | ### RPM Field Definitions | Field | Meaning | | --- | --- | | Public Request RPM | Number of requests a user is allowed to initiate per minute. | | Actual Executable RPM | Number of requests that can actually be executed per minute after service-side scheduling and capacity constraints. | ## Current Subscription Quotas These quota values are current public reference values as of `2026-06-22` and may be adjusted in later pricing or capacity updates. | Plan | Price | `agnes-2.0-flash` | `agnes-image-2.0/2.1-flash` | `agnes-video-v2.0` | | --- | ---: | --- | --- | --- | | Starter | `$4` | 1,500 requests per 5 hours; 15,000 requests per week | 4,000 images per day | 500 seconds per day | | Plus | `$10` | 7,500 requests per 5 hours; 75,000 requests per week | 4,000 images per day | 500 seconds per day | | Pro | `$50` | 30,000 requests per 5 hours; 300,000 requests per week | 4,000 images per day | 500 seconds per day | ## Compatibility Notes Agnes AI is designed for OpenAI-compatible integrations. For agent clients and coding tools, configure: ```text Base URL: https://apihub.agnes-ai.com/v1 API Key: YOUR_API_KEY Chat endpoint: /v1/chat/completions ``` Recommended model selection: | Workflow | Recommended Model | | --- | --- | | General chat and content generation | `agnes-1.5-flash` | | Coding, reasoning, tool calling, and agent workflows | `agnes-2.0-flash` | | Text-to-image and image editing | `agnes-image-2.1-flash` | | Fast image generation | `agnes-image-2.0-flash` | | Text-to-video and image-to-video | `agnes-video-v2.0` | ## Troubleshooting Reference | Status | Meaning | What to Check | | --- | --- | --- | | `400` | Invalid request | Required fields, parameter types, image URL accessibility, response format placement, model-specific parameter support. | | `401` | Authentication failed | API key value, `Authorization: Bearer ...` format, environment variable loading, account status. | | `404` | Endpoint or resource not found | Base URL, endpoint path, model name, and whether a generated resource ID exists. | | `429` | Rate limit exceeded | Current user plan, RPM limits, concurrent requests, retry and backoff behavior. | | `500` | Server error | Retry with backoff, reduce payload complexity, verify whether the issue reproduces with a minimal request. | | `502` | Upstream gateway error | Retry with backoff and check service status if available. | | `503` | Service busy or unavailable | Retry later, reduce concurrency, and avoid immediate repeated polling. | | `520` | Unknown upstream error | Retry with backoff and capture request metadata for support investigation. | ## Documentation Links | Model | Docs | | --- | --- | | `agnes-1.5-flash` | https://agnes-ai.com/doc/agnes-15-flash | | `agnes-2.0-flash` | https://agnes-ai.com/doc/agnes-20-flash | | `agnes-image-2.0-flash` | https://agnes-ai.com/doc/agnes-image-20-flash | | `agnes-image-2.1-flash` | https://agnes-ai.com/doc/agnes-image-21-flash | | `agnes-video-v2.0` | https://agnes-ai.com/doc/agnes-video-v20 |