--- name: novita-docs description: Complete reference documentation for Novita AI platform. Use when user asks about Novita AI products, APIs, pricing, integrations, GPU instances, model catalogs, sandbox environments, or design system. --- # Novita AI Platform Reference Complete documentation for the Novita AI platform - an AI & Agent Cloud for developers. ## When to Use This Skill Load this skill when the user asks about: - **Novita AI products**: Model APIs, GPU instances, serverless GPUs, agent sandbox - **Model information**: "What models does Novita support?", model pricing, capabilities - **API guidance**: How to use APIs, authentication, endpoints, parameters - **Pricing and billing**: Cost estimates, billing queries, payment methods - **Integrations**: LangChain, LlamaIndex, Cursor, and 30+ other tools - **Design system**: Colors, typography, buttons, navigation, icons, logo - **Getting started**: Quickstart guides, FAQs, setup instructions - **Troubleshooting**: Error codes, common issues, support ## Quick Reference | Resource | URL | |----------|-----| | **Website** | https://novita.ai | | **Model Catalog** | https://novita.ai/models (200+ models) | | **Documentation** | https://novita.ai/docs | | **Pricing** | https://novita.ai/pricing | | **Console** | https://novita.ai/console | | **API Base URL** | `https://api.novita.ai/openai` | | **Support** | support@novita.ai | | **Discord** | https://discord.gg/YyPRAzwp7P | --- ## ๐Ÿ” Quick: Query Available Models **Most common question**: "What models does Novita support?" ### Query Methods **1. Web Catalog** (human-friendly): - Browse 200+ models at https://novita.ai/models - Filter by type: LLM, image, video, audio, embeddings **2. API Endpoint** (automation): ```bash curl https://api.novita.ai/openai/v1/models \ -H "Authorization: Bearer " ``` Returns: Model ID, pricing per million tokens, context size, description ### Model Categories - **LLMs**: 100+ models (Llama, Qwen, DeepSeek, Mistral, etc.) - **Image Generation**: Flux, Stable Diffusion, SDXL - **Video**: Wan 2.6, CogVideoX - **Audio**: TTS, voice cloning - **Embeddings**: Text embedding models ### Quick Links | Task | Reference | |------|-----------| | List all models via API | [list-models.md](references/api-reference/llm/list-models.md) | | Get specific model info | [retrieve-model.md](references/api-reference/llm/retrieve-model.md) | | Recommended LLMs | [llm/recommended.md](references/llm/recommended.md) | | Image model APIs | [api-reference/image-apis/](references/api-reference/image-apis/) | | Model API guides | [model-apis/](references/model-apis/) | **Pro Tip**: Always call `/v1/models` API first for the latest model list and current pricing. --- ## How to Use This Documentation ### 1. Start Here - **New users**: See [getting-started/](references/getting-started/) - company overview, quickstart, FAQ - **Model queries**: Check the "Quick: Query Available Models" section above - **API help**: Jump to specific API reference sections below ### 2. Find Documentation by Category **Product Guides** (usage and features): - [getting-started/](references/getting-started/) - Overview, quickstart, product pages - [llm/](references/llm/) - LLM API guides (16 files) - [model-apis/](references/model-apis/) - Model API guides (11 files) - [gpu-instance/](references/gpu-instance/) - GPU instances (14 files) - [serverless-gpus/](references/serverless-gpus/) - Serverless GPUs (6 files) - [sandbox/](references/sandbox/) - Agent Sandbox (43 files) - [integrations/](references/integrations/) - 30+ integration guides **API Reference** (endpoints and parameters): - [api-reference/basic/](references/api-reference/basic/) - Auth, billing (6 files) - [api-reference/llm/](references/api-reference/llm/) - LLM endpoints (16 files) - [api-reference/image-apis/](references/api-reference/image-apis/) - Image/video APIs (48 files) - [api-reference/gpu-instance/](references/api-reference/gpu-instance/) - GPU APIs (2 files) **Support**: - [billing/](references/billing/) - Billing and payments (4 files) - [team/](references/team/) - Team management (1 file) **Design System**: - [design-system/](references/design-system/) - UI/UX specs (7 files) ### 3. File Naming Convention Files are organized by category: ``` references/ โ”œโ”€โ”€ getting-started/ # Product overviews and quickstart โ”œโ”€โ”€ llm/ # LLM feature guides โ”œโ”€โ”€ model-apis/ # Model API guides โ”œโ”€โ”€ gpu-instance/ # GPU instance guides โ”œโ”€โ”€ serverless-gpus/ # Serverless GPU guides โ”œโ”€โ”€ sandbox/ # Agent Sandbox docs (with subdirs) โ”œโ”€โ”€ integrations/ # Third-party tool integrations โ”œโ”€โ”€ api-reference/ # API endpoint documentation โ”‚ โ”œโ”€โ”€ basic/ # Auth, billing APIs โ”‚ โ”œโ”€โ”€ llm/ # LLM API endpoints โ”‚ โ”œโ”€โ”€ image-apis/ # Image/video API endpoints โ”‚ โ””โ”€โ”€ gpu-instance/ # GPU instance APIs โ”œโ”€โ”€ billing/ # Billing and payment โ”œโ”€โ”€ team/ # Team management โ””โ”€โ”€ design-system/ # UI/UX design specs ``` --- ## ๐Ÿ“š Documentation Index ### Core Product Documentation **Getting Started** (8 files) - [company-overview.md](references/getting-started/company-overview.md) - Company overview, products, testimonials - [gpus.md](references/getting-started/gpus.md) - GPU Cloud product overview - [sandbox.md](references/getting-started/sandbox.md) - Agent Sandbox product overview - [gpu-baremetal.md](references/getting-started/gpu-baremetal.md) - Bare metal GPU servers - [introduction.md](references/getting-started/introduction.md) - Platform introduction - [quickstart.md](references/getting-started/quickstart.md) - Quick start guide - [faq.md](references/getting-started/faq.md) - Frequently asked questions - [error-handling.md](references/getting-started/error-handling.md) - Error handling **LLM Guides** (17 files) Core: [api](references/llm/api.md) ยท [batch-api](references/llm/batch-api.md) ยท [function-calling](references/llm/function-calling.md) ยท [vision](references/llm/vision.md) ยท [reasoning](references/llm/reasoning.md) ยท [structured-outputs](references/llm/structured-outputs.md) ยท [prompt-cache](references/llm/prompt-cache.md) ยท [rate-limits](references/llm/rate-limits.md) ยท [monitoring](references/llm/monitoring.md) ยท [observability-metrics](references/llm/observability-metrics.md) ยท [dedicated-endpoint](references/llm/dedicated-endpoint.md) ยท [playgrounds](references/llm/playgrounds.md) ยท [recommended](references/llm/recommended.md) **Model APIs** (11 files) [overview](references/model-apis/overview.md) ยท [sdks](references/model-apis/sdks.md) ยท [dedicated-endpoints](references/model-apis/dedicated-endpoints.md) ยท [training-guidance](references/model-apis/training-guidance.md) ยท [custom-model](references/model-apis/custom-model.md) ยท [sampler](references/model-apis/sampler.md) ยท [vae](references/model-apis/vae.md) ยท [clip-skip](references/model-apis/clip-skip.md) ยท [rate-limits](references/model-apis/rate-limits.md) ยท [v2-to-v3-migration](references/model-apis/v2-to-v3-migration.md) ยท [configure-custom-s3-bucket](references/model-apis/configure-custom-s3-bucket.md) **GPU Instance** (14 files) [overview](references/gpu-instance/overview.md) ยท [overview-guide](references/gpu-instance/overview-guide.md) ยท [choose-a-gpu](references/gpu-instance/choose-a-gpu.md) ยท [pricing](references/gpu-instance/pricing.md) ยท [quickstart-*](references/gpu-instance/quickstart-preparations.md) (5 files) ยท [jupyterlab](references/gpu-instance/jupyterlab.md) ยท [save-image](references/gpu-instance/save-image.md) ยท [upgrade-instance](references/gpu-instance/upgrade-instance.md) ยท [edit-instance](references/gpu-instance/edit-instance.md) ยท [image-prewarm](references/gpu-instance/image-prewarm.md) **Serverless GPUs** (6 files) [overview](references/serverless-gpus/overview.md) ยท [pricing](references/serverless-gpus/pricing.md) ยท [quickstart-*](references/serverless-gpus/quickstart-preparations.md) (4 files) **Agent Sandbox** (43 files organized in subdirectories) Core: [overview](references/sandbox/overview.md) ยท [pricing](references/sandbox/pricing.md) ยท [sdk-and-cli](references/sandbox/sdk-and-cli.md) Quickstart: [your-first-sandbox](references/sandbox/quickstart/your-first-sandbox.md) ยท [introduction](references/sandbox/quickstart/introduction.md) ยท [installation](references/sandbox/quickstart/installation.md) ยท [quick-start](references/sandbox/quickstart/quick-start.md) ยท [frameworks](references/sandbox/quickstart/frameworks.md) ยท [advanced](references/sandbox/quickstart/advanced.md) CLI: [overview](references/sandbox/cli/overview.md) ยท [auth](references/sandbox/cli/auth.md) ยท [spawn](references/sandbox/cli/spawn.md) ยท [list](references/sandbox/cli/list.md) ยท [shutdown](references/sandbox/cli/shutdown.md) Commands: [overview](references/sandbox/commands/overview.md) ยท [background](references/sandbox/commands/background.md) ยท [streaming](references/sandbox/commands/streaming.md) Filesystem: [overview](references/sandbox/filesystem/overview.md) ยท [read-write](references/sandbox/filesystem/read-write.md) ยท [upload](references/sandbox/filesystem/upload.md) ยท [download](references/sandbox/filesystem/download.md) ยท [watch](references/sandbox/filesystem/watch.md) Lifecycle: [overview](references/sandbox/lifecycle/overview.md) ยท [clone](references/sandbox/lifecycle/clone.md) ยท [list](references/sandbox/lifecycle/list.md) ยท [idle-timeout](references/sandbox/lifecycle/idle-timeout.md) Template: [overview](references/sandbox/template/overview.md) ยท [customize-cpu-ram](references/sandbox/template/customize-cpu-ram.md) ยท [start-cmd](references/sandbox/template/start-cmd.md) ยท [ready-cmd](references/sandbox/template/ready-cmd.md) ยท [version-management](references/sandbox/template/version-management.md) More: [console](references/sandbox/console.md) ยท [connect](references/sandbox/connect.md) ยท [internet-access](references/sandbox/internet-access.md) ยท [environment-variables](references/sandbox/environment-variables.md) ยท [metadata](references/sandbox/metadata.md) ยท [metrics](references/sandbox/metrics.md) ยท [mount-cloudstorage](references/sandbox/mount-cloudstorage.md) **Integrations** (30 tools) [langchain](references/integrations/langchain.md) ยท [llamaindex](references/integrations/llamaindex.md) ยท [huggingface](references/integrations/huggingface.md) ยท [cursor](references/integrations/cursor.md) ยท [dify](references/integrations/dify.md) ยท [browseruse](references/integrations/browseruse.md) ยท [skyvern](references/integrations/skyvern.md) ยท [gradio](references/integrations/gradio.md) ยท [anythingllm](references/integrations/anythingllm.md) ยท [axolotl](references/integrations/axolotl.md) ยท [chatbox](references/integrations/chatbox.md) ยท [claude-code](references/integrations/claude-code.md) ยท [codecompanion](references/integrations/codecompanion.md) ยท [continue](references/integrations/continue.md) ยท [deepsearcher](references/integrations/deepsearcher.md) ยท [docsgpt](references/integrations/docsgpt.md) ยท [helicone](references/integrations/helicone.md) ยท [kohya-ss-gui](references/integrations/kohya-ss-gui.md) ยท [langflow](references/integrations/langflow.md) ยท [langfuse](references/integrations/langfuse.md) ยท [litellm](references/integrations/litellm.md) ยท [lobechat](references/integrations/lobechat.md) ยท [lollms-webui](references/integrations/lollms-webui.md) ยท [openai-agents-sdk](references/integrations/openai-agents-sdk.md) ยท [owl](references/integrations/owl.md) ยท [pageassist](references/integrations/pageassist.md) ยท [portkey](references/integrations/portkey.md) ยท [verba](references/integrations/verba.md) ### API Reference **Basic APIs** (6 files) [authentication](references/api-reference/basic/authentication.md) ยท [error-code](references/api-reference/basic/error-code.md) ยท [get-user-balance](references/api-reference/basic/get-user-balance.md) ยท [query-*-billing](references/api-reference/basic/) (3 files) **LLM APIs** (16 files) [list-models](references/api-reference/llm/list-models.md) ยท [retrieve-model](references/api-reference/llm/retrieve-model.md) ยท [create-chat-completion](references/api-reference/llm/create-chat-completion.md) ยท [create-completion](references/api-reference/llm/create-completion.md) ยท [create-embeddings](references/api-reference/llm/create-embeddings.md) ยท [create-rerank](references/api-reference/llm/create-rerank.md) ยท [create-batch](references/api-reference/llm/create-batch.md) ยท [cancel-batch](references/api-reference/llm/cancel-batch.md) ยท [list-batches](references/api-reference/llm/list-batches.md) ยท [retrieve-batch](references/api-reference/llm/retrieve-batch.md) ยท [list-files](references/api-reference/llm/list-files.md) ยท [upload-batch-input-file](references/api-reference/llm/upload-batch-input-file.md) ยท [query-file](references/api-reference/llm/query-file.md) ยท [retrieve-file-content](references/api-reference/llm/retrieve-file-content.md) ยท [delete-file](references/api-reference/llm/delete-file.md) **Image/Video APIs** (54 files) [introduction](references/api-reference/image-apis/introduction.md) Core APIs: [txt2img](references/api-reference/image-apis/txt2img.md) ยท [img2img](references/api-reference/image-apis/img2img.md) ยท [inpainting](references/api-reference/image-apis/inpainting.md) ยท [upscale](references/api-reference/image-apis/upscale.md) ยท [image-upscaler](references/api-reference/image-apis/image-upscaler.md) ยท [remove-background](references/api-reference/image-apis/remove-background.md) ยท [image-to-prompt](references/api-reference/image-apis/image-to-prompt.md) ยท [eraser](references/api-reference/image-apis/eraser.md) ยท [remove-text](references/api-reference/image-apis/remove-text.md) ยท [replace-background](references/api-reference/image-apis/replace-background.md) ยท [merge-face](references/api-reference/image-apis/merge-face.md) ยท [reimagine](references/api-reference/image-apis/reimagine.md) ยท [video-merge-face](references/api-reference/image-apis/video-merge-face.md) ยท [task-result](references/api-reference/image-apis/task-result.md) Flux Models: [flux-1-schnell](references/api-reference/image-apis/flux-1-schnell.md) ยท [flux-1-kontext-dev](references/api-reference/image-apis/flux-1-kontext-dev.md) ยท [flux-1-kontext-max](references/api-reference/image-apis/flux-1-kontext-max.md) ยท [flux-1-kontext-pro](references/api-reference/image-apis/flux-1-kontext-pro.md) ยท [flux-2-dev](references/api-reference/image-apis/flux-2-dev.md) ยท [flux-2-flex](references/api-reference/image-apis/flux-2-flex.md) ยท [flux-2-pro](references/api-reference/image-apis/flux-2-pro.md) Other Models: [seedream-*](references/api-reference/image-apis/seedream-3-0.md) (3) ยท [glm-image](references/api-reference/image-apis/glm-image.md) ยท [hunyuan-image-3](references/api-reference/image-apis/hunyuan-image-3.md) ยท [qwen-*](references/api-reference/image-apis/qwen-txt2img.md) (2) ยท [z-image-turbo](references/api-reference/image-apis/z-image-turbo.md) ยท [z-image-turbo-lora](references/api-reference/image-apis/z-image-turbo-lora.md) Training: [create-style-training](references/api-reference/image-apis/create-style-training.md) ยท [create-subject-training](references/api-reference/image-apis/create-subject-training.md) ยท [list-training-task](references/api-reference/image-apis/list-training-task.md) ยท [get-training-images-url](references/api-reference/image-apis/get-training-images-url.md) Other: [glm-tts-voice-clone](references/api-reference/image-apis/glm-tts-voice-clone.md) ยท [webhook](references/api-reference/image-apis/webhook.md) **GPU Instance APIs** (2 files) [create-instance](references/api-reference/gpu-instance/create-instance.md) ยท [list-clusters](references/api-reference/gpu-instance/list-clusters.md) ### Support & Design System **Billing** (4 files) [budgets](references/billing/budgets.md) ยท [auto-top-up](references/billing/auto-top-up.md) ยท [payment-methods](references/billing/payment-methods.md) ยท [low-balance-alert](references/billing/low-balance-alert.md) **Team** (1 file) [team-management](references/team/team-management.md) **Design System** (7 files) [overview](references/design-system/overview.md) ยท [typography](references/design-system/typography.md) ยท [colors](references/design-system/colors.md) ยท [buttons](references/design-system/buttons.md) ยท [navigation](references/design-system/navigation.md) ยท [icons](references/design-system/icons.md) ยท [logo](references/design-system/logo.md) --- ## Common Tasks ### Start with Model APIs 1. Get API key from https://novita.ai/console 2. Set base URL to `https://api.novita.ai/openai` 3. Call `/v1/models` to list available models 4. Use OpenAI-compatible APIs for chat completions 5. See [llm/api.md](references/llm/api.md) for details ### Launch GPU Instance 1. Go to https://novita.ai/gpus-console/explore 2. Choose GPU or template 3. Configure and launch 4. Connect via SSH or web terminal 5. See [gpu-instance/](references/gpu-instance/) for details ### Create Serverless Endpoint 1. Prepare container image 2. Go to https://novita.ai/gpus-console/serverless 3. Create endpoint with scale policy 4. Test and deploy 5. See [serverless-gpus/](references/serverless-gpus/) for details ### Start Agent Sandbox 1. Install SDK or CLI 2. Create sandbox with desired resources 3. Run commands or upload code 4. Pause/resume as needed 5. See [sandbox/](references/sandbox/) for details ### Integrate with Framework 1. Get Novita API key 2. Set base URL to `https://api.novita.ai/openai` 3. Update model names as needed 4. See [integrations/](references/integrations/) for specific guides --- ## Support & Resources - **Documentation**: https://novita.ai/docs - **Email**: support@novita.ai - **Discord**: https://discord.gg/YyPRAzwp7P - **FAQ**: https://novita.ai/docs/guides/faq - **Status Page**: https://status.novita.ai/