aid: baseten
url: https://raw.githubusercontent.com/api-evangelist/baseten/refs/heads/main/apis.yml
name: Baseten
x-type: company
description: >-
  Baseten is a production inference platform for deploying and serving custom
  and pre-trained ML models. Offers a Model APIs catalog with OpenAI-compatible
  endpoints (DeepSeek, Qwen, GLM, Nemotron), dedicated deployments via Truss,
  autoscaling GPU compute, async/queue inference, training, chains (multi-model
  workflows), and management APIs.
image: https://kinlane-productions.s3.amazonaws.com/apis-json/apis-json-logo.jpg
tags:
  - AI
  - ML
  - Inference
  - Deployment
  - MLOps
  - OpenAI Compatible
  - Anthropic Compatible
  - Truss
created: '2026-05-08'
modified: '2026-05-08'
specificationVersion: '0.19'
apis:
  - aid: baseten:llm-inference
    name: Baseten LLM Inference API
    description: >-
      OpenAI-compatible chat completions for Baseten's Model APIs catalog
      (DeepSeek V4, Qwen, GLM, Nemotron, etc.). Per-million-token pricing.
    image: https://kinlane-productions.s3.amazonaws.com/apis-json/apis-json-logo.jpg
    humanURL: https://docs.baseten.co/reference/inference-api/llm-openapi-spec
    baseURL: https://inference.baseten.co/v1
    tags:
      - AI
      - LLM
      - Chat Completions
      - OpenAI Compatible
    properties:
      - type: Documentation
        url: https://docs.baseten.co/
      - type: OpenAPI
        url: openapi/baseten-llm-openapi.json
      - type: Pricing
        url: https://www.baseten.co/pricing/
  - aid: baseten:messages
    name: Baseten Anthropic-Compatible Messages API
    description: >-
      Anthropic Messages-compatible inference for compatible Model APIs models.
    image: https://kinlane-productions.s3.amazonaws.com/apis-json/apis-json-logo.jpg
    humanURL: https://docs.baseten.co/reference/inference-api/messages-openapi-spec
    baseURL: https://inference.baseten.co
    tags:
      - AI
      - LLM
      - Anthropic Compatible
      - Messages
    properties:
      - type: Documentation
        url: https://docs.baseten.co/
      - type: OpenAPI
        url: openapi/baseten-messages-openapi.json
  - aid: baseten:management
    name: Baseten Management & Async API
    description: >-
      Deployment management, async/queued inference, chain calls (multi-model
      workflows), training, dedicated-deployment lifecycle, async result
      polling, and webhook delivery.
    image: https://kinlane-productions.s3.amazonaws.com/apis-json/apis-json-logo.jpg
    humanURL: https://docs.baseten.co/api-reference
    baseURL: https://api.baseten.co
    tags:
      - Management
      - Async
      - Webhooks
      - Deployment
      - Training
    properties:
      - type: Documentation
        url: https://docs.baseten.co/api-reference
      - type: TrussCLI
        url: https://docs.baseten.co/
common:
  - type: Website
    url: https://www.baseten.co/
  - type: Documentation
    url: https://docs.baseten.co/
  - type: Pricing
    url: https://www.baseten.co/pricing/
  - type: Plans
    url: plans/baseten-plans-pricing.yml
  - type: RateLimits
    url: rate-limits/baseten-rate-limits.yml
  - type: FinOps
    url: finops/baseten-finops.yml
maintainers:
  - FN: Kin Lane
    email: kin@apievangelist.com