aid: fal-ai
url: https://raw.githubusercontent.com/api-evangelist/fal-ai/refs/heads/main/apis.yml
apis:
- aid: fal-ai:fal-model-apis
  name: fal Model APIs
  tags:
  - AI
  - Generative AI
  - Image Generation
  - Video Generation
  - Audio Generation
  - Multimodal
  - Inference
  humanURL: https://fal.ai/docs/model-apis/quickstart
  baseURL: https://queue.fal.run
  properties:
  - url: https://fal.ai/docs/model-apis/quickstart
    type: Documentation
  - url: https://fal.ai/models
    type: Documentation
    name: Model Gallery
  - url: openapi/fal-model-apis-openapi.yml
    type: OpenAPI
  - url: json-schema/fal-queue-request-schema.json
    type: JSONSchema
  - url: json-schema/fal-queue-status-schema.json
    type: JSONSchema
  - url: json-ld/fal-ai-context.jsonld
    type: JSONLD
  - type: NaftikoCapability
    url: capabilities/model-apis-queue.yaml
  description: Unified queue-based REST API for invoking 1,000+ generative image, video, audio, and multimodal
    models hosted on fal's inference infrastructure. Submit a request to `https://queue.fal.run/{model-id}`, poll
    `/requests/{request_id}/status` or `/requests/{request_id}` for progress and results, or subscribe to webhook
    callbacks. Supports synchronous responses, asynchronous queueing, server-sent streaming progress, and request
    cancellation. Powers flagship models including FLUX, Veo 3, Kling 2.5, Wan 2.5, Seedream, Nano Banana, Qwen,
    SDXL, and Stable Diffusion variants.
- aid: fal-ai:fal-realtime-api
  name: fal Realtime API
  tags:
  - AI
  - Generative AI
  - Realtime
  - WebSocket
  - Streaming
  - Inference
  humanURL: https://fal.ai/docs/model-apis/real-time
  baseURL: wss://realtime.fal.run
  properties:
  - url: https://fal.ai/docs/model-apis/real-time
    type: Documentation
  - url: https://github.com/fal-ai/real-time-demo-app
    type: CodeExamples
  description: WebSocket-based realtime inference for ultra-low latency interactive generative experiences such as
    LCM/SDXL sketch-to-image, live-portrait, and realtime upscaling. Bi-directional binary/JSON messaging keeps a
    persistent connection open so each frame, prompt, or pose adjustment is processed in milliseconds. Powers
    fal.realtime client utilities used in canvas apps, drawing tools, AR experiences, and live video pipelines.
- aid: fal-ai:fal-streaming-api
  name: fal Streaming API
  tags:
  - AI
  - Generative AI
  - Streaming
  - Server-Sent Events
  - Inference
  humanURL: https://fal.ai/docs/model-apis/streaming
  baseURL: https://queue.fal.run
  properties:
  - url: https://fal.ai/docs/model-apis/streaming
    type: Documentation
  description: HTTP streaming endpoint (`/{model-id}/stream`) that emits progressive partial outputs as a model
    runs — used for LLM/VLM token streams, incremental video frames, and step-by-step image diffusion previews.
    Compatible with Server-Sent Events parsers in the official fal-client SDKs.
- aid: fal-ai:fal-storage-api
  name: fal Storage API
  tags:
  - AI
  - Generative AI
  - File Upload
  - Storage
  - CDN
  humanURL: https://fal.ai/docs/model-apis/file-uploads
  baseURL: https://rest.alpha.fal.ai
  properties:
  - url: https://fal.ai/docs/model-apis/file-uploads
    type: Documentation
  - url: openapi/fal-storage-api-openapi.yml
    type: OpenAPI
  - type: NaftikoCapability
    url: capabilities/storage-files.yaml
  description: REST endpoints for uploading binary inputs (images, audio clips, reference frames, control maps) to
    fal's CDN so they can be referenced by URL when invoking model APIs. Issues short-lived signed upload URLs via
    `/storage/upload/initiate` and serves the resulting assets from `https://v3.fal.media`.
- aid: fal-ai:fal-serverless-platform-api
  name: fal Serverless Platform API
  tags:
  - AI
  - Serverless
  - GPU
  - Deployments
  - Platform
  humanURL: https://fal.ai/docs/private-serverless-models
  baseURL: https://rest.alpha.fal.ai
  properties:
  - url: https://fal.ai/docs/private-serverless-models
    type: Documentation
  - url: https://github.com/fal-ai/fal
    type: SDK
    name: fal Python SDK and CLI
  - url: openapi/fal-serverless-platform-api-openapi.yml
    type: OpenAPI
  - type: NaftikoCapability
    url: capabilities/serverless-apps.yaml
  description: Programmatic management of custom fal Serverless applications — list, inspect, deploy, scale, and
    monitor user-defined GPU functions deployed with `@fal.function`, `fal.App`, or BYO containers. Covers app
    metadata, secrets, file volumes, scaling parameters (`keep_alive`, `min_concurrency`), and execution analytics.
- aid: fal-ai:fal-models-catalog-api
  name: fal Models Catalog API
  tags:
  - AI
  - Generative AI
  - Catalog
  - Discovery
  humanURL: https://fal.ai/models
  baseURL: https://fal.ai
  properties:
  - url: https://fal.ai/models
    type: Documentation
  - type: NaftikoCapability
    url: capabilities/models-catalog.yaml
  description: Read-only discovery endpoints for browsing fal's 1,000+ production model catalog, including model
    metadata, capability tags, pricing per output, supported parameters, example inputs, and OpenAPI schemas per
    model. Backs the model gallery, search, and SDK tooling.
- aid: fal-ai:fal-compute-api
  name: fal Compute API
  tags:
  - AI
  - GPU
  - Compute
  - Infrastructure
  - Dedicated
  humanURL: https://fal.ai/compute
  baseURL: https://rest.alpha.fal.ai
  properties:
  - url: https://fal.ai/compute
    type: Documentation
  - type: NaftikoCapability
    url: capabilities/compute-instances.yaml
  description: Provision and manage dedicated GPU instances (H100, H200, A100, B200) with full SSH access for
    training, fine-tuning, and persistent workloads. Hourly or per-second billing with no lock-in.
- aid: fal-ai:fal-keys-api
  name: fal API Keys API
  tags:
  - AI
  - Administration
  - Authentication
  - API Keys
  humanURL: https://fal.ai/dashboard/keys
  baseURL: https://rest.alpha.fal.ai
  properties:
  - url: https://fal.ai/dashboard/keys
    type: Documentation
  - type: NaftikoCapability
    url: capabilities/keys-management.yaml
  description: 'Manage fal API keys — create, list, scope, and revoke keys used to authenticate against the Model,
    Storage, Serverless, and Compute APIs via the Authorization: Key $FAL_KEY header.'
- aid: fal-ai:fal-usage-billing-api
  name: fal Usage and Billing API
  tags:
  - AI
  - Administration
  - Usage
  - Billing
  - FinOps
  humanURL: https://fal.ai/dashboard/usage
  baseURL: https://rest.alpha.fal.ai
  properties:
  - url: https://fal.ai/dashboard/usage
    type: Documentation
  - type: NaftikoCapability
    url: capabilities/usage-billing.yaml
  description: Programmatic access to usage metrics, per-model spend, GPU-second consumption, and invoicing
    history. Surfaces the same data shown on the fal dashboard so platform teams can pipe inference cost into
    internal FinOps tooling.
name: fal
tags:
- AI
- Artificial Intelligence
- Generative AI
- Generative Media
- Image Generation
- Video Generation
- Audio Generation
- Inference
- Serverless
- GPU
- MCP
kind: contract
image: https://kinlane-productions2.s3.amazonaws.com/apis-json/apis-json-logo.jpg
access: 3rd-Party
common:
- type: Portal
  url: https://fal.ai
- type: Documentation
  url: https://fal.ai/docs
- type: Documentation
  name: Model APIs Quickstart
  url: https://fal.ai/docs/model-apis/quickstart
- type: Documentation
  name: Model Gallery
  url: https://fal.ai/models
- type: Documentation
  name: Authentication
  url: https://fal.ai/docs/authentication
- type: Documentation
  name: Webhooks
  url: https://fal.ai/docs/model-apis/webhooks
- type: Documentation
  name: Realtime
  url: https://fal.ai/docs/model-apis/real-time
- type: Documentation
  name: Streaming
  url: https://fal.ai/docs/model-apis/streaming
- type: Documentation
  name: File Uploads
  url: https://fal.ai/docs/model-apis/file-uploads
- type: Documentation
  name: Private Serverless Models
  url: https://fal.ai/docs/private-serverless-models
- type: GettingStarted
  url: https://fal.ai/docs/model-apis/quickstart
- type: StatusPage
  url: https://status.fal.ai
- type: Blog
  url: https://blog.fal.ai
- type: SignUp
  url: https://fal.ai/login
- type: Pricing
  url: https://fal.ai/pricing
- type: Support
  name: Discord
  url: https://discord.gg/fal-ai
- type: Forum
  url: https://discord.gg/fal-ai
- type: TermsOfService
  url: https://fal.ai/legal/terms-of-service
- type: PrivacyPolicy
  url: https://fal.ai/legal/privacy-policy
- type: TrustCenter
  url: https://trust.fal.ai
- type: LinkedIn
  url: https://www.linkedin.com/company/featuresandlabels
- type: Twitter
  url: https://twitter.com/fal
- type: GitHubOrganization
  url: https://github.com/fal-ai
- type: SDK
  name: fal Python Client
  url: https://github.com/fal-ai/fal-client-python
- type: SDK
  name: fal JavaScript Client
  url: https://github.com/fal-ai/fal-js
- type: SDK
  name: fal Swift Client
  url: https://github.com/fal-ai/fal-swift
- type: SDK
  name: fal Java/Kotlin Client
  url: https://github.com/fal-ai/fal-java
- type: SDK
  name: fal Dart/Flutter Client
  url: https://github.com/fal-ai/fal-dart
- type: SDK
  name: fal Python SDK / Serverless
  url: https://github.com/fal-ai/fal
- type: Tool
  name: fal Terraform Provider
  url: https://github.com/fal-ai/terraform-provider-fal
- type: Tool
  name: fal Blender Extension
  url: https://github.com/fal-ai/fal-blender-extension
- type: Tool
  name: fal VS Code Extension (Serverless)
  url: https://github.com/fal-ai/serverless-vscode
- type: CodeExamples
  name: Awesome fal
  url: https://github.com/fal-ai/awesome
- type: CodeExamples
  name: Real-Time Demo App
  url: https://github.com/fal-ai/real-time-demo-app
- type: CodeExamples
  name: fal Next.js Template
  url: https://github.com/fal-ai/fal-nextjs-template
- type: Documentation
  name: MCP Server
  url: https://fal.ai/docs/mcp-server
- type: Documentation
  name: ComfyUI Integration
  url: https://fal.ai/docs/comfyui
- url: plans/fal-ai-plans-pricing.yml
  type: Plans
- url: rate-limits/fal-ai-rate-limits.yml
  type: RateLimits
- url: finops/fal-ai-finops.yml
  type: FinOps
- type: Features
  data:
  - Unified queue-based REST API at https://queue.fal.run/{model-id} for 1,000+ generative models
  - Image generation models — FLUX (Schnell, Dev, Pro, Kontext Pro), Seedream V4, Nano Banana, Qwen, SDXL, SD3,
    Ideogram, Recraft
  - Video generation models — Veo 3, Kling 2.5 Turbo Pro, Wan 2.5, Seedance 2.0, Ovi, Hunyuan, Sora-class
  - Audio and voice models — Inworld TTS-1.5, ElevenLabs, MMAudio, MusicGen, Stable Audio
  - 3D and multimodal models — TripoSR, Hunyuan3D, LivePortrait, FaceChain
  - Synchronous, asynchronous queue, server-sent streaming, and WebSocket realtime invocation modes
  - Webhook callbacks for queue completion with HMAC signature verification
  - File uploads / CDN storage at https://v3.fal.media with signed upload URLs
  - fal Serverless — `@fal.function`, `fal.App`, BYO container deployment with autoscaling from 0 to thousands of
    GPUs
  - fal Compute — dedicated H100/H200/A100/B200 instances with SSH and per-second billing
  - Per-output billing (image, video second, audio minute) plus per-second GPU billing for custom deployments
  - 99.99% uptime SLA, SOC 2 compliance, private endpoints, and enterprise support
  - Proprietary Inference Engine — up to 10x faster than reference implementations
  - Official SDKs for Python (fal-client), JavaScript/TypeScript (@fal-ai/client), Swift, Java/Kotlin, Dart
  - fal CLI for serverless deploy / run / apps / secrets / auth
  - fal MCP Server exposing all 1,000+ models to AI assistants via the Model Context Protocol
  - ComfyUI and Blender extensions, plus Terraform provider for infra-as-code
  - Day-zero launch partner for major model releases (FLUX, Veo, Kling, Seedance, Wan, etc.)
  sources:
  - https://fal.ai
  - https://fal.ai/docs
  - https://fal.ai/pricing
  - https://fal.ai/models
  - https://github.com/fal-ai
  - https://blog.fal.ai
  updated: '2026-05-25'
created: '2026-05-25'
modified: '2026-05-25'
position: Consuming
description: fal (Features and Labels, Inc.) is a generative media platform providing the world's fastest API for
  running image, video, audio, and multimodal generative AI models. Through a unified queue-based REST API at
  https://queue.fal.run, plus realtime WebSocket and SSE streaming surfaces, fal serves 1,000+ production models
  — including FLUX, Veo 3, Kling, Wan, Seedream, Nano Banana, and Stable Diffusion — on autoscaling GPU
  infrastructure. fal Serverless lets developers ship custom models with `@fal.function` / `fal.App` / BYO
  containers, while fal Compute provides dedicated H100/H200/A100/B200 instances. Trusted by Canva, Perplexity,
  Poe, and 1.5M+ developers; Series D funded ($140M, Sequoia-led, December 2025); SOC 2 with 99.99% uptime.
maintainers:
- FN: Kin Lane
  email: info@apievangelist.com
  X: apievangelist
  url: https://apievangelist.com
specificationVersion: '0.16'