---
title: ElevenLabs GraphQL Schema
description: Conceptual GraphQL schema for the ElevenLabs AI voice and speech synthesis platform.
provider: ElevenLabs
baseURL: https://api.elevenlabs.io
docsURL: https://elevenlabs.io/docs/api-reference/
schemaFile: elevenlabs-schema.graphql
version: 1.0.0
created: 2026-06-14
---

# ElevenLabs GraphQL Schema

This is a conceptual GraphQL schema representing the ElevenLabs AI voice and speech synthesis platform. ElevenLabs provides a REST API; this schema translates those REST capabilities into a strongly-typed GraphQL surface, making it straightforward to compose voice, audio, and conversational AI operations in a single query layer.

## Coverage

The schema covers the full breadth of the ElevenLabs platform:

- **Voice Management** — list, retrieve, add, edit, delete, and share voices; manage voice settings; browse the public voice library.
- **Text-to-Speech (TTS)** — synthesize speech from text using any voice and model, with control over output format, latency optimization, seed, and text normalization.
- **Speech-to-Speech (STS)** — convert an uploaded audio recording to a target voice while preserving timing and emotion.
- **Voice Cloning** — instant voice cloning from short audio samples and professional voice cloning with verification.
- **Voice Design** — generate novel AI voices from text descriptions specifying gender, age, accent, and strength, then save them to the library.
- **Sound Effects** — generate cinematic sound effects from natural language descriptions.
- **Audio Isolation** — remove background noise from uploaded audio.
- **Dubbing** — create and manage dubbing projects that translate video or audio into target languages while preserving speaker voice characteristics.
- **Generation History** — browse and manage TTS and STS generation history.
- **Studio Projects** — create and render long-form audio projects organized into chapters with per-chapter voice and model assignments.
- **Conversational AI Agents** — configure, deploy, and query real-time voice agents with LLM prompts, knowledge bases, and tool integrations.
- **Pronunciation Dictionaries** — manage custom pronunciation dictionaries referenced in TTS requests.
- **Subscription and Usage** — retrieve current plan, character usage, quotas, and invoice information.
- **API Keys and Webhooks** — manage API credentials and register webhook endpoints for async event delivery.

## Types

| Type | Purpose |
|---|---|
| `Voice` | Full voice resource with metadata, samples, and settings |
| `VoiceDetails` | Extended voice metadata including sharing and fine-tuning state |
| `VoiceID` | Scalar — unique voice identifier |
| `VoiceName` | Scalar — display name for a voice |
| `VoiceCategory` | Enum — PREMADE, CLONED, GENERATED, PROFESSIONAL |
| `VoiceLabels` | Accent, description, age, gender, and use-case tags |
| `VoiceSamples` | Collection of audio samples for a voice |
| `VoiceSample` | A single audio sample with file metadata |
| `SampleID` | Scalar — unique sample identifier |
| `VoiceSettings` | Stability, similarity boost, style, and speaker boost parameters |
| `Stability` | Scalar — delivery consistency (0.0–1.0) |
| `SimilarityBoost` | Scalar — voice likeness strength (0.0–1.0) |
| `Style` | Scalar — style exaggeration level (0.0–1.0) |
| `SpeakerBoost` | Scalar — speaker boost toggle |
| `VoiceDesign` | Parameters for generating a new voice from a text description |
| `VoiceDescription` | Scalar — natural language voice description |
| `VoiceCreation` | Input parameters for creating a voice from audio files |
| `ClonedVoice` | A cloned voice resource returned after creation |
| `VoiceClone` | Clone creation result with verification status |
| `VoiceSharingEnabled` | Scalar — sharing toggle state |
| `AddVoiceRequest` | Input for adding a new voice |
| `SharedVoice` | A voice shared publicly in the library |
| `VoiceSharing` | Sharing metadata including likes, clones, and review status |
| `VoiceLibrary` | Paginated listing of shared voices |
| `VoiceFineTuning` | Fine-tuning progress and model association |
| `VoiceVerification` | Professional clone verification status |
| `VoiceVerificationAttempt` | Individual verification attempt record |
| `TextToSpeech` | TTS operation context |
| `TTSRequest` | Request parameters for a TTS synthesis |
| `TTSResult` | Synthesized audio with metadata |
| `AudioOutput` | Downloadable audio descriptor |
| `OutputFormat` | Enum — MP3/PCM/ULAW at various bitrates and sample rates |
| `AudioStream` | WebSocket streaming session descriptor |
| `LatencyOptimization` | Enum — DEFAULT, NORMAL, STRONG, MAX, MAX_TEXT_NORMALIZER |
| `ModelID` | Scalar — unique model identifier |
| `TTSModel` | Synthesis model with capabilities and limits |
| `ModelLanguage` | A language supported by a model |
| `SpeechToSpeech` | STS operation context |
| `STSRequest` | Request parameters for a speech-to-speech conversion |
| `STSResult` | Converted audio with metadata |
| `AudioInput` | Metadata descriptor for an uploaded audio file |
| `DubbingProject` | A dubbing project summary |
| `Dubbing` | Full dubbing resource with segment detail |
| `DubLanguage` | Enum — supported dubbing target/source languages |
| `DubSegment` | A timed segment within a dubbing project |
| `DubSource` | Source media descriptor |
| `SoundEffect` | A generated sound effect result |
| `SoundEffectRequest` | Request parameters for sound effect generation |
| `VoiceGeneration` | A generated voice preview from a design prompt |
| `GenerationHistory` | Paginated generation history |
| `HistoryItem` | A single generation history record |
| `ItemDetails` | Detailed metadata for a history item |
| `CharacterCount` | Scalar — character usage count |
| `TTSUsage` | Aggregate TTS usage statistics |
| `SubscriptionPlan` | Enum — FREE, STARTER, CREATOR, PRO, SCALE, BUSINESS, ENTERPRISE |
| `SubscriptionInfo` | Current subscription and character usage |
| `PlanQuota` | Hard limits for a given subscription plan |
| `SubscriptionInvoice` | Next invoice details |
| `APIKey` | An ElevenLabs API key resource |
| `Token` | A session authentication token |
| `Webhook` | A registered webhook endpoint |
| `WebhookEvent` | An event payload delivered to a webhook |
| `WebhookEventType` | Enum — async event types (TTS_COMPLETED, DUBBING_COMPLETED, etc.) |
| `PronunciationDictionary` | A custom pronunciation dictionary |
| `PronunciationDictionaryLocator` | Reference to a dictionary version for TTS requests |
| `ConversationalAgent` | An AI voice agent configuration |
| `AgentPrompt` | System prompt and LLM parameters for an agent |
| `AgentTool` | A tool available to a conversational agent |
| `KnowledgeBaseDocument` | A document in an agent's knowledge base |
| `Conversation` | A conversation session with an agent |
| `ConversationTurn` | A single turn in a conversation |
| `StudioProject` | A long-form audio project |
| `ProjectChapter` | A chapter within a Studio project |
| `VoiceSettingsInput` | Input type for voice settings mutations |

## Queries

- `voices` — list all voices accessible to the authenticated user
- `voice(voiceId)` — retrieve full details for a single voice
- `defaultVoiceSettings` — retrieve system default voice settings
- `voiceLibrary(...)` — search and paginate the public shared voice library
- `models` — list all available synthesis models
- `history(...)` — paginate generation history
- `historyItem(historyItemId)` — retrieve a single history record
- `subscription` — current plan tier, character usage, and limits
- `planQuota(plan)` — hard limits for any given plan tier
- `apiKeys` — list API keys
- `webhooks` — list registered webhook endpoints
- `dubbingProject(dubbingId)` — retrieve a dubbing project with segments
- `studioProjects` — list all Studio projects
- `studioProject(projectId)` — retrieve a single Studio project
- `pronunciationDictionaries` — list pronunciation dictionaries
- `conversationalAgents` — list configured voice agents
- `conversationalAgent(agentId)` — retrieve a single agent configuration
- `conversation(conversationId)` — retrieve a conversation session with transcript

## Mutations

- `textToSpeech` — synthesize text to audio
- `speechToSpeech` — convert audio to a target voice
- `generateSoundEffect` — generate a sound effect from a description
- `designVoice` — preview generated voices from demographic parameters
- `saveDesignedVoice` — save a generated voice preview to the library
- `addVoice` — create a new cloned voice from audio samples
- `editVoice` — update a voice name, description, or labels
- `deleteVoice` — remove a voice
- `shareVoice` — enable or disable public sharing for a voice
- `addSharedVoice` — copy a shared voice to the authenticated user's library
- `updateVoiceSettings` — update fine-grained synthesis parameters for a voice
- `createDubbing` — start a dubbing project
- `deleteDubbing` — remove a dubbing project
- `isolateAudio` — strip background noise from an audio file
- `deleteHistoryItems` — remove history records
- `createStudioProject` — initialize a new long-form audio project
- `addProjectChapter` — add a chapter to a Studio project
- `convertProject` — render a Studio project to audio
- `createConversationalAgent` — configure a new voice agent
- `updateConversationalAgent` — update an existing agent
- `deleteConversationalAgent` — remove an agent
- `createWebhook` — register a webhook endpoint
- `deleteWebhook` — remove a webhook
- `createAPIKey` — generate a new API key
- `deleteAPIKey` — revoke an API key
- `createPronunciationDictionary` — upload a pronunciation dictionary

## References

- API Reference: https://elevenlabs.io/docs/api-reference/
- GitHub: https://github.com/elevenlabs
- Pricing: https://elevenlabs.io/pricing