aid: assemblyai name: AssemblyAI description: Built by AI experts, AssemblyAI's Speech AI models include accurate speech-to-text for voice data (such as calls, virtual meetings, and podcasts), speaker detection, sentiment analysis, chapter detection, PII redaction, and more. AssemblyAI provides powerful APIs for transcribing and understanding audio data at scale. The platform supports real-time streaming transcription via WebSocket, asynchronous batch transcription, and audio intelligence features including summarization, auto chapters, entity detection, and content safety filtering. SDKs are available for Python, Node.js, Ruby, Java, and Go. type: Index image: https://kinlane-productions.s3.amazonaws.com/apis-json/apis-json-logo.jpg tags: - AI - Artificial Intelligence - Audio - Speech - Transcription - Speech to Text url: https://raw.githubusercontent.com/api-evangelist/assemblyai/refs/heads/main/apis.yml created: '2024-06-06' modified: '2026-05-19' specificationVersion: '0.19' apis: - aid: assemblyai:assemblyai-api name: AssemblyAI API description: The AssemblyAI API provides speech-to-text transcription, speaker diarization, sentiment analysis, chapter detection, PII redaction, and other audio intelligence capabilities via REST and WebSocket interfaces. humanURL: https://www.assemblyai.com/docs/ baseURL: https://api.assemblyai.com tags: - Audio Intelligence - Speech to Text - Transcription properties: - type: Documentation url: https://www.assemblyai.com/docs/ - type: GettingStarted url: https://www.assemblyai.com/docs/getting-started/transcribe-an-audio-file - type: Authentication url: https://www.assemblyai.com/docs/concepts/authentication - type: APIReference url: https://www.assemblyai.com/docs/api-reference/overview - type: OpenAPI url: openapi/assemblyai-openapi-original.yml - type: AsyncAPI url: openapi/assemblyai-asyncapi-original.yml - type: NaftikoCapability url: capabilities/assemblyai-lemur.yaml - type: NaftikoCapability url: capabilities/assemblyai-streaming.yaml - type: NaftikoCapability url: capabilities/assemblyai-transcript.yaml common: - type: LinkedIn url: https://www.linkedin.com/company/assemblyai - type: Portal url: https://www.assemblyai.com/ title: AssemblyAI Website - type: Documentation url: https://www.assemblyai.com/docs/ title: Documentation - type: Blog url: https://www.assemblyai.com/blog title: Blog - type: SignUp url: https://www.assemblyai.com/dashboard/signup title: Sign Up - type: Login url: https://www.assemblyai.com/dashboard/login title: Login - type: Pricing url: https://www.assemblyai.com/pricing title: Pricing - type: GitHubOrganization url: https://github.com/AssemblyAI title: AssemblyAI GitHub Organization - type: StatusPage url: https://status.assemblyai.com/ title: Status Page - type: Features data: - name: Speech-to-Text Transcription description: High-accuracy transcription of audio files and streams using AssemblyAI's Universal-2 model with support for 99+ languages and custom vocabulary. - name: Real-Time Streaming Transcription description: WebSocket-based streaming transcription for live audio with partial results and final transcripts, supporting call centers, live captioning, and voice applications. - name: Speaker Diarization description: Automatic speaker detection and labeling that identifies who said what in multi-speaker recordings. - name: Audio Intelligence description: Advanced understanding features including sentiment analysis, summarization, auto chapters, entity detection, content safety filtering, and PII redaction. - name: LeMUR description: LeMUR (Leveraging Large Language Models for Understanding Recordings) enables asking questions of audio transcripts using a conversational AI interface built on top of transcriptions. - type: UseCases data: - name: Call Center Analytics description: Customer service teams transcribe and analyze customer calls for quality assurance, compliance, agent coaching, and sentiment analysis. - name: Meeting Intelligence description: Enterprises transcribe virtual meetings (Zoom, Teams, Meet) to generate summaries, action items, and searchable archives. - name: Podcast Processing description: Podcast producers transcribe episodes for SEO, accessibility, show notes, and content repurposing. - name: Voice Application Development description: Developers build voice-powered applications using real-time streaming transcription for voice commands, dictation, and conversation interfaces. - name: Compliance and Legal description: Legal and compliance teams transcribe depositions, hearings, and recorded communications with PII redaction and timestamped transcripts. - type: Integrations data: - name: Twilio description: Integration with Twilio Media Streams for transcribing phone calls in real-time using AssemblyAI's streaming API. - name: Zoom description: Integration with Zoom recordings for batch transcription and meeting intelligence processing. - name: Python SDK description: Official Python SDK for AssemblyAI available on PyPI (assemblyai) for easy integration in Python applications. - name: Node.js SDK description: Official Node.js SDK for AssemblyAI available on npm (@assemblyai/sdk) for JavaScript and TypeScript applications. - name: Agent Skills url: https://github.com/AssemblyAI/assemblyai-skill type: AgentSkill maintainers: - FN: Kin Lane email: kin@apievangelist.com