# @memberjunction/ai-azure
MemberJunction AI provider for Azure OpenAI Service. This package provides both LLM and embedding capabilities through Azure's enterprise-grade deployment of OpenAI models, implementing `BaseLLM` and `BaseEmbeddings` from `@memberjunction/ai`.
## Architecture
```mermaid
graph TD
A["AzureLLM
(Provider)"] -->|extends| B["BaseLLM
(@memberjunction/ai)"]
C["AzureEmbedding
(Provider)"] -->|extends| D["BaseEmbeddings
(@memberjunction/ai)"]
A -->|wraps| E["Azure AI Inference
REST Client"]
C -->|wraps| E
A -->|authenticates via| F["API Key or
Azure AD (Entra ID)"]
B -->|registered via| G["@RegisterClass"]
D -->|registered via| G
style A fill:#7c5295,stroke:#563a6b,color:#fff
style C fill:#7c5295,stroke:#563a6b,color:#fff
style B fill:#2d6a9f,stroke:#1a4971,color:#fff
style D fill:#2d6a9f,stroke:#1a4971,color:#fff
style E fill:#2d8659,stroke:#1a5c3a,color:#fff
style F fill:#b8762f,stroke:#8a5722,color:#fff
style G fill:#b8762f,stroke:#8a5722,color:#fff
```
## Features
- **Chat Completions**: Full support for Azure-hosted language models including GPT-4, GPT-3.5-Turbo, and Phi-4
- **Streaming Support**: Real-time streaming responses for enhanced user experience
- **Text Embeddings**: Generate vector embeddings for semantic search and similarity matching
- **Dual Authentication**: Support for both API key and Azure Active Directory (Entra ID) authentication
- **Text Processing**: Built-in text summarization and classification capabilities
- **Type Safety**: Full TypeScript support with comprehensive type definitions
- **Factory Pattern**: Seamless integration with MemberJunction's AI factory system
## Installation
```bash
npm install @memberjunction/ai-azure
```
## Configuration
### Prerequisites
1. An Azure subscription with Azure AI or Azure OpenAI service deployed
2. Either an API key for your Azure AI resource or Azure AD credentials
3. The endpoint URL for your Azure AI resource
### Authentication Methods
#### API Key Authentication
```typescript
import { AzureLLM } from '@memberjunction/ai-azure';
const azureLLM = new AzureLLM('your-api-key');
azureLLM.SetAdditionalSettings({
endpoint: 'https://your-resource.openai.azure.com/'
});
```
#### Azure AD Authentication
```typescript
const azureLLM = new AzureLLM('');
azureLLM.SetAdditionalSettings({
endpoint: 'https://your-resource.openai.azure.com/',
useAzureAD: true
});
```
## Usage
### Chat Completion
```typescript
const result = await azureLLM.ChatCompletion({
model: 'gpt-4',
messages: [
{ role: 'system', content: 'You are a helpful assistant.' },
{ role: 'user', content: 'Explain the theory of relativity.' }
],
maxOutputTokens: 500,
temperature: 0.7
});
if (result.success) {
console.log(result.data.choices[0].message.content);
}
```
### Streaming
```typescript
const result = await azureLLM.ChatCompletion({
model: 'gpt-4',
messages: [{ role: 'user', content: 'Write a detailed story.' }],
streaming: true,
streamingCallbacks: {
OnContent: (content) => process.stdout.write(content),
OnComplete: (result) => console.log('\nDone!')
}
});
```
### Embeddings
```typescript
import { AzureEmbedding } from '@memberjunction/ai-azure';
const embedder = new AzureEmbedding('your-api-key');
embedder.SetAdditionalSettings({
endpoint: 'https://your-resource.openai.azure.com/'
});
const result = await embedder.EmbedText({
model: 'text-embedding-ada-002',
text: 'Sample text for embedding'
});
console.log(`Dimensions: ${result.vector.length}`);
```
## Configuration Options
| Setting | Type | Required | Description |
|---------|------|----------|-------------|
| `endpoint` | `string` | Yes | Azure AI resource endpoint URL |
| `useAzureAD` | `boolean` | No | Use Azure AD authentication instead of API key |
## Supported Parameters
| Parameter | Supported | Notes |
|-----------|-----------|-------|
| temperature | Yes | 0.0 - 2.0 |
| maxOutputTokens | Yes | Maximum tokens to generate |
| topP | Yes | Nucleus sampling (0.0 - 1.0) |
| frequencyPenalty | Yes | -2.0 to 2.0 |
| presencePenalty | Yes | -2.0 to 2.0 |
| seed | Yes | Deterministic outputs |
| stopSequences | Yes | Stop generation sequences |
| responseFormat | Yes | Text, JSON, Markdown |
| streaming | Yes | Real-time streaming |
| topK | No | Not available in Azure OpenAI |
| minP | No | Not available in Azure OpenAI |
## Class Registration
- `AzureLLM` -- Registered via `@RegisterClass(BaseLLM, 'AzureLLM')`
- `AzureEmbedding` -- Registered via `@RegisterClass(BaseEmbeddings, 'AzureEmbedding')`
## Dependencies
- `@memberjunction/ai` - Core AI abstractions
- `@memberjunction/global` - Class registration
- `@azure-rest/ai-inference` - Azure AI inference REST client
- `@azure/core-auth` - Azure authentication core
- `@azure/identity` - Azure identity and credential management