--- name: edge-router description: "Route AI agent compute tasks to the cheapest viable backend. Supports local inference (Ollama), cloud GPU (Vast.ai), and quantum hardware (Wukong 72Q). Use when an agent needs to decide where to run a task, optimize compute costs, check backend availability, or execute workloads across edge/cloud/quantum infrastructure." --- # Edge Router Routes tasks to cheapest available backend: local (free) → cloud GPU ($0.01) → quantum ($0.10). ## API Base: `https://edge-router.gpupulse.dev/api/v1` (or localhost:3825) ### Route (recommend) ```bash curl -X POST "$BASE/route" -H "Content-Type: application/json" \ -d '{"task_type": "inference"}' ``` ### Execute (route + run) ```bash curl -X POST "$BASE/execute" -H "Content-Type: application/json" \ -d '{"task_type": "inference", "payload": {"model": "llama3.2:1b", "prompt": "hello"}}' ``` ### Task Types - `inference` → local first, cloud fallback - `training` → cloud GPU - `quantum` → Wukong 72Q - `auto` → cheapest available ### Other - `GET /backends` — list + status - `GET /stats` — routing statistics - `GET /health` — health check