--- layout: home hero: name: "FERRISGRID" text: "Visual control for local agents" tagline: "Turn screens into coordinates, and coordinates into action." actions: - theme: brand text: "Get Started" link: /getting-started/ - theme: alt text: "Commands" link: /commands/ - theme: alt text: "Docker Workspace" link: /workspaces/docker - theme: alt text: "Architecture" link: /concepts/architecture - theme: alt text: "GitHub" link: https://github.com/BrunoV21/FerrisGrid-CLI - theme: alt text: "TypeScript npm mirror" link: https://github.com/BrunoV21/FerrisGrid-CLI-ts features: - title: "Single-step execution" details: "Every invocation observes once or executes one validated action, then exits. The agent owns the reasoning loop." - title: "Coordinate-first screenshots" details: "Screenshots include deterministic metadata so image-space choices map back to native screen coordinates." - title: "Eyes plus a map" details: "FerrisGrid turns desktop pixels into structured observations an LLM can reason over without hiding the underlying screenshot." - title: "Local traces" details: "Screenshots, metadata, action requests, parsed actions, results, and recap artifacts stay under .ferrisgrid by default." - title: "Docker workspace" details: "Run a Linux desktop in the background, watch it through noVNC, and keep agent input away from your main desktop." - title: "Compact Markdown protocol" details: "FerrisGrid prints agent-readable Markdown for observations, actions, errors, and doctor checks." - title: "Cross-platform shape" details: "The same observe/act interface is designed to span macOS, Linux, and Windows as platform backends mature." ---