DeveloperDeveloper Guide
Authentication & Request Flow
This document explains how requests flow through Cube AI, including authentication, authorization, and routing to LLM backends.
Overview

Flow Description
- Client requests enter through Traefik.
- Cube Proxy intercepts all incoming requests.
- Authentication and authorization are handled via SuperMQ Auth.
- Authorized requests are routed to the configured LLM backend (e.g. Ollama).
- UI traffic is routed separately to Cube AI UI and SuperMQ Users services.
This flow ensures that all inference requests are authenticated, authorized, and securely routed before reaching any model backend.