All routes handled by the obleth data plane (:8080), authentication requirements, and passthrough behavior.
The data plane listens on port 8080 and proxies OpenAI-compatible requests to the configured upstream. All routes require a valid API key in the Authorization header.
| Method | Path | Description |
|---|---|---|
POST | /v1/chat/completions | Chat completions (streaming and non-streaming) |
POST | /v1/completions | Legacy text completions |
POST | /v1/embeddings | Embeddings |
GET | /v1/models | List available models (from obleth's model registry) |
GET | /health | Liveness probe (no auth required) |
All other paths are forwarded to the upstream as-is (passthrough). This means vendor-specific extensions like /v1/batch or /v1/files work without any obleth configuration.
Every request (except /health) must include:
Authorization: Bearer sk_<48 hex chars>
The key is resolved as described in the Authentication reference.
Authorization replaced with model's api_key (if set)obleth forwards client headers except hop-by-hop, auth, and encoding headers:
hostcontent-lengthauthorizationx-api-keyaccept-encodingconnectionWhen cache is enabled for a model, the data plane:
SHA-256(model + request_body) before forwardingobleth:cache:{hash} in Rediscache_ttl_secsCache hits include X-Obleth-Cache: hit. Misses continue through the normal streaming path and are recorded in the usage ledger as cache_status = "miss".