Introduction

obleth is a lightweight, high-performance, fairshare-first AI gateway. It sits between your front door (HAProxy) and your inference backend (Aibrix / vLLM) and ow

obleth is a lightweight, high-performance, fairshare-first AI gateway. It sits between your front door (HAProxy) and your inference backend (Aibrix / vLLM) and owns multi-tenant identity, contention-based weighted fair queuing, token-accurate cost accounting, and reliability.

Start here

  • Quick Start — Docker Compose, first API key, first chat completion (~5 min)
  • Why obleth? — how it differs from LiteLLM and raw vLLM routing
  • Fairshare engine — weighted queuing under saturation

Reference

The gateway source code lives in obleth-gateway. It is source-available under the Elastic License 2.0 — you may use and modify it for your own workloads; you may not offer it as a hosted or managed gateway service to third parties.