🚀 Installation#

We provide two installation paths. Docker is recommended — the image ships with UCX, flash-attn, sglang, and CUDA prebuilt.

🛠️ Option B: Manual install#

Build the prerequisites first:

  • UCX 1.20.x with CUDA + verbs support — follow upstream, or reuse the exact build flags in docker/Dockerfile.

  • flash-attn — install a prebuilt wheel matching your torch==2.9.1 / CUDA version.

Then install:

git clone git@github.com:sgl-project/sglang-omni.git
cd sglang-omni

uv venv .venv -p 3.12
source .venv/bin/activate

uv pip install -v -e .   # drop `-e` for a non-editable install