🚀 Installation

We provide two installation paths. Docker is recommended: the image ships with UCX, flash-attn, sglang, and CUDA prebuilt.

🛠️ Option B: Manual install

Build the prerequisites first:

  • UCX 1.20.x with CUDA + verbs support: follow the upstream build instructions, or reuse the exact build flags in docker/Dockerfile.

  • flash-attn-4: install a version in the range >=4.0.0b9,<4.0.0b16, which matches torch==2.11.0 and SGLang’s nvidia-cutlass-dsl pin.
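Before moving on, it can help to confirm that both prerequisites are visible from the shell you will install from. The sketch below is illustrative and not part of the project: `in_range` is a hypothetical helper, it relies on `sort -V` (GNU coreutils) rather than a full PEP 440 comparison, and it assumes the installed distribution is named `flash-attn-4` as in the list above.

```shell
#!/usr/bin/env bash
# in_range VERSION LOWER UPPER (hypothetical helper)
# Succeeds when LOWER <= VERSION < UPPER under `sort -V` ordering.
# Adequate for beta tags such as 4.0.0b9; not a general PEP 440 comparison.
in_range() {
  v="$1"; lo="$2"; hi="$3"
  # Upper bound is exclusive.
  [ "$v" != "$hi" ] || return 1
  # Lower bound is inclusive: LOWER must sort first (or be equal).
  first="$(printf '%s\n%s\n' "$lo" "$v" | sort -V | head -n 1)"
  [ "$first" = "$lo" ] || return 1
  # VERSION must sort before UPPER.
  first="$(printf '%s\n%s\n' "$v" "$hi" | sort -V | head -n 1)"
  [ "$first" = "$v" ]
}

# UCX: `ucx_info -v` prints the library version and the configure flags,
# which is where CUDA and verbs support should show up.
if command -v ucx_info >/dev/null 2>&1; then
  ucx_info -v
else
  echo "ucx_info not found on PATH"
fi

# flash-attn-4: read the installed version, if any, and compare it to the range.
# The distribution name "flash-attn-4" is an assumption taken from the list above.
fa_version="$(python3 -c 'from importlib.metadata import version; print(version("flash-attn-4"))' 2>/dev/null || true)"
if [ -z "$fa_version" ]; then
  echo "flash-attn-4 is not installed in this environment"
elif in_range "$fa_version" 4.0.0b9 4.0.0b16; then
  echo "flash-attn-4 $fa_version is within >=4.0.0b9,<4.0.0b16"
else
  echo "flash-attn-4 $fa_version is outside >=4.0.0b9,<4.0.0b16"
fi
```

The script only reports what it finds and changes nothing. If `python3` resolves to an interpreter outside the environment where flash-attn-4 was installed, the version lookup will report it as missing.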

Then install:

# SSH clone; use https://github.com/sgl-project/sglang-omni.git if no SSH key is configured
git clone git@github.com:sgl-project/sglang-omni.git
cd sglang-omni

uv venv .venv -p 3.12
source .venv/bin/activate

uv pip install -v -e .   # drop `-e` for a non-editable install