🚀 Installation#

We provide two installation paths. Docker is recommended — the image ships with UCX, flash-attn, sglang, and CUDA prebuilt.

🐳 Option A: Docker (recommended)#

1. Pull the image

docker pull lmsysorg/sglang-omni:dev

2. Run the container

docker run -it \
    --shm-size 32g \
    --gpus all \
    --ipc host \
    --network host \
    --privileged \
    lmsysorg/sglang-omni:dev \
    /bin/zsh

3. Install sglang-omni inside the container

git clone git@github.com:sgl-project/sglang-omni.git
cd sglang-omni

uv venv .venv -p 3.12
source .venv/bin/activate

uv pip install -v -e .   # drop `-e` for a non-editable install

🛠️ Option B: Manual install#

Build the prerequisites first:

UCX 1.20.x with CUDA + verbs support — follow upstream, or reuse the exact build flags in docker/Dockerfile.
flash-attn-4 — install >=4.0.0b18, matching torch==2.11.0 and SGLang 0.5.16’s nvidia-cutlass-dsl 4.6.0 pin.

Then install:

git clone git@github.com:sgl-project/sglang-omni.git
cd sglang-omni

uv venv .venv -p 3.12
source .venv/bin/activate

uv pip install -v -e .   # drop `-e` for a non-editable install

🚀 Installation

Contents

🚀 Installation#

🐳 Option A: Docker (recommended)#

🛠️ Option B: Manual install#