Skip to content
@sgl-project

sgl-project

Pinned Loading

  1. sglang sglang Public

    SGLang is a fast serving framework for large language models and vision language models.

    Python 20.1k 3.4k

  2. sgl-learning-materials sgl-learning-materials Public

    Materials for learning SGLang

    644 48

  3. ome ome Public

    OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs)

    Go 312 44

  4. genai-bench genai-bench Public

    Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

    Python 230 29

  5. SpecForge SpecForge Public

    Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

    Python 466 106

  6. sglang-jax sglang-jax Public

    JAX backend for SGL

    Python 160 29

Repositories

Showing 10 of 18 repositories
  • sglang-jax Public

    JAX backend for SGL

    sgl-project/sglang-jax’s past year of commit activity
    Python 160 Apache-2.0 29 45 (1 issue needs help) 17 Updated Nov 12, 2025
  • sglang Public

    SGLang is a fast serving framework for large language models and vision language models.

    sgl-project/sglang’s past year of commit activity
    Python 20,088 Apache-2.0 3,354 583 (32 issues need help) 844 Updated Nov 12, 2025
  • SpecForge Public

    Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

    sgl-project/SpecForge’s past year of commit activity
    Python 466 MIT 106 47 (5 issues need help) 14 Updated Nov 12, 2025
  • sgl-project.github.io Public

    This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang/tree/main/docs.

    sgl-project/sgl-project.github.io’s past year of commit activity
    HTML 89 20 8 0 Updated Nov 12, 2025
  • sgl-flash-attn Public Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    sgl-project/sgl-flash-attn’s past year of commit activity
    Python 14 BSD-3-Clause 2,142 0 0 Updated Nov 12, 2025
  • sgl-kernel-npu Public

    SGLang kernel library for NPU

    sgl-project/sgl-kernel-npu’s past year of commit activity
    C++ 72 MIT 47 7 16 Updated Nov 12, 2025
  • rbg Public

    A workload for deploying LLM inference services on Kubernetes

    sgl-project/rbg’s past year of commit activity
    Go 101 Apache-2.0 28 9 4 Updated Nov 12, 2025
  • FlashMLA Public Forked from deepseek-ai/FlashMLA

    FlashMLA: Efficient Multi-head Latent Attention Kernels

    sgl-project/FlashMLA’s past year of commit activity
    C++ 0 MIT 904 0 0 Updated Nov 11, 2025
  • sgl-kernel-xpu Public

    SGLang kernel library for Intel XPU

    sgl-project/sgl-kernel-xpu’s past year of commit activity
    Python 13 MIT 13 0 9 Updated Nov 11, 2025
  • genai-bench Public

    Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

    sgl-project/genai-bench’s past year of commit activity
    Python 230 MIT 29 6 6 Updated Nov 10, 2025