-
Notifications
You must be signed in to change notification settings - Fork 6.5k
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[diffusion] Fuse tanh-GELU into shared MLP FFN up-proj GEMM epilogue
diffusion
SGLang Diffusion
#28366
opened Jun 16, 2026 by
BBuf
Collaborator
Loading…
3 of 4 tasks
[Scheduler] Add SLA-constrained dynamic batching for adaptive decode throughput
#28365
opened Jun 16, 2026 by
MissyLee2018
Loading…
5 tasks
[core] Gate the overlap WAR barrier on forward reads to recover decode throughput
#28363
opened Jun 16, 2026 by
hnyls2002
Collaborator
Loading…
[AMD] moe_shared_gate_residual_fuse: MoE Shared-Expert Sigmoid Gate + Residual Add
jit-kernel
#28362
opened Jun 16, 2026 by
yichiche
Collaborator
Loading…
3 tasks done
[AMD] gdn_qk_l2norm_fuse: GatedDeltaNet Q/K L2Norm Fusion
jit-kernel
#28361
opened Jun 16, 2026 by
yichiche
Collaborator
Loading…
3 tasks done
[AMD] Fix AITER Scout workflow permissions
amd
#28360
opened Jun 16, 2026 by
yctseng0211
Collaborator
Loading…
Markdown format modification
deepseek
documentation
Improvements or additions to documentation
hicache
Hierarchical Caching for SGLang
lora
Multi-modal
multi-modal language model
npu
quant
LLM Quantization
speculative-decoding
#28359
opened Jun 16, 2026 by
a60124901
Loading…
5 tasks
[AMD] fix(jit): port kv_canary write/verify/plan kernels to ROCm
jit-kernel
#28357
opened Jun 16, 2026 by
michaelzhang-ai
Collaborator
Loading…
2 of 3 tasks
feat: add FlashInfer cutlass FP8 block-scale MoE backend for Qwen3.5
#28355
opened Jun 16, 2026 by
anish-shanbhag
•
Draft
docs(cookbook): migrate Gemma4 to the config-driven template
documentation
Improvements or additions to documentation
#28353
opened Jun 15, 2026 by
zijiexia
Collaborator
Loading…
docs(cookbook): migrate MiniMax-M2.7 to the config-driven template
documentation
Improvements or additions to documentation
#28352
opened Jun 15, 2026 by
zijiexia
Collaborator
Loading…
docs(cookbook): migrate Nemotron3-Ultra to the config-driven template
documentation
Improvements or additions to documentation
#28351
opened Jun 15, 2026 by
zijiexia
Collaborator
Loading…
docs(cookbook): migrate GLM-5.1 to the config-driven template
documentation
Improvements or additions to documentation
#28350
opened Jun 15, 2026 by
zijiexia
Collaborator
Loading…
[AMD]: Enable NIXL PD disaggregation for ROCm(1/n)
amd
#28348
opened Jun 15, 2026 by
Lzy17
Contributor
Loading…
5 tasks
Revert Gemma4 modelopt fp4 MoE backend change
#28347
opened Jun 15, 2026 by
mmangkad
Collaborator
Loading…
[AMD] Make HIP capability probes import-safe when no HIP GPU is visible
#28345
opened Jun 15, 2026 by
XinyuJiangCMU
Contributor
Loading…
[AMD] register 6 2-gpu tests to stage-b-test-2-gpu-large-amd
run-ci
#28344
opened Jun 15, 2026 by
michaelzhang-ai
Collaborator
Loading…
[Kimi K2.5] Fix eagle3 aux capture for tp>1 when AR fusion is enabled
deepseek
#28343
opened Jun 15, 2026 by
kpham-sgl
Collaborator
Loading…
5 tasks done
[Tokenizer] Fix abort racing server crash when large amount of aborts
bypass-fastfail
run-ci
run-ci-extra
#28341
opened Jun 15, 2026 by
hanming-lu
Collaborator
Loading…
5 tasks
Precompute FP8 KV cache inverse scales to drop per-forward reciprocal kernels
blackwell
SM100/SM120
#28339
opened Jun 15, 2026 by
vschandramourya
Loading…
3 tasks done
docs: add Amazon SageMaker AI deployment guide
documentation
Improvements or additions to documentation
#28338
opened Jun 15, 2026 by
Jyothirmaikottu
Loading…
2 tasks done
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.