-
Notifications
You must be signed in to change notification settings - Fork 4.3k
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
ROCm (gfx1100): sgl-kernel HIP enablement
amd
dependencies
Pull requests that update a dependency file
sgl-kernel
#17958
opened Jan 29, 2026 by
ljubomirj
Loading…
ROCm (gfx1100): AWQ/GLM4 fixes, vllm fallbacks, dtype fixes
amd
dependencies
Pull requests that update a dependency file
quant
LLM Quantization
sgl-kernel
#17957
opened Jan 29, 2026 by
ljubomirj
Loading…
[Liquid AI] Support LFM VL
deepseek
diffusion
SGLang Diffusion
Multi-modal
multi-modal language model
quant
LLM Quantization
#17954
opened Jan 29, 2026 by
vincentzed
Loading…
1 of 5 tasks
[AMD] support two batch overlapping for mori ep
deepseek
documentation
Improvements or additions to documentation
#17953
opened Jan 29, 2026 by
billishyahao
Loading…
5 tasks
Add A3 8/16-NPU runners, constant file and test cases for basic functional parameters, LLM and multimodal models & API
deepseek
Multi-modal
multi-modal language model
npu
#17952
opened Jan 29, 2026 by
Sugar920
Loading…
5 tasks
Direct model loading from object storage with Runai Model Streamer
documentation
Improvements or additions to documentation
#17948
opened Jan 29, 2026 by
noa-neria
Loading…
[MUSA][8/N] Port CUDA kernels that are compatible with MUSA
dependencies
Pull requests that update a dependency file
mthreads
quant
LLM Quantization
sgl-kernel
#17946
opened Jan 29, 2026 by
yafengio
Loading…
3 of 5 tasks
Do online LLM Quantization
fp8 quantization while loading weights instead of in process_weights_after_loading, reducing memory overhead
quant
#17945
opened Jan 29, 2026 by
fxmarty-amd
Loading…
1 of 2 tasks
[diffusion] Fix b64 Output logic
diffusion
SGLang Diffusion
#17944
opened Jan 29, 2026 by
varadrane1707
Loading…
2 of 5 tasks
【docs】【Ascend】Update Expert Parallelism docs for Ascend NPU
documentation
Improvements or additions to documentation
#17940
opened Jan 29, 2026 by
husf1130
Loading…
5 tasks
Support passing spaces_between_special_tokens per request
#17939
opened Jan 29, 2026 by
RunningLeon
Loading…
5 tasks
[diffusion]Allows quality adjustment of generated images/videos through requests.
diffusion
SGLang Diffusion
run-ci
#17937
opened Jan 29, 2026 by
IPostYellow
Loading…
5 tasks
support fused_moe_triton and moe_sum_all_reduce kernel fusion[reduce 20-30% TTFT]
#17931
opened Jan 29, 2026 by
xieminghe1
Loading…
5 tasks
Fix SHM pointer re-serialization in DP attention.
run-ci
#17930
opened Jan 29, 2026 by
FlamingoPg
Loading…
2 of 5 tasks
fix: zmq_to_tokenizer encoder transfer when host listens to 0.0.0.0
#17929
opened Jan 29, 2026 by
RangerCD
Loading…
5 tasks
[Refactor] [Diffusion] Refactor custom ops
diffusion
SGLang Diffusion
#17928
opened Jan 29, 2026 by
Makcum888e
•
Draft
2 of 5 tasks
feat: Add ModelScope support for multimodal_gen models
diffusion
SGLang Diffusion
#17924
opened Jan 29, 2026 by
yrk111222
Loading…
2 of 5 tasks
[EPD][refactor]: introduce BaseMMReceiver for gRPC transport integration
#17921
opened Jan 29, 2026 by
liusy58
Loading…
5 tasks
Enable Sglang diffusion on Intel XPU with flux.1-dev (#53)
dependencies
Pull requests that update a dependency file
diffusion
SGLang Diffusion
#17920
opened Jan 29, 2026 by
sushildubey171
•
Draft
5 tasks
Previous Next
ProTip!
Adding no:label will show everything without a label.