Pull requests: sgl-project/sglang
Pull requests list
[k8s] Clarified the usage of shared memory.
#4341
opened Mar 12, 2025 by
jsuchome
2 of 6 tasks
docs: add parameter --log-requests-level
#4335
opened Mar 12, 2025 by
panpan0000
2 of 6 tasks
Remove the choices in --speculative-eagle-topk argument
#4329
opened Mar 12, 2025 by
Achazwl
1 of 6 tasks
Fix Llama3.3 tool call support
high priority
#4320
opened Mar 11, 2025 by
CatherineSue
3 of 6 tasks
[Feature] Support "strict" in function calling
#4310
opened Mar 11, 2025 by
DarkSharpness
3 of 6 tasks
add moe topk softmax templated from vllm to improve
#4302
opened Mar 11, 2025 by
qingquansong
Draft
6 tasks
[Fix] Fix a bug when calculating m in benchmark_lightning_attention_prefill.py
#4294
opened Mar 11, 2025 by
xiefan46
6 tasks
Add --metrics-flush-interval flag to solve metrics sticking
#4285
opened Mar 11, 2025 by
kebe7jun
Add metrics for tokenization/detokenization/wait in queue latency
#4280
opened Mar 11, 2025 by
hebiao064
1 of 6 tasks
[Feature] Support Tensor Parallelism and Weight Slicing for Lora
#4274
opened Mar 10, 2025 by
aoshen524
3 of 4 tasks
Fix the output of hidden states after HTTP requests
#4269
opened Mar 10, 2025 by
Qiaolin-Yu
1 of 6 tasks
Apply structured output sampling after reasoning steps in Reasoning models
#4264
opened Mar 10, 2025 by
minleminzui
6 tasks done
Integrate DeepEP into SGLang
high priority
#4232
opened Mar 9, 2025 by
liz-badada
Draft
1 of 6 tasks
[Feature] Prefill assistant response - add continue_final_message parameter
#4226
opened Mar 9, 2025 by
adarshxs
3 tasks done
[Fix] Check the device backend before calling empty_cache function
#4212
opened Mar 8, 2025 by
cboss6
1 of 6 tasks
Statistical Analysis of the Output Stability of the Deepseek Model
#4202
opened Mar 8, 2025 by
tanzelin430
Draft
2 of 6 tasks
[ROCm/Draft/No-Merge]: Flex Attention Enablement
amd
collaboration
documentation