vllm-project / vllm-ascend Public

Notifications You must be signed in to change notification settings
Fork 52
Star 315

Code
Issues 53
Pull requests 22
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Pull requests: vllm-project/vllm-ascend

Labels 19 Milestones 0

New pull request New

22 Open 195 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[Misc] Add multi-model example

#313 opened Mar 12, 2025 by shen-shanshan

Loading…

[Doc] Add Single NPU (Qwen2.5-VL-7B) documentation

Improvements or additions to documentation

#311 opened Mar 12, 2025 by xiemingda-1002

Loading…

[CI][Test] Update pytest ut list

#310 opened Mar 12, 2025 by shen-shanshan

Loading…

[V1][Core] Add support for V1 Engine module:core

#295 opened Mar 11, 2025 by shen-shanshan

Loading…

[Doc] Add the release note for 0.7.3rc1 documentation

Improvements or additions to documentation

#285 opened Mar 10, 2025 by wangxiyuan

Loading…

[don't merge]Remove lock

#283 opened Mar 9, 2025 by Yikun • Draft

[Core] Support the features of prefix cache and chunk prefill module:core

#282 opened Mar 9, 2025 by rjg-lyh

Loading…

[Test] UT for linear method patch module:tests

#281 opened Mar 9, 2025 by rjg-lyh

Loading…

[Misc] Add transpose optimization in the linear layer

#280 opened Mar 9, 2025 by rjg-lyh

Loading…

Test GemmaX2 module:tests

#274 opened Mar 8, 2025 by geekchen007

Loading…

[Platform] Add get_stream_cls() for platform module:core

#261 opened Mar 7, 2025 by shen-shanshan • Draft

[Feature] add all_to_all and reduce_scatter module:core

#256 opened Mar 7, 2025 by onehaitao

Loading…

[Feature] Graph mode for deepseek. module:core module:ops

#254 opened Mar 6, 2025 by SidaoY

Loading…

Test DeepSeek V2 module:ops module:tests

#245 opened Mar 5, 2025 by Yikun • Draft

[core] Support custom ascendc kernels in vllm-ascend [draft] module:core

#233 opened Mar 4, 2025 by ganyi1996ppo

Loading…

[CI]Make UT cases in test_comm_ops.py compatible on Ascend NPU module:core

#220 opened Mar 3, 2025 by wwfu109

Loading…

[CI] UT for fused moe module:ops module:tests

#196 opened Feb 27, 2025 by SidaoY

Loading…

[BugFix]add int8 cache dtype when using attention quantization module:core

#128 opened Feb 21, 2025 by Angazenn

Loading…

[CI][UT]Update ut list

#123 opened Feb 20, 2025 by Potabk

Loading…

[CI] update vllm native ut

#76 opened Feb 17, 2025 by MengqingCao • Draft

[Doc]Add benchmark scripts

#74 opened Feb 17, 2025 by Potabk

Loading…

[Doc][WIP] Add official doc zh

#36 opened Feb 11, 2025 by Potabk

Loading…

ProTip! Updated in the last three days: updated:>2025-03-09.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly