
Pull requests: vllm-project/vllm


Pull requests list

[Misc] benchmark_moe supports expert parallel
Labels: performance
#22251 opened Aug 5, 2025 by jeejeelee
4 tasks
[V0 Deprecation][TPU] Remove V1 flag check from tests
Labels: ready, tpu, v1
#22248 opened Aug 5, 2025 by NickLucche
[Platform] allow platform to init dp group
Labels: ready
#22243 opened Aug 5, 2025 by wangxiyuan
1 of 4 tasks
fix: No module named 'pandas'
Labels: ci/build
#22240 opened Aug 5, 2025 by linpan
[Perf][Feat][Core] Workload-Aware KVCache Eviction Policy
Labels: documentation, performance, v1
#22236 opened Aug 5, 2025 by Chasingdreams6
4 tasks done
[CI/Build] Update flashinfer to 0.2.9
Labels: ci/build, ready
#22233 opened Aug 5, 2025 by mgoin
Tanuj/meow penalty
Labels: ci/build, documentation, frontend, needs-rebase, tpu, v1
#22229 opened Aug 5, 2025 by tanujtiwari1998
4 tasks
[Multimodal] Skip vision component loading for llava in text-only mode
Labels: multi-modality
#22224 opened Aug 5, 2025 by sfeng33
Fp8 paged attention update
Labels: rocm
#22222 opened Aug 5, 2025 by xiao-llm Draft
4 tasks
Support fp8(e4m3) mfma for ROCm paged attention
Labels: rocm
#22221 opened Aug 5, 2025 by xiao-llm Draft
4 tasks
feat: Add native support for XLM-RoBERTa embedding and BAAI/bge-reranker-v2-m3
Labels: documentation, new-model
#22216 opened Aug 4, 2025 by honghanhh
4 tasks done
[Perf] Support topk softmax fused kernel for broader num_experts
#22211 opened Aug 4, 2025 by shixianc
3 of 4 tasks
[Bugfix] fix hash error for chunked local attention hybrid KV
Labels: v1
#22209 opened Aug 4, 2025 by luccafong
7 tasks done
[Misc] Improve Worker process title
Labels: v1
#22205 opened Aug 4, 2025 by 22quinn
[Misc] Update HunYuan dense model test
Labels: documentation
#22200 opened Aug 4, 2025 by jeejeelee Draft
4 tasks
[Core] Separate MM IPC cache from processor cache
Labels: documentation, frontend, multi-modality, ready, v1
#22198 opened Aug 4, 2025 by DarkLight1337
2 of 4 tasks
[Model] Mamba models - Support FP32 SSM cache
#22196 opened Aug 4, 2025 by danielafrimi