Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[Misc] guard against change in cuda library name
#8609 opened Sep 19, 2024 by bnellnm Loading…
[Bugfix] Move health checks to separate thread
#8583 opened Sep 18, 2024 by joerunde Loading…
[Core] Allow IPv6 in VLLM_HOST_IP with zmq
#8575 opened Sep 18, 2024 by russellb Loading…
[Bugfix] Fix Phi3.5 mini and MoE LoRA inference
#8571 opened Sep 18, 2024 by garg-amit Loading…
[MISC] add support custom_op check
#8557 opened Sep 18, 2024 by jikunshang Loading…
[Misc] Fix api_server args
#8556 opened Sep 18, 2024 by Juelianqvq Loading…
[CI/Build] Re-enabling Entrypoints tests on ROCm, excluding ones that fail ready ONLY add when PR is ready to merge/full CI is needed
#8551 opened Sep 18, 2024 by alexeykondrat Loading…
[Bugfix] Validate SamplingParam n is an int
#8548 opened Sep 17, 2024 by saumya-saran Loading…
[Bugfix] fix OpenAI API server startup with --disable-frontend-multiprocessing ready ONLY add when PR is ready to merge/full CI is needed
#8537 opened Sep 17, 2024 by dtrifiro Loading…
ppc64le: Dockerfile and CI fix
#8529 opened Sep 17, 2024 by sumitd2 Loading…
[CI/Build][Misc] Comparing between block manager v1 and v2, under full prefix sharing and no prefix sharing case. ready ONLY add when PR is ready to merge/full CI is needed
#8528 opened Sep 16, 2024 by KuntaiDu Loading…
[dbrx] refactor dbrx experts to extend FusedMoe class ready ONLY add when PR is ready to merge/full CI is needed
#8518 opened Sep 16, 2024 by divakar-amd Loading…
[Model][VLM] Add LLaVA-Onevision model support
#8486 opened Sep 14, 2024 by litianjian Loading…
2 of 3 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.