Skip to content

Commit 04f7853

Browse files
committed
update pngs
Signed-off-by: hsliu <liuhongsheng4@huawei.com>
1 parent 5ee31ea commit 04f7853

File tree

3 files changed

+8
-0
lines changed

3 files changed

+8
-0
lines changed

_posts/2025-11-30-vllm-omni.md

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -50,8 +50,16 @@ vLLM-Omni is not just a wrapper; it is a re-imagining of how vLLM handles data f
5050

5151
* **Performance:** We utilize pipelined stage execution to overlap computation for high throughput performance, ensuring that while one stage is processing, others aren't idle.
5252

53+
<p align="center">
54+
<img src="/assets/figures/2025-11-30-vllm-omni/vllm-omni-pipeline-async-stage.png" alt="vllm-omni pipelined stage execution" width="80%">
55+
</p>
56+
5357
We benchmarked vLLM-Omni against Hugging Face Transformers to demonstrate the efficiency gains in omni-modal serving.
5458

59+
<p align="center">
60+
<img src="/assets/figures/2025-11-30-vllm-omni/vllm-omni-vs-hf.png" alt="vLLM-Omni against Hugging Face Transformers" width="80%">
61+
</p>
62+
5563

5664
## **Future Roadmap**
5765

106 KB
Loading
115 KB
Loading

0 commit comments

Comments
 (0)