-
Notifications
You must be signed in to change notification settings - Fork 660
Pull requests: PaddlePaddle/FastDeploy
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Feature] Support stopping the inference for the corresponding request in the online service after a disconnection request.
#5320
opened Dec 1, 2025 by
qwes5s5
Loading…
5 tasks
[PD Disaggregation] Add timestamp for analyzing splitwise deployment
#5317
opened Dec 1, 2025 by
juncaipeng
Loading…
5 tasks done
[Cherry-Pick] 1.fix tp+ep moe_forward; 2.set max_prefill_batch=env.MAX_PREFILL_NUM #5315
#5316
opened Dec 1, 2025 by
carryyu
Loading…
5 tasks
[Optimization]1.fix tp+ep moe_forward; 2.set max_prefill_batch=env.MAX_PREFILL_NUM
#5315
opened Dec 1, 2025 by
carryyu
Loading…
5 tasks
[Optimization] support mm prefill batch
#5313
opened Dec 1, 2025 by
kevincheng2
Loading…
3 of 5 tasks
[PD Disaggregation] [tmp] decode use cpu buffer to receive cache from prefill
#5308
opened Dec 1, 2025 by
juncaipeng
Loading…
5 tasks done
[Intel HPU] add example benchmark scripts for hpu
#5304
opened Dec 1, 2025 by
fmiao2372
Loading…
2 tasks done
[Cherry-Pick][Feature] Enable prefix caching for mtp(#5302)
#5303
opened Nov 30, 2025 by
rainyfly
Loading…
5 tasks
[Cherry-Pick][BugFix]Set default OMP_NUM_THREADS=3 and fix extra GPU memory usage in DeepSeek (#5219)
#5294
opened Nov 28, 2025 by
bukejiyu
Loading…
1 of 5 tasks
Revert "[CI] 【Hackathon 9th Sprint No.41】NO.41 功能模块单测补充 -part"
#5291
opened Nov 28, 2025 by
YuanRisheng
Loading…
Revert "[CI] 【Hackathon 9th Sprint No.18】NO.18 功能模块单测补充 -part"
#5290
opened Nov 28, 2025 by
YuanRisheng
Loading…
[Quantization] Support w4afp8 MoE dynamic quantization
#5282
opened Nov 27, 2025 by
Sunny-bot1
Loading…
3 of 5 tasks
[CI] Update test_docker to paddle_dev
#5278
opened Nov 27, 2025 by
EmmonsCurse
Loading…
5 tasks done
[Optimization]
Qwen2.5-VL support multi-batch prefill
#5269
opened Nov 27, 2025 by
aquagull
Loading…
2 of 5 tasks
Previous Next
ProTip!
Follow long discussions with comments:>50.