-
Notifications
You must be signed in to change notification settings - Fork 752
Pull requests: PaddlePaddle/FastDeploy
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[BugFix] Seperate prometheus multiproc dir for single-server multi-dp services
#8059
opened Jun 16, 2026 by
liyonghua0910
Collaborator
Loading…
5 tasks
[Docs] Add Mooncake HA redis backend deployment example
#8058
opened Jun 16, 2026 by
jackyYang6
Contributor
Loading…
3 of 5 tasks
[Refactor] Remove redundant code in MLA cache management
#8050
opened Jun 15, 2026 by
HayzelHan
Loading…
[Models] Support MLA_SWA functionality
#8049
opened Jun 15, 2026 by
chang-wenbin
Collaborator
Loading…
2 of 5 tasks
[Models] fix fleet model fallback ep init
#8039
opened Jun 11, 2026 by
xiaoguoguo626807
Contributor
Loading…
4 tasks done
[XPU][OP] Add build_sampling_params kernel for MTP speculative decoding
#8032
opened Jun 10, 2026 by
Clarity256
Loading…
3 of 5 tasks
[CI] DEBUG switch CI test env to new driver
#8007
opened Jun 4, 2026 by
EmmonsCurse
Collaborator
Loading…
2 of 5 tasks
Integrate Elastic-Attention (PawQwen3) into FastDeploy
#8001
opened Jun 4, 2026 by
tianzhenxu
Loading…
[Others]Prefill-Decode Batch Invariant in glm
#8000
opened Jun 4, 2026 by
bukejiyu
Collaborator
Loading…
5 tasks
[Iluvatar] Support CINN for PaddleOCR-VL by converting max_seqlens to Tensor inputs
#7997
opened Jun 4, 2026 by
wuyujiji
Contributor
Loading…
5 tasks done
debug allreduce fusion acc issue
#7976
opened Jun 2, 2026 by
BingooYang
Contributor
Loading…
5 tasks
[Metax][CI]: skip trap asm on MetaX GPU to fix compile error
#7975
opened Jun 2, 2026 by
zhang-chenyi
Contributor
Loading…
5 tasks
[SOT] Support flashinfer_allreduce
#7970
opened Jun 2, 2026 by
ZhangX-21
Contributor
Loading…
5 tasks
Revert blockwise CUDAGraph and support piecewise CUDAGraph in prefill
#7969
opened Jun 2, 2026 by
ZhangX-21
Contributor
Loading…
5 tasks
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-05-16.