jd-opensource / xllm Public

Fork 90
Star 752

Pull requests: jd-opensource/xllm

14 Open 374 Closed

Pull requests list

feat: add mm embedding model and its factory.

#471 opened Dec 2, 2025 by dongxianzhe

feat: support prefix cache for deepseek-v3/r1 models.

#470 opened Dec 1, 2025 by DongheJin

refactor: separate mlu and cuda version Qwen model implementation.

#468 opened Dec 1, 2025 by XuZhang99

feat: determine device type automatically except npu device.

#467 opened Dec 1, 2025 by XuZhang99

feat: add NPU process group initialization and management.

#456 opened Nov 29, 2025 by yingxudeng

feat: implement chunked prefill and prefix cache for Qwen3 MoE.

#455 opened Nov 29, 2025 by yingxudeng

feat: support deepseek mtp on mlu.

#454 opened Nov 28, 2025 by a120092009

refactor: optimize unique token count preparation of batch input builder.

#449 opened Nov 27, 2025 by RobbieLeung

refactor: move draft input preparation of decode batch from worker to batch builder.

#448 opened Nov 27, 2025 by RobbieLeung

[WIP] feat: support loading model weights and forward overlap.

#441 opened Nov 26, 2025 by Clement-Wang26

feat: support Qwen2-VL & GME-Qwen2-VL model on npu device.

#399 opened Nov 18, 2025 by xanecdotex

feat: enable torch_npu graph mode for Qwen-3 dense with TP support.

#325 opened Nov 6, 2025 by yingxudeng

feat: add generative recommendation tokenizer.

#317 opened Nov 4, 2025 by magicheng0816

[WIP] feat: add rec framework.

#305 opened Oct 31, 2025 by DragonFive

3 of 5 tasks
