@TianyiZhao1437 TianyiZhao1437 commented Oct 30, 2025

📋 PR Title Format

The PR title should follow the format:

type(scope): concise message (max 50 chars)

Where:

  • type is one of: feat, fix, docs, refactor, perf, test, chore.
  • scope is optional and describes the part of the codebase affected (e.g., auth, ui, api).
  • concise message is a short description of the change (max 50 chars).
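
The format above can be checked mechanically. The following is a minimal sketch, not part of this PR or the project's tooling: `check_pr_title`, `ALLOWED_TYPES`, and `MAX_MESSAGE_CHARS` are hypothetical names, and it assumes the 50-character limit applies to the message part only, not to the whole title.

```python
import re

# Types listed in the PR template.
ALLOWED_TYPES = ("feat", "fix", "docs", "refactor", "perf", "test", "chore")

# Assumption: the 50-character limit covers only the message after "type(scope): ".
MAX_MESSAGE_CHARS = 50

# type, optional "(scope)", then ": " and a non-empty message.
TITLE_RE = re.compile(
    r"^(?P<type>" + "|".join(ALLOWED_TYPES) + r")"
    r"(?:\((?P<scope>[^()]+)\))?"
    r": (?P<message>.+)$"
)


def check_pr_title(title: str) -> bool:
    """Return True if the title matches type(scope): message within the limit."""
    match = TITLE_RE.match(title)
    if match is None:
        return False
    return len(match.group("message")) <= MAX_MESSAGE_CHARS
```

For example, `check_pr_title("feat(backend): Support GPU TP")` accepts the title, while a title with an unlisted type such as `feature: ...` is rejected.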

📝 Change Type

Please select the type of change this PR introduces (choose one or more):

  • feat: New feature.
  • fix: Bug fix.
  • docs: Documentation only changes.
  • refactor: A code change that neither fixes a bug nor adds a feature.
  • perf: Performance improvement.
  • test: Adding missing tests or correcting existing tests.
  • chore: Maintenance tasks (e.g., updating dependencies).

💡 Description

Support GPU inter-node tensor parallelism.

Key Changes

  1. The backend now supports GPU TP.
  2. Each TP executor is launched in its own subprocess. Because the inter-process communication with the gradient server still needs to be refactored, this feature is currently enabled only in non-scheduler mode.
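
As a rough illustration of the one-subprocess-per-executor pattern in item 2, here is a minimal sketch using only the Python standard library. It is not the code from this PR: `launch_tp_executors` and `WORKER_SNIPPET` are hypothetical names, and the worker only prints its rank instead of loading a model shard or joining a communication group.

```python
import subprocess
import sys

# Hypothetical stand-in for a TP executor entry point: it reads its rank and the
# TP size from argv and reports them. A real executor would set up its GPU here.
WORKER_SNIPPET = (
    "import sys; rank, tp_size = sys.argv[1:3]; "
    "print(f'executor rank={rank} tp_size={tp_size}')"
)


def launch_tp_executors(tp_size: int) -> list[str]:
    """Start one subprocess per TP rank and collect each worker's output."""
    procs = [
        subprocess.Popen(
            [sys.executable, "-c", WORKER_SNIPPET, str(rank), str(tp_size)],
            stdout=subprocess.PIPE,
            text=True,
        )
        for rank in range(tp_size)
    ]
    outputs = []
    for rank, proc in enumerate(procs):
        out, _ = proc.communicate()  # wait for this rank to finish
        if proc.returncode != 0:
            raise RuntimeError(f"TP executor {rank} exited with {proc.returncode}")
        outputs.append(out.strip())
    return outputs
```

All workers are started before any of them is waited on, so the ranks run concurrently; the parent only blocks when collecting results.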

🔗 Related Issues

List any issues this PR closes or relates to:

✅ Checklist

Please ensure the following points are addressed before merging:

  • I have performed a self-review of my own code.
  • I have added/updated tests that prove my fix or feature works (if applicable).
  • I have updated the documentation (if necessary).
  • My code follows the project's style guidelines.

@TianyiZhao1437 TianyiZhao1437 marked this pull request as draft October 30, 2025 04:42
@TianyiZhao1437 TianyiZhao1437 linked an issue Oct 30, 2025 that may be closed by this pull request
@TianyiZhao1437 TianyiZhao1437 marked this pull request as ready for review November 8, 2025 06:44
@TianyiZhao1437 TianyiZhao1437 requested a review from a team November 8, 2025 06:44
@TianyiZhao1437 TianyiZhao1437 changed the title from "feat(backend): Support GPU TP" to "feat(backend): Support GPU TP in backend." Nov 8, 2025
@TianyiZhao1437 TianyiZhao1437 changed the title from "feat(backend): Support GPU TP in backend." to "feat(backend): Support GPU TP. Enable TP for non-scheduler mode." Nov 8, 2025
@TianyiZhao1437 TianyiZhao1437 changed the title from "feat(backend): Support GPU TP. Enable TP for non-scheduler mode." to "feat(backend): Support GPU TP and enable TP for non-scheduler mode" Nov 8, 2025