-
Notifications
You must be signed in to change notification settings - Fork 725
feat: Clean up lora endpoints #4715
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Signed-off-by: William Arnold <7565007+Aphoh@users.noreply.github.com>
Signed-off-by: William Arnold <7565007+Aphoh@users.noreply.github.com>
Signed-off-by: William Arnold <7565007+Aphoh@users.noreply.github.com>
Signed-off-by: William Arnold <warnold@nvidia.com>
|
@coderabbitai review |
✅ Actions performedReview triggered.
|
WalkthroughThe PR consolidates LoRA management within the vLLM engine by transitioning LoRA endpoints from public system status server routes to internal engine routes, while removing corresponding public data structures and handlers from the system status server. Changes
Estimated code review effort🎯 2 (Simple) | ⏱️ ~12 minutes
Poem
Pre-merge checks❌ Failed checks (1 inconclusive)
✅ Passed checks (2 passed)
📜 Recent review detailsConfiguration used: Path: .coderabbit.yaml Review profile: CHILL Plan: Pro 📒 Files selected for processing (2)
🧰 Additional context used🧠 Learnings (2)📓 Common learnings📚 Learning: 2025-10-07T20:32:27.578ZApplied to files:
🔇 Additional comments (3)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
|
@coderabbitai what happens to the DYN_* environment variables? |
|
Tip For best results, initiate chat on the files or code changes. 🧩 Analysis chain🏁 Script executed: Length of output: 263003 🏁 Script executed: Length of output: 2419 @Aphoh The What ChangedThis PR moved LoRA management from Rust routes (in
How They Work Now
User ImpactNo changes needed for users who already set these environment variables. They continue to work exactly as before, just with a cleaner implementation under the hood. 🧠 Learnings used |
Signed-off-by: William Arnold <warnold@nvidia.com>
Signed-off-by: William Arnold <warnold@nvidia.com>
Signed-off-by: William Arnold <warnold@nvidia.com>
|
Base got merged, leaving for later |
Overview:
Adds onto #4617, cleaning up some logic that was put in for doing custom vllm lora routes. It's substantially cleaner with the engine-wildcard routes. @biswapanda
Summary by CodeRabbit
✏️ Tip: You can customize this high-level summary in your review settings.