-
Notifications
You must be signed in to change notification settings - Fork 101
Bump kv-cache-manager to v0.4.0-rc2 #467
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
There will be a v0.4.0 release sometime today - this PR can import it. |
|
/hold until kv-cache manager v0.4 is released. |
|
yeah, it's WIP for that reason, is this a known flake? |
|
https://github.com/llm-d/llm-d-kv-cache-manager/releases/tag/v0.4.0-rc1 was released, it requires some changes in addition to the version bump. I tagged you in the relevant PR, this one is mandatory: https://github.com/llm-d/llm-d-kv-cache-manager/pull/150/files#r2547769214, 2nd comment is optional. Do you want to handle? An alternative can be merging this and handling separately.
This is not common - weird. |
3151f47 to
08b27db
Compare
|
It should be fixed now |
|
/hold cancel |
7be2fe0 to
1f18375
Compare
| if token := os.Getenv("HF_TOKEN"); token != "" && | ||
| parameters.IndexerConfig != nil && | ||
| parameters.IndexerConfig.TokenizersPoolConfig != nil && | ||
| parameters.IndexerConfig.TokenizersPoolConfig.HFTokenizerConfig != nil { | ||
| parameters.IndexerConfig.TokenizersPoolConfig.HFTokenizerConfig.HuggingFaceToken = token | ||
| } |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
these lines look like internal implementation details of kvcache manager code.
can we move this to kvcache repo?
I'd expect the call indexerConfig, err := kvcache.NewDefaultConfig() in L45 to initialize that internally.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The kvcache library defines a configuration with default values. This code piece is that of a user, permitting configuration through the env-var HF_TOKEN. It is arguable that this is a special env-var that is widely accepted but generally in the kvcache library we attempt to contain all configuration in the referenced structure, leaving customized UX to the users.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
right. it's a user defined env var (with the user's token).
HF_TOKEN as env var is very acceptable.
the question I was trying to answer is why do we read that env var here and not in kvcache code.
this part looks very not natural, the scorer factory function writes to an internal config of the kvcache indexer parameters.
I was expecting to have the os.getenv call inside kvcache.NewDefaultConfig().
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Since this discussion is applicable to the current setup, should we discuss it separately?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I agree that kv cache manager library can do this since the use can still opt-out from the default env-var injection by not using the NewDefaultConfig function if they want to have different defaults
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let's track this in a new issue, I think this should not be a blocker.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
not a blocker 👍
Signed-off-by: Pierangelo Di Pilato <pierdipi@redhat.com>
Signed-off-by: Pierangelo Di Pilato <pierdipi@redhat.com>
Signed-off-by: Pierangelo Di Pilato <pierdipi@redhat.com>
1f18375 to
2125f03
Compare
Signed-off-by: Pierangelo Di Pilato <pierdipi@redhat.com>
|
/lgtm |
Bump + fix breaking changes