@Diaphteiros
Contributor

What this PR does / why we need it:
Increases robustness of the scheduler (ClusterRequest resources) against loss of Cluster resources. See release notes for details.

Which issue(s) this PR fixes:
Needs to be verified, but it should solve openmcp-project/backlog#300.

Special notes for your reviewer:

Release note:

The scheduler now detects when a `ClusterRequest` references a non-existent `Cluster` and re-schedules the request.
Unless explicitly requested via the `clusters.openmcp.cloud/randomize-cluster-name: "true"` label, the names of `Cluster` resources created for `ClusterRequest`s are now deterministic (derived from a hash of the request's name and namespace) instead of random. This makes it easier to recover lost `Cluster` resources. Note that for clusters with limited sharing, the name depends on the `ClusterRequest` that initially created the `Cluster`, so it might still change when the `Cluster` is lost and later re-created by the scheduler.
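
For illustration, the deterministic naming described in the release note could be sketched roughly as follows. This is not the code from this PR: the helper names, the `cluster-` prefix, the choice of SHA-256, and the 16-character truncation are all assumptions made for the example. Only the label key and the idea of hashing the request's name and namespace come from the release note.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
)

// Label key as given in the release note.
const randomizeLabel = "clusters.openmcp.cloud/randomize-cluster-name"

// deterministicClusterName is a hypothetical helper. It derives a stable name
// from the namespace and name of a ClusterRequest. The hash function, prefix,
// and length are assumptions, not the scheduler's actual scheme.
func deterministicClusterName(namespace, name string) string {
	// "/" cannot appear in Kubernetes namespaces or object names,
	// so it separates the two parts unambiguously.
	sum := sha256.Sum256([]byte(namespace + "/" + name))
	return "cluster-" + hex.EncodeToString(sum[:])[:16]
}

// wantsRandomName is a hypothetical helper. It reports whether the request's
// labels explicitly ask for a random Cluster name.
func wantsRandomName(labels map[string]string) bool {
	return labels[randomizeLabel] == "true"
}

func main() {
	a := deterministicClusterName("project-a", "my-request")
	b := deterministicClusterName("project-a", "my-request")
	c := deterministicClusterName("project-b", "my-request")

	fmt.Println("same request, same name:", a == b)
	fmt.Println("different namespace, same name:", a == c)
	fmt.Println("opt-out label set:", wantsRandomName(map[string]string{randomizeLabel: "true"}))
}
```

Because the name depends only on the request's identity, a scheduler that finds the referenced `Cluster` missing can recompute the same name when re-creating it, which is what makes recovery easier than with random names.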
