Hotunplug: Dynamically remove compute resources (CPU, Memory, Swap Capacity and HugePages) from the node, either via software (taking resources offline) or via hardware (physical removal while the system is running)
+
Node Compute Resource: CPU, Memory, Swap Capacity and HugePages
## Summary
-
The proposal seeks to facilitate hot plugging of node compute resources(CPU, Memory, Swap Capacity and HugePages), thereby streamlining cluster resource capacity updates through node compute resource resizing rather than introducing new nodes to the cluster.
+
The proposal seeks to facilitate hot plugging of node compute resources, thereby streamlining cluster resource capacity updates through node compute resource resizing rather than introducing new nodes to the cluster.
The revised node configurations will be automatically propagated at both the node and cluster levels.
Furthermore, this proposal intends to enhance the initialization and reinitialization processes of resource managers, including the CPU manager and memory manager, in response to alterations in a node's CPU and memory configurations and
@@ -135,7 +136,7 @@ Implementing this KEP will empower nodes to recognize and adapt to changes in th
### Goals
* Achieve seamless node capacity expansion through hot plugging resources.
-
* Enable the re-initialization of resource managers (CPU manager, memory manager) and kube runtime manager to accommodate alterations in the node's resource allocation.
+
* Enable the re-initialization of resource managers (CPU manager, memory manager) and kube runtime manager without reset to accommodate alterations in the node's resource allocation.
* Recalculate and update the OOMScoreAdj and swap memory limit for existing pods.
### Non-Goals
@@ -913,7 +914,7 @@ VMs of cluster should support hot plug of compute resources for e2e tests.
or if it has to be terminated due to resource crunch.
* Recalculate OOM adjust score and Swap limits:
* Since the total capacity of the node has changed, values associated with the node's memory capacity must be recomputed.
-
* Handling unplug of reserved and exclusively allocated cpus CPUs.
+
* Handling unplug of reserved and exclusively allocated CPUs.
* Fetching machine info via CRI
* At present, the machine data is retrieved from cAdvisor's cache through periodic checks. There is ongoing development to utilize CRI APIs for this purpose.