Commit 68fb291

Fix CosineDecay documentation to clarify alpha is a multiplier (#21827)
* Fix CosineDecay documentation to clarify alpha is a multiplier

  The documentation incorrectly stated that the learning rate decays
  'to alpha', when it actually decays to 'initial_lr * alpha'. Updated the
  docstring to make it clear that alpha is a fraction/multiplier, not an
  absolute target value.

* Fix line length to comply with 80 char limit
1 parent b4d9c89 commit 68fb291

File tree

1 file changed (+4, -3 lines)

keras/src/optimizers/schedules/learning_rate_schedule.py

Lines changed: 4 additions & 3 deletions
```diff
@@ -584,9 +584,10 @@ class CosineDecay(LearningRateSchedule):
     schedule applies a linear increase per optimizer step to our learning rate
     from `initial_learning_rate` to `warmup_target` for a duration of
     `warmup_steps`. Afterwards, it applies a cosine decay function taking our
-    learning rate from `warmup_target` to `alpha` for a duration of
-    `decay_steps`. If `warmup_target` is None we skip warmup and our decay
-    will take our learning rate from `initial_learning_rate` to `alpha`.
+    learning rate from `warmup_target` to `warmup_target * alpha` for a
+    duration of `decay_steps`. If `warmup_target` is None we skip warmup and
+    our decay will take our learning rate from `initial_learning_rate` to
+    `initial_learning_rate * alpha`.
     It requires a `step` value to compute the learning rate. You can
     just pass a backend variable that you increment at each training step.
```
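To see the clarified semantics in action, here is a minimal sketch (not part of the commit) using the public `keras.optimizers.schedules.CosineDecay` API; the specific rate and step values are illustrative:

```python
import keras

# With alpha=0.1, the schedule decays to initial_learning_rate * alpha
# (here 1e-3 * 0.1 = 1e-4), not to the absolute value 0.1.
schedule = keras.optimizers.schedules.CosineDecay(
    initial_learning_rate=1e-3,
    decay_steps=10_000,
    alpha=0.1,
)

# A schedule is callable on the current step.
print(float(schedule(0)))       # 0.001  (start of decay)
print(float(schedule(10_000)))  # 0.0001 (initial_learning_rate * alpha)
```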
