Add callbacks, history logging and performance improvements to sgd. #105
base: main
Conversation
To sgd.py:
- Added single-pass JIT grad and objective calculation for big speed boosts.
- Added history logging option for easy debugging and a better understanding of the optimisation process.
- Added callbacks functionality, allowing for user-specific callbacks, e.g. early stopping, live graphs etc.
Added iteration_result.py to monitor the current state of convergence. Use of a dataclass ensures backwards compatibility of callbacks.
Added solver_callbacks.py for callbacks and associated funcs.
Added history attributes to the solver result.
willGraham01 left a comment:
All in all I like these changes, and these are useful features that we could do with adding. Callbacks in particular should be very useful from a development/debugging perspective too.
Most of my comments relate to the design decisions for the code, considering what's in the rest of the codebase. Namely, I think we can do some code recycling in places, and we tend to write our tests in a particular format (though the test cases provided are good).
Also, there are only two commits on this branch (one for codebase changes, one for tests). In general, don't be afraid to use more granular commits in your PRs (just take a look at how long the other PRs are!) - we use squash merges, so everything gets condensed into a single commit on main anyway. And it's good to be able to roll things back.
```python
obj_val_history: list[npt.ArrayLike] = field(default_factory=list)
...
def _update_iteration_result(
```
Any reason why this isn't a method of the IterationResult class? I expected this to be something like IterationResult.update (and it's currently written in that form too -> swap iter_result to self).
Related: any reason why history_logging_interval is an argument that we pass in, rather than an attribute that's set at creation time? (I guess we could want dynamic logging, which we wouldn't get with a fixed attribute, but is that a common enough use case to design around?) It also means that we could just check history_logging_interval > 0 once at creation time, and not do it every time in the method.
> Any reason why this isn't a method of the IterationResult class? I expected this to be something like IterationResult.update (and it's currently written in that form too -> swap iter_result to self).
Nope. It should be.
> Related: any reason why history_logging_interval is an argument that we pass in, rather than an attribute that's set at creation time? (I guess we could want dynamic logging, which we wouldn't get with a fixed attribute, but is that a common enough use case to design around?) It also means that we could just check history_logging_interval > 0 once at creation time, and not do it every time in the method.
No. This is a good point.
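To illustrate the suggestion, here is a minimal sketch of what that refactor could look like. Field names are guesses apart from obj_val_history, which appears in the diff above; this is not the PR's actual code:

```python
from dataclasses import dataclass, field

import numpy.typing as npt


@dataclass(frozen=True)
class IterationResult:
    # Sketch only. With frozen=True the list fields can still be
    # appended to, since that mutates the list, not the dataclass field.
    history_logging_interval: int = 0
    fn_args_history: list[npt.ArrayLike] = field(default_factory=list)
    grad_val_history: list[npt.ArrayLike] = field(default_factory=list)
    obj_val_history: list[npt.ArrayLike] = field(default_factory=list)

    def update(self, params, grad_val, iteration: int, obj_val) -> None:
        # The interval is fixed at creation time, so the "is logging
        # enabled?" check could even be hoisted out of this method.
        if (
            self.history_logging_interval > 0
            and iteration % self.history_logging_interval == 0
        ):
            self.fn_args_history.append(params)
            self.grad_val_history.append(grad_val)
            self.obj_val_history.append(obj_val)
```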
```python
for _ in range(maxiter + 1):
    objective_value = objective(current_params)
    gradient_value = gradient(current_params)
    _update_iteration_result(
        iter_result,
        current_params,
        gradient_value,
        _,
        objective_value,
        history_logging_interval,
    )
```
Per Matt's convention from another PR, since we're using _ in the loop, we should probably use a name like current_iter or something for the loop variable. (Would add a suggestion, sorry, but GitHub doesn't let me suggest things for unchanged lines.)
Yeah, because of the iters_used = _ line, I wasn't sure if this was a convention you were using, so I didn't want to change it without checking.
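Concretely, the suggested rename applied to the loop shown above would look something like this (a fragment mirroring the snippet, not self-contained code):

```python
for current_iter in range(maxiter + 1):
    objective_value = objective(current_params)
    gradient_value = gradient(current_params)
    _update_iteration_result(
        iter_result,
        current_params,
        gradient_value,
        current_iter,  # was `_`
        objective_value,
        history_logging_interval,
    )

# After the loop, the old `iters_used = _` assignment becomes:
iters_used = current_iter
```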
```python
)
...
@pytest.mark.parametrize(
```
I wouldn't advocate for doing this here and now, but I think we can likely condense these tests a bit.
If I'm right, we're currently checking:
- The number of iterations logged is correct
- The parameters / objective function / gradient values logged at each iteration are correct
- Callbacks are invoked correctly regardless of shape (whether they are a list / single callable / None etc).
- And catching the error case of the above (when not given callables).
- Testing that callbacks don't affect the SGD result / convergence.
The correct invocation (and its associated error catch) are the same things that we're checking in _normalise_callbacks. As such, I'm of the opinion that we don't need to test for catching them here (since the tests for _normalise_callbacks will flag what happens if we pass bad things in here!) - and we should just pass valid entries to sgd's callbacks argument. Testing that these callbacks return and log the expected values, however, is of course something we should still be doing!
Value logging is probably worth checking, but we can probably drop one of the "interval=2" and "interval=3" cases (the purpose of both tests is to check the logging interval is respected), and one of the "interval=0" and "interval=-1" cases (which both check something sensible happens for a nonsensical input).
This means that it's probably possible to condense these 3 tests into a single test function (with parametrisation) along the lines of "test_sgd_logging", where each test case checks logging, recording, and non-effect on convergence. But that sounds like a lot of reorganisation, which I should probably just break out into a follow-on issue 😅
> If I'm right, we're currently checking:
> - The number of iterations logged is correct
> - The parameters / objective function / gradient values logged at each iteration are correct
> - Callbacks are invoked correctly regardless of shape (whether they are a list / single callable / None etc).
> - And catching the error case of the above (when not given callables).
> - Testing that callbacks don't affect the SGD result / convergence.
Yes, and for the last bullet point, we're also testing that convergence isn't affected by combinations of history logging and callbacks, with the additional caveat that if the IterationResult attributes are directly changed then of course the convergence will differ.
> The correct invocation (and its associated error catch) are the same things that we're checking in _normalise_callbacks. As such, I'm of the opinion that we don't need to test for catching them here (since the tests for _normalise_callbacks will flag what happens if we pass bad things in here!) - and we should just pass valid entries to sgd's callbacks argument.
Opting to test both was intentional. My thoughts are that 1) I would like to know _normalise_callbacks works correctly, and 2) it is implemented correctly in each solver. I think we could remove either one of test_sgd_callbacks_invocation or test_normalise_callbacks, but I personally favour removing test_normalise_callbacks and keeping test_sgd_callbacks_invocation, because I think it's more important to know that it is implemented correctly in each solver.
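For readers without the diff to hand, here is a plausible reconstruction of _normalise_callbacks based on the behaviour described in this thread; the exact signature and error type are assumptions:

```python
from collections.abc import Callable, Sequence


def _normalise_callbacks(
    callbacks: Callable | Sequence[Callable] | None,
) -> list[Callable]:
    # Accept None, a single callable, or a sequence of callables,
    # and always hand back a list of callables.
    if callbacks is None:
        return []
    if callable(callbacks):
        return [callbacks]
    if isinstance(callbacks, Sequence) and all(callable(c) for c in callbacks):
        return list(callbacks)
    # Anything else (e.g. a list containing non-callables) is rejected.
    raise TypeError("callbacks must be a callable or a sequence of callables")
```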
> Value logging is probably worth checking, but we can probably drop one of the "interval=2" and "interval=3" cases (the purpose of both tests is to check the logging interval is respected), and one of the "interval=0" and "interval=-1" cases (which both check something sensible happens for a nonsensical input).
Yeah I can remove those. My brain always just questions if there is something special about the first edge case that makes it work correctly, so I always feel the need to excessively add more!
> This means that it's probably possible to condense these 3 tests into a single test function (with parametrisation) along the lines of "test_sgd_logging", where each test case checks logging, recording, and non-effect on convergence. But that sounds like a lot of reorganisation, which I should probably just break out into a follow-on issue 😅
Got it 👍 .
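A rough shape for that condensed test, purely illustrative (the parametrised cases follow the pruning suggested above, and the body is left as a skeleton):

```python
import pytest


@pytest.mark.parametrize(
    ("history_logging_interval", "callbacks"),
    [
        (1, None),                        # log every iteration, no callbacks
        (2, [lambda iter_result: None]),  # coarser interval, with a callback
        (0, None),                        # nonsensical interval: nothing logged
    ],
)
def test_sgd_logging(history_logging_interval, callbacks):
    # Run sgd once with these settings, then assert that:
    #   1. the expected iterations (and their obj / fn_args / grad values)
    #      were logged,
    #   2. any callbacks were invoked with the expected IterationResult,
    #   3. the final result matches a plain run, i.e. logging and callbacks
    #      do not affect convergence.
    ...
```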
- By compiling with jax.jit and calculating the gradient and objective in a single pass with jax.value_and_grad, we get big speed boosts to sgd. The new time taken is approximately […]. (A sketch of this pattern follows this list.)
- The new history_logging_interval parameter for the stochastic_gradient_descent function allows the user to enable or disable logging of the optimisation history. The interval determines how frequently the history is logged. This makes it easier to debug optimisations and make decisions about hyperparameters.
- Added IterationResult, which is analogous to SolverResult. However, IterationResult uses frozen=True to allow for dataclass updates each iteration. Using a dataclass ensures backward compatibility for callbacks if a new attribute is logged.
- The new callbacks parameter for the stochastic_gradient_descent function allows the user to set a list of callback functions, as is standard in optimisation loops. In future, the callbacks can be used for early stopping or live plotting of the results. As an example, we include a useful tqdm callback that displays a progress bar for the iterations and the current objective value. (A sketch of such a callback also follows this list.)

Tests:
- test_normalise_callbacks: Tests that _normalise_callbacks does validation and casts valid types to list[Callable[[IterationResult], None]].
- test_sgd_history_logging_intervals: Tests that the correct iterations are logged for different intervals, and that the correct associated obj, fn_args and grad values are too, for sgd.
- test_callback_invocation: Tests that sgd callbacks are called in the correct order with the correct IterationResult.
- test_invalid_callback: Tests that sgd will raise an error if given an invalid callback.
- test_logging_or_callbacks_affect_sgd_convergence: Tests that various combinations of callbacks and logging intervals all result in the same convergence behaviour, and thus all have the same final obj, fn_args, grad etc.
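A minimal sketch of the single-pass pattern from the first point, with a toy quadratic objective standing in for the real one:

```python
import jax
import jax.numpy as jnp


def objective(params):
    # Toy quadratic objective standing in for the real one.
    return jnp.sum(params**2)


# One compiled function returns the objective value and its gradient
# together, instead of two separately traced objective() and gradient() calls.
value_and_grad_fn = jax.jit(jax.value_and_grad(objective))

obj_val, grad_val = value_and_grad_fn(jnp.array([1.0, 2.0]))
```

And a sketch of what the bundled tqdm progress-bar callback might look like; make_tqdm_callback and the obj_val attribute are assumed names, not necessarily those used in the PR:

```python
from tqdm import tqdm


def make_tqdm_callback(maxiter: int):
    # Assumed shape: callbacks receive the current IterationResult,
    # and `obj_val` is a guessed attribute name for the objective value.
    progress_bar = tqdm(total=maxiter)

    def callback(iter_result):
        progress_bar.update(1)
        progress_bar.set_postfix(objective=float(iter_result.obj_val))

    return callback
```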