testthat 3.3.0 #765

hadley · 2025-11-05T15:07:09Z

First draft

lionel-

Nice release!

lionel- · 2025-11-11T07:56:48Z

content/blog/testthat-3-3-0/index.Rmd

+
+Overall it was a successful experiment. It helped me close over 100 issues in what felt like less time than usual. I don't have any hard numbers, but my gut feeling is that it was maybe a 10-20% improvement to my development velocity. This is still significant, especially since I'm an experienced R programmer and my workflow has been pretty stable for the last few years. I mostly used Claude for smaller, well-defined tasks where I had a good sense of what was needed. I found it particularly useful for refactoring, where it was easy to say precisely what I wanted, but executing the changes required a bunch of fiddly edits across many files.
+
+I also found it generally useful for getting over the "activation energy hump": there were a few issues that had been stagnating for years because they felt like they were going to be hard to do and with relatively limited payoff. I let Claude Code loose on a few of these and found it super useful. It only produced code I was really happy with a couple of times, but every time it gave me something to react to (often with strong negative feelings!) and that got me started actually engaging with the problem.


Anger-driven engagement algorithm for coders

content/blog/testthat-3-3-0/index.Rmd

lionel- · 2025-11-11T08:05:46Z

content/blog/testthat-3-3-0/index.Rmd

+
+## Other new features
+
+* testthat generally does a better job of handling nested tests, aka subtests, where you put a `test_that()` inside another `test_that()`, or more typically `it()` inside of `describe()`. Subtests will now generate more informative failure messages, free from duplication, with more informative skips if any subtests don't contain any expectations.


I thought nesting test_that() was mostly useful for testing the testthat package with itself, but this paragraph makes it sound like there might be user-oriented cases where this is helpful? If that's the case, it might be interesting to add a sentence explaining the use case, otherwise a sentence explaining this is mostly for internal testing.

It's mostly about making it possible to write tests inside of functions that get called inside of other tests. This is sometimes useful when you want to test that multiple functions/classes adhere to the same interface. But I'm also not 100% convinced that it's the right approach, so I don't want to go into the details here.

lionel- · 2025-11-11T08:07:09Z

content/blog/testthat-3-3-0/index.Rmd

+
+* `vignette("mocking")` explains mocking in detail, and new `local_mocked_s3_method()`, `local_mocked_s4_method()`, and `local_mocked_r6_class()` make it easier to mock S3 and S4 methods and R6 classes.
+
+* `test_dir()`, `test_check()`, and friends gain a `shuffle` argument that uses `sample()` to randomly reorder the top-level expressions in each test file. This random reordering surfaces dependencies between tests and code outside of any test, as well as dependencies between tests, helping you find and eliminate unintentional dependencies.


That's nice

lionel- · 2025-11-11T08:08:26Z

content/blog/testthat-3-3-0/index.Rmd

+
+* `test_dir()`, `test_check()`, and friends gain a `shuffle` argument that uses `sample()` to randomly reorder the top-level expressions in each test file. This random reordering surfaces dependencies between tests and code outside of any test, as well as dependencies between tests, helping you find and eliminate unintentional dependencies.
+
+* `try_again()` is now publicized, as it's a useful tool for testing flaky code:


Interesting, might be worth a note that this should still be skipped on CRAN? And include a skip_on_cran() in the example.

content/blog/testthat-3-3-0/index.Rmd

lionel- · 2025-11-11T08:15:19Z

content/blog/testthat-3-3-0/index.Rmd

+* New `SlowReporter` makes it easier to find the slowest tests in your package. You can run it with `devtools::test(reporter = "slow")`.
+
+* New `vignette("challenging-functions")` provides an index to other documentation organized by various challenges.
+


To use the new features, do you recommend bumping the version of testthat in Suggests? Might be a good place to mention it.

Unfortunately pkgload only checks for Imports not Suggests, so bumping the dep won't trigger the install prompt on load.

Yeah, and install.packages() won't check for it either. So overall, I don't think it's worth it.

teunbrand

It makes me excited to try out the new features!

content/blog/testthat-3-3-0/index.Rmd

hfrick · 2025-11-11T16:13:00Z

So many cool changes!

Sidenote: My local preview, opened in Firefox, has the horizontal line in the test_that() output piercing out to the right.

hadley · 2025-11-11T19:59:13Z

@hfrick thanks for spotting. fixed now!

hadley added 8 commits November 5, 2025 09:06

testthat 3.3.0

5121891

First draft

Mention review bot

693a1a7

Merged origin/main into testthat-3.3.0

4782d34

Hacking it into shape

c21d383

Proofread

e4093ea

Correct path

421cac6

Polishing

1fc9171

Claude code proofreading

c49deee

hadley mentioned this pull request Nov 10, 2025

Release testthat 3.3.0 r-lib/testthat#2275

Open

24 tasks

lionel- reviewed Nov 11, 2025

View reviewed changes

teunbrand reviewed Nov 11, 2025

View reviewed changes

content/blog/testthat-3-3-0/index.Rmd Show resolved Hide resolved

content/blog/testthat-3-3-0/index.Rmd Show resolved Hide resolved

hadley added 2 commits November 11, 2025 08:01

Code review + more proofreading

c9ae969

Add post metadata, images, and thanks

d63ea68

Set cli.width

8947c32


		Overall it was a successful experiment. It helped me close over 100 issues in what felt like less time than usual. I don't have any hard numbers, but my gut feeling is that it was maybe a 10-20% improvement to my development velocity. This is still significant, especially since I'm an experienced R programmer and my workflow has been pretty stable for the last few years. I mostly used Claude for smaller, well-defined tasks where I had a good sense of what was needed. I found it particularly useful for refactoring, where it was easy to say precisely what I wanted, but executing the changes required a bunch of fiddly edits across many files.

		I also found it generally useful for getting over the "activation energy hump": there were a few issues that had been stagnating for years because they felt like they were going to be hard to do and with relatively limited payoff. I let Claude Code loose on a few of these and found it super useful. It only produced code I was really happy with a couple of times, but every time it gave me something to react to (often with strong negative feelings!) and that got me started actually engaging with the problem.


		## Other new features

		* testthat generally does a better job of handling nested tests, aka subtests, where you put a `test_that()` inside another `test_that()`, or more typically `it()` inside of `describe()`. Subtests will now generate more informative failure messages, free from duplication, with more informative skips if any subtests don't contain any expectations.


		* `vignette("mocking")` explains mocking in detail, and new `local_mocked_s3_method()`, `local_mocked_s4_method()`, and `local_mocked_r6_class()` make it easier to mock S3 and S4 methods and R6 classes.

		* `test_dir()`, `test_check()`, and friends gain a `shuffle` argument that uses `sample()` to randomly reorder the top-level expressions in each test file. This random reordering surfaces dependencies between tests and code outside of any test, as well as dependencies between tests, helping you find and eliminate unintentional dependencies.


		* `test_dir()`, `test_check()`, and friends gain a `shuffle` argument that uses `sample()` to randomly reorder the top-level expressions in each test file. This random reordering surfaces dependencies between tests and code outside of any test, as well as dependencies between tests, helping you find and eliminate unintentional dependencies.

		* `try_again()` is now publicized, as it's a useful tool for testing flaky code:

		* New `SlowReporter` makes it easier to find the slowest tests in your package. You can run it with `devtools::test(reporter = "slow")`.

		* New `vignette("challenging-functions")` provides an index to other documentation organized by various challenges.

testthat 3.3.0 #765

Are you sure you want to change the base?

testthat 3.3.0 #765

Conversation

hadley commented Nov 5, 2025

Uh oh!

lionel- left a comment

Choose a reason for hiding this comment

Uh oh!

lionel- Nov 11, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

lionel- Nov 11, 2025

Choose a reason for hiding this comment

Uh oh!

hadley Nov 11, 2025

Choose a reason for hiding this comment

Uh oh!

lionel- Nov 11, 2025

Choose a reason for hiding this comment

Uh oh!

lionel- Nov 11, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

lionel- Nov 11, 2025

Choose a reason for hiding this comment

Uh oh!

hadley Nov 11, 2025

Choose a reason for hiding this comment

Uh oh!

teunbrand left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

hfrick commented Nov 11, 2025

Uh oh!

hadley commented Nov 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants