Randomized and parallel tests by kdeldycke · Pull Request #3151 · pallets/click

kdeldycke · 2025-11-19T06:12:04Z

This PR helps detection of leaks and flaky tests by running tests in randomized order and parallel.

Results:

Multiple cores are properly detected and tests are run in parallel:
```
created: 4/4 workers
4 workers [1336 items]
```
Fix test_echo_via_pager test with was leaky and polluting others.

Rowlando13 · 2025-11-19T06:29:43Z

Docs has the test runner as not thread safe so I am interested in what you find:

These tools should only be used for testing since they change the entire interpreter state for simplicity. They are not thread-safe!
https://click.palletsprojects.com/en/stable/testing/

kdeldycke · 2025-11-19T06:35:38Z

I tried to combined #3139 and #3140 but they don't fix the issue.

kdeldycke · 2025-11-19T06:52:30Z

Maybe #1572 is fixing the issue.

pjonsson · 2025-11-19T08:27:45Z

I tried to combined #3139 and #3140 but they don't fix the issue.

The test for #2991/#2993 failed when I tested the #3140 branch, so that doesn't seem surprising. Does this PR with only #3139 on top of stable improve the situation compared to stable?

pjonsson · 2025-11-19T10:31:47Z

I don't think you're doing this for performance, but I investigated pytest-xdist in a different context a couple of years ago, and at the time it was difficult to get a speedup because the startup time was linear to the number of workers.

The startup time still seems to be linear to the number of workers, but the constant is much better now, here is a log with timing for --numprocesses=: auto (16), 4, 1, and a final run that still has pytest-xdist in the dependencies but without passing --numprocesses= to pytest.

As comparison, uv run pytest on stable reports 1316 passed, 21 skipped, 1 xfailed in 1.38s on the same machine.

$ uv run pytest tests/test_compat.py
============================= test session starts ==============================
platform linux -- Python 3.12.3, pytest-9.0.1, pluggy-1.6.0
Using --randomly-seed=3856693423
rootdir: /home/ubuntu/src/click
configfile: pyproject.toml
plugins: xdist-3.8.0, randomly-4.0.1
16 workers [1 item]
.                                                                        [100%]
============================== 1 passed in 0.93s ===============================
$ uv run pytest tests/test_compat.py
      Built click @ file:///home/ubuntu/src/click
Uninstalled 1 package in 0.25ms
Installed 1 package in 1ms
============================= test session starts ==============================
platform linux -- Python 3.12.3, pytest-9.0.1, pluggy-1.6.0
Using --randomly-seed=2447227139
rootdir: /home/ubuntu/src/click
configfile: pyproject.toml
plugins: xdist-3.8.0, randomly-4.0.1
4 workers [1 item]
.                                                                        [100%]
============================== 1 passed in 0.47s ===============================
$ uv run pytest tests/test_compat.py
      Built click @ file:///home/ubuntu/src/click
Uninstalled 1 package in 0.30ms
Installed 1 package in 1ms
============================= test session starts ==============================
platform linux -- Python 3.12.3, pytest-9.0.1, pluggy-1.6.0
Using --randomly-seed=3619139103
rootdir: /home/ubuntu/src/click
configfile: pyproject.toml
plugins: xdist-3.8.0, randomly-4.0.1
1 worker [1 item]
.                                                                        [100%]
============================== 1 passed in 0.33s ===============================
$ uv run pytest tests/test_compat.py
      Built click @ file:///home/ubuntu/src/click
Uninstalled 1 package in 0.24ms
Installed 1 package in 1ms
============================= test session starts ==============================
platform linux -- Python 3.12.3, pytest-9.0.1, pluggy-1.6.0
Using --randomly-seed=2375080509
rootdir: /home/ubuntu/src/click
configfile: pyproject.toml
plugins: xdist-3.8.0, randomly-4.0.1
collected 1 item

tests/test_compat.py .                                                   [100%]

============================== 1 passed in 0.02s ===============================

kdeldycke · 2025-11-19T11:11:58Z

Does this PR with only #3139 on top of stable improve the situation compared to stable?

No, not in the results I got. I was under the impression that running tests in parallel would uncover the issues you were discussing in #3139 and #3140 with @getzze and @neutrinoceros. But it looks completely unrelated.

Here the issues are with the pager:

  FAILED tests/test_utils.py::test_echo_via_pager[test9-cat ] - AssertionError:...
  FAILED tests/test_utils.py::test_echo_via_pager[test4-cat ] - AssertionError:...
  FAILED tests/test_utils.py::test_echo_via_pager[test0-less] - AssertionError:...
  FAILED tests/test_utils.py::test_echo_via_pager[test2- less ] - AssertionErro...
  FAILED tests/test_utils.py::test_echo_via_pager[test2- less] - AssertionError...
  FAILED tests/test_utils.py::test_echo_via_pager[test2- cat ] - AssertionError...
  FAILED tests/test_utils.py::test_echo_via_pager[test0-cat ] - AssertionError:...
  FAILED tests/test_utils.py::test_echo_via_pager[test9-less] - AssertionError:...
  FAILED tests/test_utils.py::test_echo_via_pager[test2-less] - AssertionError:...
  FAILED tests/test_utils.py::test_echo_via_pager[test8-cat ] - AssertionError:...

And the failing tests are completely random depending on the order they're executed (from 6 to 47 failing tests on my machine). That's why I was thinking that maybe a solution might be in the direction of #1572.

So I guess you can continue working on #3139 and #3140 independently of this PR.

pjonsson · 2025-11-19T12:28:03Z

No, not in the results I got. I was under the impression that running tests in parallel would uncover the issues you were discussing in #3139 and #3140

Well, #3139 does contain a test that I took from one of the referenced tickets, and that test triggers ValueError: I/O operation on closed file reliably for me when I run pytest with the right parameters even without parallelism. I think the perceived randomness with the "file" closing issue (buffers in that case) stems from finalization order of objects, and there are usually not a whole lot of guarantees from garbage collectors when it comes to finalization.

Don't take my comment as discouragement from what you're trying to achieve here, I'm just stating that my goal with #3139 was the incredibly narrowly scoped "fix the immediate Click issue I'm suffering from". I would guess that not having multiple objects trying to close the same buffers in their finalizers would also be beneficial here, but I don't know the Click code base so take my guesses for what they are worth.

neutrinoceros · 2025-11-19T15:46:36Z

And the failing tests are completely random depending on the order they're executed

sounds like a job for https://pypi.org/project/detect-test-pollution/ (just passing by, ignore me if that's not actually relevant)

davidism · 2025-11-19T16:08:39Z

I'd prefer not adding these as a regular test, it introduces a lot of complexity and uncertainty. Maybe just use it to clean up tests here and then remove again.

The tests already run very fast, and I'd prefer to speed them up directly if possible rather than by using multiprocessing.

neutrinoceros · 2025-11-19T16:16:16Z

Oh that's definitely not a tool you want to run regularly in CI. It's really meant to help for local debugging only.

Edit: hidden as off-topic, I thought David was talking about detect-test-pollution but on second thought that's probably not what he meant

davidism · 2025-11-19T16:36:01Z

No that's fine, I was referring to all three plugins.

kdeldycke · 2026-02-28T09:57:13Z

So it turns out that these randomized tests did not helped in fixing any of these issues:

The issues above are fixed by #3139, and stress-tests helped in figuring things out. Not randomized tests as this PR was supposed to.

kdeldycke · 2026-02-28T10:00:28Z

Still! These randomized tests help uncovered an issue in another, completely different area from our initial investigation. So we have test_echo_via_pager that is polluting tests and is leaky. This PR now fix that.

@davidism, given this new context, are you still against adding pytest-randomly and pytest-xdist dependencies on tests?

davidism · 2026-03-01T17:04:15Z

The problem is that whether it uncovers something or not is completely random so if we add this to CI unrelated PR might fail CI for no apparent reason to the author.

If you want to add a specific tox environment for this fine, but it should not run by default locally or in CI.

kdeldycke · 2026-03-01T17:07:41Z

The problem is that whether it uncovers something or not is completely random so if we add this to CI unrelated PR might fail CI for no apparent reason to the author.

I'm fine with that policy. I will rework this PR in a couple of days then.

kdeldycke · 2026-03-02T05:40:15Z

I moved the fix for the leaky test to its own PR at: #3238

That way we can benefits from it right away while I keep working on this PR.

kdeldycke · 2026-03-02T05:59:07Z

If you want to add a specific tox environment for this fine, but it should not run by default locally or in CI.

Ok so I just added a new [tool.tox.env.random] tox environment, to run this aspect of testing independently, and not in the regular CI invokation. This use the same pattern as the stress test suite I added in #3139 .

Now I would like to add some minimal documentation somewhere on how to run these tests. @Rowlando13 what's your opinion? Should we have a section somewhere in the Sphinx docs? I would like to document how to run the stress tests, the randomized tests, and the application smoke tests you implemented for Flask (#3177). So I can point contributors to them. Maybe not in docs/testing.md as this is for Click's users to cover how they test their own Click CLI.

Rowlando13 · 2026-03-05T23:38:21Z

You can add a new section for contributing. If it is standard stuff it should be added to pallets contributing.

kdeldycke · 2026-03-06T06:17:19Z

You can add a new section for contributing. If it is standard stuff it should be added to pallets contributing.

Just documented the optional test suites at: https://github.com/pallets/click/pull/3151/changes#diff-419d79ae7d440d49b3fa5fa7739f5f3a359e3ca42c3fa942e5604ae3474eb35b

Document optional test suites

kdeldycke changed the base branch from main to stable November 19, 2025 06:12

kdeldycke added this to the 8.3.2 milestone Nov 19, 2025

kdeldycke mentioned this pull request Nov 19, 2025

Fix CliRunner error with logging message #3140

Closed

kdeldycke marked this pull request as draft November 19, 2025 06:13

kdeldycke added dependencies Pull requests that update a dependency file f:test runner feature: cli test runner labels Nov 19, 2025

kdeldycke mentioned this pull request Nov 19, 2025

NamedTextIOWrapper: stop closing buffer #3139

Merged

kdeldycke changed the title ~~WIP: Randomize tests and run them in parallel~~ WIP: Randomized and parallel tests Nov 19, 2025

kdeldycke added the bug label Feb 28, 2026

kdeldycke force-pushed the pytest-randomly-xdist branch from 2f81d8e to c5b0fc6 Compare February 28, 2026 09:56

kdeldycke marked this pull request as ready for review February 28, 2026 10:01

kdeldycke changed the title ~~WIP: Randomized and parallel tests~~ Randomized and parallel tests + fix leaky test_echo_via_pager Feb 28, 2026

kdeldycke force-pushed the pytest-randomly-xdist branch from c5b0fc6 to 57ce42d Compare March 1, 2026 16:31

kdeldycke marked this pull request as draft March 1, 2026 17:08

kdeldycke mentioned this pull request Mar 2, 2026

Fix pager test leaks #3238

Open

kdeldycke changed the title ~~Randomized and parallel tests + fix leaky test_echo_via_pager~~ Randomized and parallel tests Mar 2, 2026

kdeldycke force-pushed the pytest-randomly-xdist branch 2 times, most recently from 65282cd to 33d6cc1 Compare March 2, 2026 05:45

kdeldycke force-pushed the pytest-randomly-xdist branch from 0521fe0 to e1f237e Compare March 2, 2026 06:20

kdeldycke force-pushed the pytest-randomly-xdist branch from e1f237e to a535f3f Compare March 6, 2026 06:16

kdeldycke marked this pull request as ready for review March 6, 2026 06:17

Randomize tests and run them in parallel

692c645

Document optional test suites

kdeldycke force-pushed the pytest-randomly-xdist branch from cae594d to 692c645 Compare March 6, 2026 06:21

Uh oh!

Conversation

kdeldycke commented Nov 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Rowlando13 commented Nov 19, 2025

Uh oh!

kdeldycke commented Nov 19, 2025

Uh oh!

kdeldycke commented Nov 19, 2025

Uh oh!

pjonsson commented Nov 19, 2025

Uh oh!

pjonsson commented Nov 19, 2025

Uh oh!

kdeldycke commented Nov 19, 2025

Uh oh!

pjonsson commented Nov 19, 2025

Uh oh!

neutrinoceros commented Nov 19, 2025

Uh oh!

davidism commented Nov 19, 2025

Uh oh!

neutrinoceros commented Nov 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

davidism commented Nov 19, 2025

Uh oh!

kdeldycke commented Feb 28, 2026

Uh oh!

kdeldycke commented Feb 28, 2026

Uh oh!

davidism commented Mar 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kdeldycke commented Mar 1, 2026

Uh oh!

kdeldycke commented Mar 2, 2026

Uh oh!

kdeldycke commented Mar 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Rowlando13 commented Mar 5, 2026

Uh oh!

kdeldycke commented Mar 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

kdeldycke commented Nov 19, 2025 •

edited

Loading

neutrinoceros commented Nov 19, 2025 •

edited

Loading

davidism commented Mar 1, 2026 •

edited

Loading

kdeldycke commented Mar 2, 2026 •

edited

Loading