[tests] refactor caching tests. by sayakpaul · Pull Request #13235 · huggingface/diffusers

sayakpaul · 2026-03-09T13:57:58Z

What does this PR do?

Refactor MagCache tests.
Include TaylorSeer in our model-level caching test mixin.
Consider removing caching-related mixins from test_pipelines_common.py.

sayakpaul · 2026-03-10T03:28:28Z

@DN6 LMK your thoughts on removing the caching-related stuff we have in test_pipelines_common.py. IMO, they're not adding much value given we have model-level caching testers for all the caching methods we support.

sayakpaul · 2026-05-11T06:49:22Z

@DN6 a gentle ping.

HuggingFaceDocBuilderDev · 2026-05-11T07:02:16Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

dg845

Thanks for the PR! I think we should consider retaining the original MagCache tests in tests/hooks/test_mag_cache.py in some form, since if I understand correctly those tests are testing if the MagCache implementation is logically correct, while MagCacheTesterMixin is testing whether MagCache works with a given model.

sayakpaul · 2026-05-31T07:41:50Z

@dg845 thanks! I added back the MagCache tests and migrated to pytest for consistency. I initially removed them in the interest of consistency (all caching methods don't have them).

LMK your thoughts on "Consider removing caching-related mixins from test_pipelines_common.py" as well.

sayakpaul · 2026-05-31T08:41:39Z

I also realized that the hook testing wasn't ever a part of our CI. Opened a PR #13848

dg845 · 2026-06-02T00:26:48Z

+    """
+
+    @torch.no_grad()
+    def _test_cache_inference(self):


It looks like FirstBlockCacheTesterMixin._test_cache_inference and TaylorSeerCacheTesterMixin._test_cache_inference are identical except for the cache context. Would it be possible to define a _test_cache_inference_with_context method in CacheTesterMixin that takes in the context name as an argument? Maybe something like

@torch.no_grad() def _test_cache_inference_with_context(self, context_name: str): ...

Feel free to do that in a follow-up. This PR is reserved for purely migration.

dg845 · 2026-06-02T00:30:26Z

+        )
+
+    @torch.no_grad()
+    def _test_reset_stateful_cache(self):


Similarly to #13235 (comment), I think we may be able to define a context-aware version in CacheTesterMixin:

@torch.no_grad() def _test_reset_stateful_cache_with_context(self, context_name: str): ... with model.cache_context(context_name): _ = model(**inputs_dict, return_dict=False)[0] ...

and then reuse it for both FirstBlockCacheTesterMixin and TaylorSeerCacheTesterMixin.

Same as above. If the method is fairly general, feel free to open a follow-up PR.

dg845

Thanks! Left some small comments.

dg845 · 2026-06-02T01:26:40Z

With respect to the caching mixins in test_pipelines_common.py, my current thoughts are that we probably should have pipeline-level cache testing mixins because cache techniques generally maintain state across different model forward passes within a denoising loop, and vary the model behavior on subsequent timesteps based on that state. For example, MagCacheState maintains an accumulated_steps counter that, if it reaches MagCacheConfig.max_skip_steps, will cause the next model forward pass to be fully computed.

So I think ideally we would have testing on three levels, following MagCache:

Hook tests (tests/hooks/test_mag_cache.py): test if the hook is working and logically correct in isolation (kind of analogous to unit tests)
Model tests (tests/models/testing_utils/cache.py::MagCacheTesterMixin): tests compatibility with a given model (integration tests at the model level)
Pipeline tests (tests/pipelines/test_pipelines_common.py::MagCacheTesterMixin): tests compatibility with a given pipeline (integration tests at the pipeline level)

In practice, I'm not sure what the right way to separate model tests and pipeline tests is. I think (but am not sure) that model compatibility does not necessarily imply pipeline compatibility, so pipeline-level tests are not necessarily redundant. For example, I think that testing output quality as done by test_mag_cache_inference in the pipeline-level MagCacheTesterMixin makes sense because this isn't something a model-level test can capture.

sayakpaul · 2026-06-02T01:52:18Z

Fair points esepcially because:

because cache techniques generally maintain state across different model forward passes within a denoising loop, and vary the model behavior on subsequent timesteps based on that state.

In practice, I'm not sure what the right way to separate model tests and pipeline tests is. I think (but am not sure) that model compatibility does not necessarily imply pipeline compatibility, so pipeline-level tests are not necessarily redundant. For example, I think that testing output quality as done by test_mag_cache_inference in the pipeline-level MagCacheTesterMixin makes sense because this isn't something a model-level test can capture.

I think a good boundary to consider when having both or just model-level tests is if the underlying feature depends on the specifics coming from pipelines. I think quantization is one of those cases. It doesn't matter if it comes from loading a pipeline or a model.

sayakpaul added 3 commits March 9, 2026 19:26

refactor magcache tests.

9cd3e6b

include taylorseer in the caching mixin.

9239908

Merge branch 'main' into refactor-caching-tests

81aa432

sayakpaul marked this pull request as ready for review March 10, 2026 03:27

sayakpaul changed the title ~~[wip] [tests] refactor caching tests.~~ [tests] refactor caching tests. Mar 10, 2026

sayakpaul requested a review from DN6 March 10, 2026 03:27

sayakpaul added 2 commits March 10, 2026 09:25

Merge branch 'main' into refactor-caching-tests

1e6578b

Merge branch 'main' into refactor-caching-tests

4b11663

github-actions Bot added tests size/L PR with diff > 200 LOC labels Apr 17, 2026

Merge branch 'main' into refactor-caching-tests

2a86400

github-actions Bot added size/L PR with diff > 200 LOC and removed size/L PR with diff > 200 LOC labels May 1, 2026

Merge branch 'main' into refactor-caching-tests

3bcf5f0

up

ce7f2e7

github-actions Bot added the models label May 11, 2026

Merge branch 'main' into refactor-caching-tests

f859c01

github-actions Bot removed the models label May 20, 2026

Merge branch 'main' into refactor-caching-tests

444b3a7

sayakpaul requested a review from dg845 May 27, 2026 02:53

Merge branch 'main' into refactor-caching-tests

d4fda40

dg845 reviewed May 30, 2026

View reviewed changes

add back magcache and migrate to pytest

f2a2313

sayakpaul requested a review from dg845 May 31, 2026 07:41

Merge branch 'main' into refactor-caching-tests

129f6f1

dg845 reviewed Jun 2, 2026

View reviewed changes

dg845 approved these changes Jun 2, 2026

View reviewed changes

Merge branch 'main' into refactor-caching-tests

e2b6714

sayakpaul merged commit ed07118 into main Jun 2, 2026
14 checks passed

sayakpaul deleted the refactor-caching-tests branch June 2, 2026 02:14

Conversation

sayakpaul commented Mar 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

sayakpaul commented Mar 10, 2026

Uh oh!

sayakpaul commented May 11, 2026

Uh oh!

HuggingFaceDocBuilderDev commented May 11, 2026

Uh oh!

dg845 left a comment

Choose a reason for hiding this comment

Uh oh!

sayakpaul commented May 31, 2026

Uh oh!

sayakpaul commented May 31, 2026

Uh oh!

dg845 Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

sayakpaul Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

dg845 Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

sayakpaul Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

dg845 left a comment

Choose a reason for hiding this comment

Uh oh!

dg845 commented Jun 2, 2026

Uh oh!

sayakpaul commented Jun 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sayakpaul commented Mar 9, 2026 •

edited

Loading