
Conversation

@eustlb (Contributor) commented Nov 7, 2025

What does this PR do?

Before a deeper cleanup of X-codec (#42039), and to break things down a bit, this PR fixes some inefficient logic in X-codec that was responsible for unnecessary memory spikes.

This makes me think that the whole _get_output_length computation when applying conv1d layers, a pattern repeated throughout audio models, could benefit from some standardization. To that end, I've added a new audio util that I'll propagate in a subsequent PR.
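For reference, such a util boils down to PyTorch's standard Conv1d length formula. A minimal sketch of what it could look like, matching the conv1d_output_length(layer, input_length) call used below and assuming integer padding (the actual helper added in this PR may differ in details):

import torch.nn as nn

def conv1d_output_length(layer: nn.Conv1d, input_length: int) -> int:
    # Standard PyTorch Conv1d formula:
    # L_out = floor((L_in + 2*padding - dilation*(kernel_size - 1) - 1) / stride + 1)
    kernel_size, stride = layer.kernel_size[0], layer.stride[0]
    padding, dilation = layer.padding[0], layer.dilation[0]
    return (input_length + 2 * padding - dilation * (kernel_size - 1) - 1) // stride + 1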

@eustlb eustlb requested a review from Cyrilvallez November 7, 2025 16:03
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@eustlb eustlb requested a review from ArthurZucker November 10, 2025 09:48
Comment on lines +399 to +431
@lru_cache
def _get_conv1d_layers(self, module):
    """
    Recursively iterate to fetch all Conv1d layers, in the order they appear in the module tree.
    """

    def get_conv1d_layers_recursive(module: nn.Module):
        conv1d_layers = []

        if isinstance(module, nn.Conv1d):
            conv1d_layers.append(module)

        # Recursively check all child modules
        for child in module.children():
            conv1d_layers.extend(get_conv1d_layers_recursive(child))

        return conv1d_layers

    # Return a tuple so the result is hashable and can be memoized by lru_cache
    return tuple(get_conv1d_layers_recursive(module))

def _get_conv1d_output_lengths(self, input_length, module=None):
    """
    For a given module, compute the output length obtained after applying all of its Conv1d layers.
    """
    if module is None:
        module = self

    conv1d_layers = self._get_conv1d_layers(module)

    # Chain the per-layer length formula through each Conv1d in forward order
    for layer in conv1d_layers:
        input_length = conv1d_output_length(layer, input_length)

    return input_length
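For illustration, a hypothetical call site (the model and encoder names are illustrative, not the actual X-codec attributes):

# Map one second of 24 kHz audio to the number of frames produced
# by the encoder's Conv1d stack, without running a forward pass.
input_length = 24_000
num_frames = model._get_conv1d_output_lengths(input_length, module=model.encoder)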
Member

Is this something we can do earlier, in the config? It would avoid having to do these recursions on modules etc!

eustlb (Contributor, Author)

Once we standardize this computation (see #41203), we should be able to do it from the config, provided we ensure that every convolution layer's parameters are stored there, in the order in which the layers appear in the forward pass. However, I don't think expanding the config with such parameters is a good idea, especially since we decided to hardcode them to avoid exposing them to the user.

I was thinking more along the lines of handling this directly during model initialization, when the modules are already being iterated over. For now, is it okay to merge this?
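As a rough sketch of that init-time idea (hypothetical, not what this PR ships):

import torch.nn as nn
from transformers import PreTrainedModel

class XcodecModel(PreTrainedModel):
    def __init__(self, config):
        super().__init__(config)
        ...  # build encoder/decoder submodules as usual
        # Cache the Conv1d layers once, while modules are being set up.
        # Assumes registration order matches the forward-pass order.
        self._conv1d_layers = tuple(m for m in self.modules() if isinstance(m, nn.Conv1d))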

@Cyrilvallez (Member) left a comment

Alright, yes it's fine to merge it like that to unblock, since I see you're already working on a refined version! 🤗
CI not happy though it seems!

@github-actions

[For maintainers] Suggested jobs to run (before merge)

run-slow: xcodec

@eustlb eustlb enabled auto-merge (squash) November 25, 2025 14:30
@eustlb eustlb merged commit f13b100 into huggingface:main Nov 25, 2025
23 checks passed