Fix _init_weights to safely skip int8 tensors in Qwen2_5_VL model #41490
base: main
Conversation
cc @MekkCyber for quantization!
MekkCyber left a comment
Thanks @KaparthyReddy! I think this is the wrong commit.
Thanks for the feedback! I’ve updated the PR to modify the correct file under src/transformers/models/qwen2_5_vl/modeling_qwen2_5_vl.py. _init_weights now safely skips int8 tensors while initializing float tensors correctly.
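For reference, a minimal sketch of the kind of guard described here (the `std` value and the module types checked are illustrative, not the exact diff from this PR):

```python
import torch.nn as nn

def _init_weights(self, module):
    std = 0.02  # illustrative; the real method reads config.initializer_range
    if isinstance(module, nn.Linear):
        # normal_() has no int8 kernel and raises a RuntimeError, so only
        # touch floating-point weights (float16, float32, bfloat16).
        if module.weight.dtype.is_floating_point:
            module.weight.data.normal_(mean=0.0, std=std)
        if module.bias is not None and module.bias.dtype.is_floating_point:
            module.bias.data.zero_()
```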
…Reddy/hf-transformers-contributions into fix-llmcompressor-error
[For maintainers] Suggested jobs to run (before merge) run-slow: qwen2_5_vl
ArthurZucker left a comment
Hey, sorry, we merged a big refactor in #41580!
Summary

This PR fixes the `_init_weights()` method in `Qwen2_5_VLForConditionalGeneration` to safely skip int8 tensors during initialization. Previously, applying `normal_()` on int8 weights caused a RuntimeError when loading quantized models.

Changes

- Updated `_init_weights()` to initialize only floating-point tensors (`float16`, `float32`, `bfloat16`).
Motivation

Quantized models (W8A8, int8 weights) could not be loaded directly due to the previous `_init_weights()` implementation. This fix allows them to load without a RuntimeError, making the model compatible with LLMCompressor-quantized models.
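As a hedged usage sketch (the checkpoint path is a placeholder, not a real repository), loading such a model would look like:

```python
# Hypothetical usage sketch; the checkpoint path is a placeholder.
from transformers import Qwen2_5_VLForConditionalGeneration

model = Qwen2_5_VLForConditionalGeneration.from_pretrained(
    "path/to/w8a8-quantized-qwen2.5-vl"  # placeholder path
)
# With the guarded _init_weights, loading no longer raises a RuntimeError
# from normal_() being applied to int8 weight tensors.
```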
Verification

- `_init_weights()` safely ignores int8 tensors; the sketch below reproduces the failure and the skip.
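A minimal repro sketch of the failure mode and the guard, with illustrative tensor shapes (not code from the PR):

```python
import torch

w = torch.zeros(4, 4, dtype=torch.int8)  # stand-in for a quantized weight

# normal_() has no int8 kernel, which reproduces the original error:
try:
    w.normal_(mean=0.0, std=0.02)
except RuntimeError as e:
    print("int8 init fails as expected:", e)

# The dtype guard simply skips the tensor, leaving it unchanged:
if w.dtype.is_floating_point:
    w.normal_(mean=0.0, std=0.02)
print(w.dtype)  # torch.int8
```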