[wip] llama 4 scout expert quant #82

vkuzo · 2025-11-07T01:38:13Z

Summary:

Requires pytorch/ao#3303
Requires https://www.internalfb.com/phabricator/paste/view/P2028176312
Requires huggingface/transformers#41894

time CUDA_LAUNCH_BLOCKING=0 with-proxy python quantize_hf_model_with_torchao.py --model_name "meta-llama/Llama-4-Scout-17B-16E-Instruct" --save_model_to_disk True --device_map "auto" --ffn_only_llama_4_scout True

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

vkuzo force-pushed the 20251106_llama4_expert_quant branch from 4d43646 to 0ae5120 Compare November 7, 2025 12:06

[wip] llama 4 scout expert quant

f0be3c9

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

vkuzo force-pushed the 20251106_llama4_expert_quant branch from 0ae5120 to f0be3c9 Compare November 7, 2025 14:47

vkuzo merged commit 1b1cd42 into main Nov 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[wip] llama 4 scout expert quant #82

[wip] llama 4 scout expert quant #82

vkuzo commented Nov 7, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[wip] llama 4 scout expert quant #82

[wip] llama 4 scout expert quant #82

Conversation

vkuzo commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

vkuzo commented Nov 7, 2025 •

edited

Loading