Issue/170：GPTQ算子的CPU平台重构 #181

xgqdut2016 · 2025-04-21T08:13:47Z

对于a = torch.randn,b=1e-3 * torch.randn也能通过测试，但是由于CPU获取scale,zero涉及过多矩阵计算，测试非常慢，目前对于7B矩阵的测试，当group_size=-1的时候，需要3h，CUDA平台适配的是marlin，目前也能通过测试，国产芯片是arm架构，不支持immintrin.h，必须注释掉CPU平台和immintrin.h相关的函数和头文件才能编译成功
CUDA的性能如下图所示：

xgqdut2016 added 类型：重构模块：算子 labels Apr 25, 2025

xgqdut2016 force-pushed the issue/170 branch from ea9a650 to 7259593 Compare April 30, 2025 08:02

xgqdut2016 changed the base branch from marlin to main April 30, 2025 08:04

xgqdut2016 force-pushed the issue/170 branch 3 times, most recently from 38f7ad4 to c7f8aa6 Compare April 30, 2025 08:22

xgqdut2016 linked an issue Apr 30, 2025 that may be closed by this pull request

[DEV] GPTQ算子 - CPU平台 #170

Open

xgqdut2016 force-pushed the issue/170 branch 2 times, most recently from 318d48e to 2c512d5 Compare May 15, 2025 06:30

xgqdut2016 added the 准备好了 label May 15, 2025

xgqdut2016 requested a review from PanZezhong1725 May 16, 2025 01:36

xgqdut2016 added 准备好了 and removed 准备好了 labels May 19, 2025

PanZezhong1725 requested a review from YdrMaster May 19, 2025 09:32

xgqdut2016 force-pushed the issue/170 branch 2 times, most recently from 4a4fad1 to d605520 Compare May 29, 2025 02:58

xgqdut2016 added 5 commits August 5, 2025 16:36

issue/170: quantize_gptq

a5d1924

issue/170: add signed quant

096233a

issue/170: modified pack py

8c2b7ec

issue/170: debug marlin

c909d0e

issue/170: success marlin

830daeb

xgqdut2016 force-pushed the issue/170 branch from 873fcb2 to 830daeb Compare August 5, 2025 08:37

xgqdut2016 added 3 commits August 7, 2025 15:56

issue/170: error

d829fc1

issue/170: success register

1ed1a25

issue/170: success marlin, workspace=0

757bbeb

PanZezhong1725 force-pushed the main branch from 7300e69 to 37c76a9 Compare October 22, 2025 02:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Issue/170：GPTQ算子的CPU平台重构 #181

Issue/170：GPTQ算子的CPU平台重构 #181

Uh oh!

xgqdut2016 commented Apr 21, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Issue/170：GPTQ算子的CPU平台重构 #181

Are you sure you want to change the base?

Issue/170：GPTQ算子的CPU平台重构 #181

Uh oh!

Conversation

xgqdut2016 commented Apr 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

xgqdut2016 commented Apr 21, 2025 •

edited

Loading