
Conversation


@xin3he xin3he commented Nov 7, 2023

Type of Change

feature

Description

Hugging Face GPTQ models are typically compressed with the popular repo https://github.com/qwopqwop200/GPTQ-for-LLaMa.
This PR adds an argument, get_compressed_model(use_hf_format=True), to align our exported format with that repo's.

The main changes for this argument are as follows:

1. compression_dim: weight = 1, zeros = 0, and both are transposed.
2. zeros -= 1 before compression, to match the reference repo's packing convention (the stored zeros are offset by -1 and the offset is undone at dequantization).
3. g_idx: store the same group index for every channel in a group instead of recording the channel order.
4. Parameter names changed, e.g. 'packed_weight' -> 'qweight'.
5. zeros are always stored, even for sym quantization.
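The g_idx and zeros conventions in points 2 and 3 can be sketched as follows. This is a minimal illustration with assumed shapes and an assumed example zero point, not the PR's actual packing code:

```python
import numpy as np

# Assumed example dimensions for illustration only.
in_features = 512
group_size = 128

# Point 3: in the HF/GPTQ-for-LLaMa format, g_idx maps each input channel
# to its group index (0, 0, ..., 0, 1, 1, ...), rather than recording a
# per-channel reordering.
g_idx = np.arange(in_features) // group_size

# Point 2: zeros are stored with a -1 offset before packing; the offset is
# added back at dequantization, so the round trip is lossless.
zero_point = 8                      # hypothetical 4-bit zero point
stored_zero = zero_point - 1        # what lands in qzeros
restored_zero = stored_zero + 1     # recovered at dequantization
```

With group_size = 128, channels 0..127 share group index 0, channels 128..255 share group index 1, and so on; the stored zero round-trips back to the original value.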

Expected Behavior & Potential Risk

Unit tests pass.

How has this PR been tested?

Tested locally.

Dependency Change?

N/A

xin3he added 21 commits November 7, 2023 21:58
Signed-off-by: Xin He <[email protected]>
@chensuyue chensuyue added the enhancement New feature or request label Nov 14, 2023
@chensuyue chensuyue merged commit 5179da1 into master Nov 17, 2023
@chensuyue chensuyue deleted the xinhe/hf_format branch November 17, 2023 06:27
@xin3he xin3he changed the title add use_HF_format for export_compressed_model add use_hf_format for export_compressed_model Nov 22, 2023
