This repository was archived by the owner on Sep 23, 2025. It is now read-only.
[Finetune] Integrate Chat template #178
Open
minmingzhu wants to merge 24 commits into intel:main from minmingzhu:chat_template
Conversation
carsonwang suggested changes on Apr 9, 2024
Thanks for the work! Summarizing the changes to make, as we discussed offline:
- Remove the added `is_base_model` parameter from the finetuning yaml file.
- Allow the user to configure `chat_template` in the yaml file. In most cases, people won't configure it. Priority order: user-configured `chat_template` > model's `chat_template` > our default template.
- Write the default template by following other models' templates (such as llama2 chat), i.e., check the roles in the messages, etc.
- The original data format needs to be converted to chat format first, before applying the chat template.
- Add unit tests that check the result after applying the chat template, covering all use cases.
- Support chat format as a finetuning dataset format, following OpenAI's format. We can support this in a separate PR.
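The priority order and the data conversion described above can be sketched roughly as follows. All names here (`resolve_chat_template`, `to_chat_format`, `DEFAULT_TEMPLATE`, the record keys) are illustrative assumptions, not the PR's actual code:

```python
# Sketch of the template-selection priority discussed above; names are
# illustrative assumptions, not the PR's actual code.

# Minimal fallback template in the spirit of llama2-chat: a Jinja string
# that checks the role of each message, the format Hugging Face tokenizers
# expect for their `chat_template` attribute.
DEFAULT_TEMPLATE = (
    "{% for message in messages %}"
    "{% if message['role'] == 'user' %}[INST] {{ message['content'] }} [/INST]"
    "{% else %}{{ message['content'] }}"
    "{% endif %}"
    "{% endfor %}"
)

def resolve_chat_template(user_template, model_template):
    """Priority: user-configured > model's own > built-in default."""
    return user_template or model_template or DEFAULT_TEMPLATE

def to_chat_format(sample):
    """Convert an instruction/response record to OpenAI-style chat messages,
    the shape a chat template is applied to."""
    return [
        {"role": "user", "content": sample["instruction"]},
        {"role": "assistant", "content": sample["response"]},
    ]
```

With a Hugging Face tokenizer, the resolved string would then typically be assigned to `tokenizer.chat_template` before calling `tokenizer.apply_chat_template(messages, tokenize=False)`.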
Force-pushed: 3e6ccac to 6a0bf63 (Compare)
harborn reviewed on Apr 18, 2024
| Configuration Name | Default Value | Meaning |
|---|---|---|
| lora_config | task_type: CAUSAL_LM<br>r: 8<br>lora_alpha: 32<br>lora_dropout: 0.1 | Passed to the LoraConfig `__init__()` method, then used as the config to build the PEFT model object. |
| deltatuner_config | "algo": "lora"<br>"denas": True<br>"best_model_structure": "/path/to/best_structure_of_deltatuner_model" | Passed to the DeltaTunerArguments `__init__()` method, then used as the config to build the [Deltatuner model](https://github.com/intel/e2eAIOK/tree/main/e2eAIOK/deltatuner) object. |
| enable_gradient_checkpointing | False | Enable gradient checkpointing to save GPU memory, at the cost of extra compute time. |
| chat_template | None | User-defined chat template. |
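For illustration, setting a custom template in the finetuning yaml might look like the fragment below. The template string is a hypothetical Jinja example; only the `chat_template` key itself comes from the parameter table above:

```yaml
# Hypothetical finetuning-config fragment. chat_template takes a Jinja
# template string; when left as None, the model's own template (or the
# built-in default) is used instead.
chat_template: "{% for message in messages %}{{ message['role'] }}: {{ message['content'] }}\n{% endfor %}"
enable_gradient_checkpointing: false
```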
Have you compared the impact of different templates on fine-tuning performance?
not yet
Force-pushed: 42825d3 to 4fa89cc (Compare)
Commit messages:
- 2. modify chat template (Signed-off-by: minmingzhu <[email protected]>)
- 2. add unit test (Signed-off-by: minmingzhu <[email protected]>)
- update; fix blocking; fix setup and getting started; nit; Add dependencies for tests and update pyproject.toml; Update dependencies and test workflow; Update dependencies and fix torch_dist.py; Update OpenAI SDK installation and start ray cluster (Signed-off-by: Wu, Xiaochang <[email protected]>)
- single test; fix hang error (Signed-off-by: minmingzhu <[email protected]>)
- use base model mpt-7b instead of mpt-7b-chat; manual setting: specify tokenizer; update doc/finetune_parameters.md (Signed-off-by: minmingzhu <[email protected]>)
No description provided.