This repository was archived by the owner on Sep 23, 2025. It is now read-only.

Conversation


@harborn harborn commented Jul 5, 2024

  1. Fix the default tokenize function to enable padding.
  2. Instantiate the training arguments before loading the dataset and tokenizer, to work around a DeepSpeed training bug.
  3. Enable more training arguments on HPU, such as attn_softmax_bf16, use_flash_attention, and pipelining_fwd_bwd.
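The first fix concerns padding in the tokenize step: batched training requires every example in a batch to have the same length, so the tokenize function must pad shorter sequences. A minimal, self-contained sketch of that idea (all names here — `tokenize_fn`, `PAD_ID`, `MAX_LEN` — are illustrative, not taken from the PR's actual code):

```python
# Hypothetical sketch of a padding tokenize function, as fix (1) describes.
# Real fine-tuning code would call a tokenizer with padding="max_length";
# this stands in for that behavior with plain lists of token ids.

PAD_ID = 0   # assumed pad token id
MAX_LEN = 8  # assumed fixed sequence length

def tokenize_fn(token_ids, max_len=MAX_LEN, pad_id=PAD_ID):
    """Truncate, then right-pad a token-id list so every example in a
    batch has the same length; the attention mask marks real tokens."""
    ids = token_ids[:max_len]
    attention_mask = [1] * len(ids) + [0] * (max_len - len(ids))
    ids = ids + [pad_id] * (max_len - len(ids))
    return {"input_ids": ids, "attention_mask": attention_mask}

example = tokenize_fn([5, 6, 7])
print(example)
# → {'input_ids': [5, 6, 7, 0, 0, 0, 0, 0],
#    'attention_mask': [1, 1, 1, 0, 0, 0, 0, 0]}
```

Without such padding, variable-length examples cannot be stacked into a single tensor, which is the kind of failure fix (1) addresses.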

@harborn harborn force-pushed the fix-fine-tuning-bugs branch from e1b0418 to fd4d31f on July 17, 2024 03:00