Skip to content

Conversation

@AndrejPer
Copy link
Contributor

For the two files needed for decoder training, added the option of setting the device to 'mps' besides 'coda' and 'cpu'.

@ajjimeno
Copy link
Member

@AndrejPer The code looks good, thanks! I would like to check with you that the performance with mps is good, I have seen issues with transformers and mps.

@AndrejPer
Copy link
Contributor Author

@ajjimeno The code can run, the training is going. It is quite slow on M3 Pro, the first 10 iterations need around 200-250h. Haven't finished that part, so cannot say if it is completely okay, but for now it is training at least.

@ajjimeno
Copy link
Member

ajjimeno commented Jun 17, 2024 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants