Description
Describe the feature you'd like
Please either default the RepackModelStep to an instance type with more memory, or make its instance type configurable. This step is currently added automatically to the CreateModel step whenever custom code is provided, but its instance type cannot be configured.
How would this feature be used? Please describe.
For large language models, model artifacts can be larger than 30MB. A configurable instance type would let us provide custom code for large model artifacts while still using the automated repackaging step.
Describe alternatives you've considered
The only alternative is to manually package the inference and loading code directly into the model archive, or to implement the packaging logic ourselves.
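For reference, a minimal sketch of what that manual workaround could look like, using only the Python standard library. The function name `repack_model` and its arguments are hypothetical, and the `code/` prefix is an assumption about the layout the serving container expects; this is not the SDK's own repacking logic.

```python
import tarfile
from pathlib import Path


def repack_model(model_archive: str, code_dir: str, output_archive: str) -> None:
    """Copy all members of model_archive into output_archive and add code_dir under code/."""
    with tarfile.open(model_archive, "r:gz") as src, \
            tarfile.open(output_archive, "w:gz") as dst:
        # Stream members one at a time so the full artifact is never held in memory.
        for member in src:
            fileobj = src.extractfile(member) if member.isfile() else None
            dst.addfile(member, fileobj)
        # Assumption: the container looks for custom inference code under code/.
        for path in sorted(Path(code_dir).rglob("*")):
            if path.is_file():
                arcname = f"code/{path.relative_to(code_dir).as_posix()}"
                dst.add(str(path), arcname=arcname)
```

This would have to run somewhere with enough disk for both archives, and the resulting archive would then be uploaded and passed as the model data, which is the manual overhead this request is trying to avoid.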
Additional context
Code Link